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PREFACE 


My purpose in writing this book was to provide a clear, accessible treatment of discrete 
mathematics for students majoring or minoring in computer science, mathematics, math- 
ematics education, and engineering. The goal of the book is to lay the mathematical foun- 
dation for computer science courses such as data structures, algorithms, relational database 
theory, automata theory and formal languages, compiler design, and cryptography, and for 
mathematics courses such as linear and abstract algebra, combinatorics, probability, logic 
and set theory, and number theory. By combining discussion of theory and practice, I have 
tried to show that mathematics has engaging and important applications as well as being 
interesting and beautiful in its own right. 

A good background in algebra is the only prerequisite; the course may be taken by stu- 
dents either before or after a course in calculus. Previous editions of the book have been 
used successfully by students at hundreds of institutions in North and South America, 
Europe, the Middle East, Asia, and Australia. 

Recent curricular recommendations from the Institute for Electrical and Electronic 
Engineers Computer Society (IEEE-CS) and the Association for Computing Machinery 
(ACM) include discrete mathematics as the largest portion of “core knowledge” for com- 
puter science students and state that students should take at least a one-semester course in 
the subject as part of their first-year studies, with a two-semester course preferred when 
possible. This book includes the topics recommended by those organizations and can be 
used effectively for either a one-semester or a two-semester course. 

At one time, most of the topics in discrete mathematics were taught only to upper-level 
undergraduates. Discovering how to present these topics in ways that can be understood by 
first- and second-year students was the major and most interesting challenge of writing this 
book. The presentation was developed over a long period of experimentation during which 
my students were in many ways my teachers. Their questions, comments, and written work 
showed me what concepts and techniques caused them difficulty, and their reaction to my 
exposition showed me what worked to build their understanding and to encourage their 
interest. Many of the changes in this edition have resulted from continuing interaction with 
students. 


Themes of a Discrete Mathematics Course 


Discrete mathematics describes processes that consist of a sequence of individual steps. 
This contrasts with calculus, which describes processes that change in a continuous fash- 
ion. Whereas the ideas of calculus were fundamental to the science and technology of the 
industrial revolution, the ideas of discrete mathematics underlie the science and technol- 
ogy of the computer age. The main themes of a first course in discrete mathematics are 
logic and proof, induction and recursion, discrete structures, combinatorics and discrete 
probability, algorithms and their analysis, and applications and modeling. 


xiii 
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xiv PREFACE 


Logicand Proof Probably the most important goal of a first course in discrete mathemat- 
ics is to help students develop the ability to think abstractly. This means learning to use 
logically valid forms of argument and avoid common logical errors, appreciating what it 
means to reason from definitions, knowing how to use both direct and indirect arguments 
to derive new results from those already known to be true, and being able to work with 
symbolic representations as if they were concrete objects. 


Induction and Recursion An exciting development of recent years has been the increased 
appreciation for the power and beauty of “recursive thinking.” To think recursively means 
to address a problem by assuming that similar problems of a smaller nature have already 
been solved and figuring out how to put those solutions together to solve the larger prob- 
lem. Such thinking is widely used in the analysis of algorithms, where recurrence relations 
that result from recursive thinking often give rise to formulas that are verified by math- 
ematical induction. 


Discrete Structures Discrete mathematical structures are the abstract structures that de- 
scribe, categorize, and reveal the underlying relationships among discrete mathematical 
objects. Those studied in this book are the sets of integers and rational numbers, general 
sets, Boolean algebras, functions, relations, graphs and trees, formal languages and regular 
expressions, and finite-state automata. 


Combinatorics and Discrete Probability Combinatorics is the mathematics of count- 
ing and arranging objects, and probability is the study of laws concerning the measure- 
ment of random or chance events. Discrete probability focuses on situations involving 
discrete sets of objects, such as finding the likelihood of obtaining a certain number of 
heads when an unbiased coin is tossed a certain number of times. Skill in using combina- 
torics and probability is needed in almost every discipline where mathematics is applied, 
from economics to biology, to computer science, to chemistry and physics, to business 
management. 


Algorithms and Their Analysis The word algorithm was largely unknown in the middle 
of the twentieth century, yet now it is one of the first words encountered in the study of 
computer science. To solve a problem on a computer, it is necessary to find an algorithm, or 
step-by-step sequence of instructions, for the computer to follow. Designing an algorithm 
requires an understanding of the mathematics underlying the problem to be solved. Deter- 
mining whether or not an algorithm is correct requires a sophisticated use of mathematical 
induction. Calculating the amount of time or memory space the algorithm will need in 
order to compare it to other algorithms that produce the same output requires knowledge 
of combinatorics, recurrence relations, functions, and O-, Q-, and @-notations. 


Applications and Modeling Mathematical topics are best understood when they are seen 
in a variety of contexts and used to solve problems in a broad range of applied situations. 
One of the profound lessons of mathematics is that the same mathematical model can be 
used to solve problems in situations that appear superficially to be totally dissimilar. A goal 
of this book is to show students the extraordinary practical utility of some very abstract 
mathematical ideas. 


Special Features of This Book 


International Metric Version This metric version differs from the U.S. version of 
Discrete Mathematics with Applications, Fifth Edition as follows: The units of measurement 
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PREFACE XV 


used in the examples and exercises have been converted from the U.S. Customary System 
(USCS) of units (also referred to as English or Imperial units) to Metric units. The appen- 
dix containing solutions to selected exercises and the instructor’s solutions manual have 
been converted to metric where appropriate. 


Mathematical Reasoning The feature that most distinguishes this book from other dis- 
crete mathematics texts is that it teaches—explicitly but in a way that is accessible to 
first- and second-year college and university students—the unspoken logic and reason- 
ing that underlie mathematical thought. For many years I taught an intensively interactive 
transition-to-abstract-mathematics course to mathematics and computer science majors. 
This experience showed me that while it is possible to teach the majority of students to 
understand and construct straightforward mathematical arguments, the obstacles to doing 
so cannot be passed over lightly. To be successful, a text for such a course must address 
students’ difficulties with logic and language directly and at some length. It must also 
include enough concrete examples and exercises to enable students to develop the mental 
models needed to conceptualize more abstract problems. The treatment of logic and proof 
in this book blends common sense and rigor in a way that explains the essentials, yet avoids 
overloading students with formal detail. 


Spiral Approach to Concept Development A number of concepts in this book appear in 
increasingly more sophisticated forms in successive chapters to help students develop the 
ability to deal effectively with increasing levels of abstraction. For example, by the time 
students encounter the relatively advanced mathematics of Fermat’s little theorem in Sec- 
tion 8.4, they have been introduced to the logic of mathematical discourse in Chapters 1, 
2, and 3, learned the basic methods of proof and the concepts of mod and div in Chapter 
4, explored mod and div as functions in Chapter 7, and become familiar with equivalence 
relations in Sections 8.2 and 8.3. This approach builds in useful review and develops math- 
ematical maturity in natural stages. 


Support for the Student Students at colleges and universities inevitably have to learn a 
great deal on their own. Though it is often frustrating, learning to learn through self-study 
is a crucial step toward eventual success in a professional career. This book has a number 
of features to facilitate students’ transition to independent learning. 


Worked Examples 

The book contains over 500 worked examples, which are written using a problem- 
solution format and are keyed in type and in difficulty to the exercises. Many solutions 
for the proof problems are developed in two stages: first a discussion of how one might 
come to think of the proof or disproof and then a summary of the solution, which is 
enclosed in a box. This format allows students to read the problem and skip imme- 
diately to the summary, if they wish, only going back to the discussion if they have 
trouble understanding the summary. The format also saves time for students who are 
rereading the text in preparation for an examination. 


Marginal Notes and Test Yourself Questions 

Notes about issues of particular importance and cautionary comments to help students 
avoid common mistakes are included in the margins throughout the book. Questions 
designed to focus attention on the main ideas of each section are located between the 
text and the exercises. For convenience, the questions use a fill-in-the-blank format, 
and the answers are found immediately after the exercises. 
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Exercises 

The book contains almost 2600 exercises. The sets at the end of each section have 
been designed so that students with widely varying backgrounds and ability levels will 
find some exercises they can be sure to do successfully and also some exercises that 
will challenge them. 


Solutions for Exercises 

To provide adequate feedback for students between class sessions, Appendix B 
contains at least one, and often several, complete solutions for every type of exer- 
cise in the book. A blue exercise number indicates that there is a solution in Ap- 
pendix B; the letter H is added for a solution that is less than complete. When two 
or more exercises use the same solution strategy, there is a full solution for the first 
and either another full solution or a partial solution for later ones. Exercises with 
several parts often have an answer and/or hint for one or more of the parts to help 
students determine whether they are on track so that they can make adjustments 
if needed. 

Students are strongly urged not to consult solutions until they have tried their best 
to answer questions on their own. Once they have done so, however, comparing their 
answers with those given can lead to significantly improved understanding. There are 
also plenty of exercises without solutions to help students learn to grapple with math- 
ematical problems in a realistic environment. 


Reference Features 

Many students have written me to say that the book helped them succeed in their ad- 
vanced courses. One even wrote that he had used one edition so extensively that it had 
fallen apart, and he actually went out and bought a copy of the next edition, which he 
was continuing to use in a master’s program. Figures and tables are included where 
doing so would help readers to a better understanding. In most, a second color is used 
to highlight meaning. My rationale for screening statements of definitions and theo- 
rems, for putting titles on exercises, and for giving the meanings of symbols and a list 
of reference formulas in the endpapers is to make it easier for students to use this book 
for review in a current course and as a reference in later ones. 


Support for the Instructor I have received a great deal of valuable feedback from in- 
structors who have used previous editions of this book. Many aspects of the book have 
been improved through their suggestions. In addition to the following items, there is ad- 
ditional instructor support on the book’s website, described later in the preface. 


Exercises 

The large variety of exercises at all levels of difficulty allows instructors great free- 
dom to tailor a course to the abilities of their students. Exercises with solutions in 
the back of the book have numbers in blue, and those whose solutions are given 
in a separate Student Solutions Manual and Study Guide have numbers that are a 
multiple of three. There are exercises of every type in the book that have no answer 
in either location so that instructors can assign whatever mixture they prefer of 
exercises with and without answers. The ample number of exercises of all kinds 
gives instructors a significant choice of problems to use for review assignments and 
exams. Instructors are invited to use the many exercises stated as questions rather 
than in “prove that” form to stimulate class discussion on the role of proof and coun- 
terexample in problem solving. 
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Flexible Sections 

Most sections are divided into subsections so that an instructor can choose to cover 
certain subsections only and either omit the rest or leave them for students to study on 
their own. The division into subsections also makes it easier for instructors to break 
up sections if they wish to spend more than one day on them. 


Presentation of Proof Methods 

It is inevitable that most of the proofs and disproofs in this book will seem easy to 
instructors. Many students, however, find them difficult. In showing students how 
to discover and construct proofs and disproofs, I have tried to describe the kinds of 
approaches that mathematicians use when confronting challenging problems in their 
own research. 


Complete Instructor Solutions 
Complete instructor solutions to all exercises are available to anyone teaching a course 
from this book. They are available through the Instructor’s Companion Website. 


Highlights of the Fifth Edition 


The changes made for this edition are based on suggestions from colleagues and other 
long-time users of previous editions, on continuing interactions with my students, and on 
developments within the evolving fields of computer science and mathematics. 


Reorganization 

e In response to instructor requests to move the introduction of certain topics ear- 
lier in the book, Section 1.2 now includes a definition and examples of strings. 
In addition, a new Section 1.4 contains definitions and examples of graphs and 
includes an introduction to graph coloring and the four-color theorem. 

e The handshake theorem and its applications have been moved from Chapter 10 to 
Section 4.9. This gives students an early experience of using direct and indirect 
proof in a novel setting and was made possible because the elements of graph 
theory are now introduced in Chapter 1. 


Improved Pedagogy 

e The exposition has been reexamined throughout and carefully revised as needed. 

e Exercises have been added for topics where students seemed to need addi- 
tional practice, and they have been modified, as needed, to address student 
difficulties. 

e Additional hints and full answers have been incorporated into Appendix B to 
give students more help for difficult topics. 

e The introductory material in Chapter 4 was made more accessible by being di- 
vided into two sections. The first introduces basic concepts about proof and dis- 
proof in the context of elementary number theory, and the second adds examples 
and advice for writing proofs. 


Logic and Applications 

e Discussion was added about the role of bound variables and scope in mathemati- 
cal writing and computer programming. 

e The section on two’s complements was significantly simplified. 

e Language for expressing universal quantifiers was revised to provide a clearer 
basis for the idea of the generic particular in mathematical proof. 

e The material on Boolean algebras was expanded. 
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Proof and Applications 

e A greater variety of examples and exercises for number theory and set theory 
proofs is now included. 

e The directions for writing proofs and the discussion of common mistakes have 
been revised and expanded in response to interaction with students. 

e Discussion of historical background and recent mathematical results has been 
augmented. 

e Material was added on using cryptographic hash functions to secure the trans- 
mission of digital data and on using cryptography to authenticate the sender of a 
transmitted message. 


Induction and Recursion 

e The sections on ordinary and strong mathematical induction were reorganized 
and expanded to increase the emphasis on applications. 

e In the section on recursive definitions, the format used for proofs by structural 
induction was revised to parallel the format used for proofs by ordinary and 
strong mathematical induction. The set of examples and exercises illustrating 
recursive definitions and structural induction was significantly increased. The 
recursive definition for the set of strings over a finite set and for the length of a 
string were revised, and structural induction proofs for fundamental string prop- 
erties are now included. 


Graph Theory and the Analysis of Algorithm Efficiency 

e Instructors who wish to give their students an early experience of graph theory 
can now do so by combining the introduction to graphs in Chapter | with the 
handshake theorem in Chapter 4. 

e There is a new subsection on binary search trees in Chapter 10. 

e The discussion of O-, Q-, and ©-notations was significantly simplified. 

e Many exercises on algorithm efficiency were added or revised to make the con- 
cepts more accessible. 


Student Resources 
The Student Companion Website for this book includes: 


e A general orientation for each chapter 

e Review materials for each chapter 

e Proof tips 

e A link to the author’s personal website, which contains errata information and links 
for interactive animations, tutorials, and other discrete mathematics resources on the 
Internet 


Instructor’s Resources 
login.cengage.com 
The Instructor’s Companion Website for this book contains: 


e Suggestions for how to approach the material of each chapter 
e The Complete Instructor’s Solutions Manual 

e Ideas for projects and writing assignments 

e Review materials to share with students 

e Lecture Note PowerPoint slides 

e Images from the book 
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e A test bank of questions for exams and quizzes 
e Migration guide from 4th to 5th edition 
Additional resources for the book are available at http://condor.depaul.edu/sepp. 


WebAssign 


www.webassign.com 


WebAssign from Cengage Discrete Mathematics with Applications, Fifth Edition, Metric 
Version is an online homework system, which instructors can choose to pair with the 
book. For students, it offers tutorial help in solving exercises, including review of rel- 
evant material, short instructional videos, and instant feedback on how they are doing. 
For instructors, it offers the ability to create customized homework sets, most of which 
are graded automatically and produce results directly into an online grade roster. Real- 
time access to their students’ performance makes it possible for instructors to adjust the 
presentation of material on an ongoing basis. 


Student Solutions Manual and Study Guide 
(ISBN: 978-0-357-03520-7) 


In writing this book, I hoped that the exposition in the text, the worked examples, and the 
exercise solutions would provide all that a student would need to successfully master the 
material of the course. I continue to believe that any student who understands the solutions 
for all the exercises with complete solutions in Appendix B will have achieved an excellent 
command of the subject. Nonetheless, in response to requests for supplementary materials, 
I developed the Student Solutions Manual and Study Guide, available separately from the 
book, which contains complete solutions for all the exercises whose numbers are a multiple 
of 3. The guide also includes alternative explanations for some of the concepts and review 
questions for each chapter. 


Organization 


This book may be used effectively for a one- or two-semester course. Chapters contain 
core sections, sections covering optional mathematical material, and sections covering 
optional applications. Instructors have the flexibility to choose whatever mixture will 
best serve the needs of their students. The following table shows a division of the sections 
into categories. 


Sections Containing Optional | Sections Containing Optional 
Chapter | Core Sections Mathematical Material Computer Science Applications 

1 1.1-1.3 1.4 1.4 

2 2.1-2.3 2.3 2.4, 2.5 

3 3.1-3.4 3:3 3.3 

4 4.1-4.5, 4.7 4.6, 4.8, 4.9 4.10 

5 5.1, 5.2, 5.6, 5.7 5.3, 5.4, 5.8 5.1,.5.5,.5.9 

6 6.1 6.2-6.4 6.1, 6.4 

7 Tels 2 7.3, 74 7.1, 7.2, 7.4 
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8.1-8.3 8.4, 8.5 8.4, 8.5 
9 9.1-9.4 9.5-9.9 9.3 
10 10.1, 10.4 10.2, 10.3, 10.5 10.1, 10.4-10.6 
11 11.1, 11.2 11.4 11.3, 11.5 
12 12.1, 12.2 12.3 12.1-12.3 


The following tree diagram shows, approximately, how the chapters of this book depend 
on each other. Chapters on different branches of the tree are sufficiently independent that 
instructors need to make at most minor adjustments if they skip chapters, or sections of 
chapters, but follow paths along branches of the tree. 

In most cases, covering only the core sections of the chapters is adequate preparation 
for moving down the tree. 
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SPEAKING 
MATHEMATICALLY 


Therefore O students study mathematics and do not build 
without foundations. —Leonardo da Vinci (1452-1519) 


The aim of this book is to introduce you to a mathematical way of thinking that can serve 
you in a wide variety of situations. Often when you start work on a mathematical prob- 
lem, you may have only a vague sense of how to proceed. You may begin by looking at 
examples, drawing pictures, playing around with notation, rereading the problem to focus 
on more of its details, and so forth. The closer you get to a solution, however, the more 
your thinking has to crystallize. And the more you need to understand, the more you need 
language that expresses mathematical ideas clearly, precisely, and unambiguously. 

This chapter will introduce you to some of the special language that is a foundation 
for much mathematical thought, the language of variables, sets, relations, and functions. 
Think of the chapter like the exercises you would do before an important sporting event. 
Its goal is to warm up your mental muscles so that you can do your best. 


ERE Variables 


A variable is sometimes thought of as a mathematical “John Doe” because you can use it 
as a placeholder when you want to talk about something but either (1) you imagine that it 
has one or more values but you don’t know what they are, or (2) you want whatever you 
say about it to be equally true for all elements in a given set, and so you don’t want to be 
restricted to considering only a particular, concrete value for it. To illustrate the first use, 
consider asking 


Is there a number with the following property: doubling it and adding 3 
gives the same result as squaring it? 


In this sentence you can introduce a variable to replace the potentially ambiguous 
word “it”: 


Is there a number x with the property that 2x +3 = x 


The advantage of using a variable is that it allows you to give a temporary name to what 
you are seeking so that you can perform concrete computations with it to help discover its 
possible values. To emphasize the role of the variable as a placeholder, you might write the 
following: 


Is there a number LI with the property that 2-U+3 = 29 


The emptiness of the box can help you imagine filling it in with a variety of different val- 
ues, some of which might make the two sides equal and others of which might not. 
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In this sense, a variable in a computer program is similar to a mathematical variable 
because it creates a location in computer memory (either actual or virtual) into which 
values can be placed. 

To illustrate the second use of variables, consider the statement 


No matter what number might be chosen, if it is greater than 2, then its 
square is greater than 4. 


In this case introducing a variable to give a temporary name to the (arbitrary) number you 
might choose enables you to maintain the generality of the statement, and replacing all in- 
stances of the word “it” by the name of the variable ensures that possible ambiguity is avoided: 


No matter what number n might be chosen, if n is greater than 2, then 
nis greater than 4. 


CANSAN] Writing Sentences Using Variables 


Use variables to rewrite the following sentences more formally. 


a. Are there numbers with the property that the sum of their squares equals the square of 
their sum? 


b. Given any real number, its square is nonnegative. 


Solution 
Note In part (a) the a. Are there numbers a and b with the property that a? + b? = (a + b}? 
answer is yes. For Or: Are there numbers a and b such that a? + b? = (a+ by*? 
instance, a = 1 and b = 0 Or: Do there exist any numbers a and b such that a+b = (a+ b)? 


would work. Can you . a . 
think of other numbers b. Given any real number r, r° is nonnegative. 


that would also work? Or: For any real number r, r=0. 
ye 
Or: For every real number r, r° = 0. ia 


Some Important Kinds of Mathematical Statements 


Three of the most important kinds of sentences in mathematics are universal statements, 
conditional statements, and existential statements: 


A universal statement says that a certain property is true for all elements in a set. 
(For example: All positive numbers are greater than zero.) 


A conditional statement says that if one thing is true then some other thing also 


has to be true. (For example: If 378 is divisible by 18, then 378 is divisible by 6.) 


Given a property that may or may not be true, an existential statement says that 
there is at least one thing for which the property is true. (For example: There is a 
prime number that is even.) 


In later sections we will define each kind of statement carefully and discuss all of them 
in detail. The aim here is for you to realize that combinations of these statements can be 
expressed in a variety of different ways. One way uses ordinary, everyday language and 
another expresses the statement using one or more variables. The exercises are designed to 
help you start becoming comfortable in translating from one way to another. 
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Note If you introduce x 
in the first part of the sen- 
tence, be sure to include it 
in the second part of the 
sentence. 


Note For a number b to 
be an additive inverse for 
a number a means that 
a+b=0. 


1.1 VARIABLES 3 


Universal Conditional Statements 


Universal statements contain some variation of the words “for every” and conditional 
statements contain versions of the words “if-then.” A universal conditional statement is 
a statement that is both universal and conditional. Here is an example: 


For every animal a, if a is a dog, then a is a mammal. 


One of the most important facts about universal conditional statements is that they can be 
rewritten in ways that make them appear to be purely universal or purely conditional. For 
example, the previous statement can be rewritten in a way that makes its conditional nature 
explicit but its universal nature implicit: 


If ais a dog, then a is a mammal. 


Or: If an animal is a dog, then the animal is a mammal. 


The statement can also be expressed so as to make its universal nature explicit and its 
conditional nature implicit: 


For every dog a, a is amammal. 
Or: All dogs are mammals. 


The crucial point is that the ability to translate among various ways of expressing univer- 
sal conditional statements is enormously useful for doing mathematics and many parts of 
computer science. 


Rewriting a Universal Conditional Statement 
Fill in the blanks to rewrite the following statement: 


For every real number x, if x is nonzero then x’ is positive. 


a. If areal number is nonzero, then its square 
b. For every nonzero real number x, 

c. Ifx______, then 

d. The square of any nonzero real number is 


e. All nonzero real numbers have 


Solution 

a. is positive 

b. x? is positive 

c. is a nonzero real number; x’ is positive 
d. positive 


e. positive squares (or: squares that are positive) E 


Universal Existential Statements 


A universal existential statement is a statement that is universal because its first part says 
that a certain property is true for all objects of a given type, and it is existential because its 
second part asserts the existence of something. For example: 


Every real number has an additive inverse. 
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In this statement the property “has an additive inverse” applies universally to all real num- 
bers. “Has an additive inverse” asserts the existence of something—an additive inverse— 
for each real number. However, the nature of the additive inverse depends on the real 
number; different real numbers have different additive inverses. Knowing that an additive 
inverse is a real number, you can rewrite this statement in several ways, some less formal 
and some more formal:* 


All real numbers have additive inverses. 

Or: For every real number r, there is an additive inverse for r. 

Or: For every real number r, there is a real number s such that s is an 
additive inverse for r. 


Introducing names for the variables simplifies references in further discussion. For in- 
stance, after the third version of the statement you might go on to write: When r is positive, 
s is negative, when r is negative, s is positive, and when r is zero, s is also zero. 

One of the most important reasons for using variables in mathematics is that it gives you 
the ability to refer to quantities unambiguously throughout a lengthy mathematical argu- 
ment, while not restricting you to consider only specific values for them. 


CAMS Rewriting a Universal Existential Statement 


Fill in the blanks to rewrite the following statement: Every pot has a lid. 
a. All pots _____.. 

b. For every pot P, there is 

c. For every pot P, there is a lid L such that 


Solution 
a. have lids 


b. alid for P 
c. Lisa lid for P a 


Existential Universal Statements 


An existential universal statement is a statement that is existential because its first part 
asserts that a certain object exists and is universal because its second part says that the 
object satisfies a certain property for all things of a certain kind. For example: 


There is a positive integer that is less than or equal to every positive integer. 


This statement is true because the number one is a positive integer, and it satisfies the 
property of being less than or equal to every positive integer. We can rewrite the statement 
in several ways, some less formal and some more formal: 


Some positive integer is less than or equal to every positive integer. 

Or: There is a positive integer m that is less than or equal to every 
positive integer. 

Or: There is a positive integer m such that every positive integer is 
greater than or equal to m. 

Or: There is a positive integer m with the property that for every 
positive integer n, m <n. 


*A conditional could be used to help express this statement, but we postpone the additional complexity to a 
later chapter. 
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ZCWJGAKI Rewriting an Existential Universal Statement 
Fill in the blanks to rewrite the following statement in three different ways: 


There is a person in my class who is at least as old as every person in 
my class. 


a. Some ____ is at least as old as 

b. There is a person p in my class such that p is 

c. There is a person p in my class with the property that for every person q in my class, 
pis ____. 

Solution 

a. person in my class; every person in my class 

b. at least as old as every person in my class 


c. at least as old as q E 


Some of the most important mathematical concepts, such as the definition of limit of a 
sequence, can only be defined using phrases that are universal, existential, and conditional, 
and they require the use of all three phrases “for every,” “there is,’ and “if-then.” For 
example, if a), a2, a3, . . . is a sequence of real numbers, saying that 


the limit of a, as n approaches infinity is L 
means that 


for every positive real number e, there is an integer N such that 
for every integer n, ifn > N then —e <a,—L<e. 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. A universal statement asserts that a certain property 3. Given a property that may or may not be true, 
is for : an existential statement asserts that _____ for 
which the property is true. 


2. A conditional statement asserts that if one 
thing _______ then some other thing 


EXERCISE SET 1.1 


Appendix B contains either full or partial solutions to all exercises with blue numbers. When the solution is not complete, 
the exercise number has an “H” next to it. A “*” next to an exercise number signals that the exercise is more challenging 
than usual. Be careful not to get into the habit of turning to the solutions too quickly. Make every effort to work exercises 
on your own before checking your answers. See the Preface for additional sources of assistance and further study. 


In each of 1-6, fill in the blanks using a variable or variables a. Is there an integer n such that n has ? 
to rewrite the given statement. b. Does there exist such that if n is divided 
l ; ; : 9 
1. Is there a real number whose square is —1? by 5 the remainder is 2 and if i 
a. Is there a real number x such that___? Note: There are integers with this property. Can you 
b. Does there exist such that x? = —1? think of one? 


2. Is there an integer that has a remainder of 2 when 
it is divided by 5 and a remainder of 3 when it is 
divided by 6? 


3. Given any two distinct real numbers, there is a 
real number in between them. 
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In each of 8-13, fill in the blanks to rewrite the given 13. 


CHAPTER1 SPEAKING MATHEMATICALLY 


a. Given any two distinct real numbers a and b, 
there is a real number c such that c is 

b. For any two ___., such that c is 
between a and b. 


Given any real number, there is a real number that 

is greater. 

a. Given any real number r, there is 
that s is . 

b. For any 


s such 


such that s > r. 


3 


. The reciprocal of any positive real number is positive. 
a. Given any positive real number r, the reciprocal 10. 


of i 
b. For any real number r, if r is 
c. Ifa real number r , then 


, then 


The cube root of any negative real number is 
negative. 


a. Given any negative real number s, the cube T1. 


root of ____ 
b. For any real number s, if s is 
c. If areal number s , then 


, then 


. Rewrite the following statements less formally, 
without using variables. Determine, as best as you 12. 


can, whether the statements are true or false. 

a. There are real numbers u and v with the prop- 
erty thatut+tv<u-v. 

b. There is a real number x such that x? < x. 

c. For every positive integer n, n =n. 

d. For all real numbers a and b, |a + b| = |a| + |b]. 


statement. 


8. For every object J, if J is a square then J has four 


ANSWERS FOR TEST YOURSELF 


1. true; all elements of a set 


sides. 
a. All squares ____ 
b. Every square 


c. If an object is a square, then it 
d. IfJ , then J__. 
e. For every square J, 


For every equation E, if E is quadratic then £ has 
at most two real solutions. 
All quadratic equations 
Every quadratic equation ____ 
If an equation is quadratic, then it 
IfE , then £ 7 

For every quadratic equation E, 


panpgp 


Every nonzero real number has a reciprocal. 

a. All nonzero real numbers 

b. For every nonzero real number r, there is 
—— forr. 

c. For every nonzero real number r, there is a real 
number s such that 


Every positive number has a positive square root. 

a. All positive numbers ——— 

b. For every positive number e, there is for e. 

c. For every positive number e, there is a positive 
number r such that —— 


There is a real number whose product with every 
number leaves the number unchanged. 
a. Some has the property that its 
b. There is a real number r such that the product 
ofr 
c. There is a real number r with the property that 
for every real number s, 


There is a real number whose product with every 

real number equals zero. 

a. Some has the property that its 

b. There is a real number a such that the product 
ofa 

c. There is a real number a with the property that 
for every real number b, ____ 


EFJ The Language of Sets 


2. is true; also has to be true 3. there is at least one thing 


. .. when we attempt to express in mathematical symbols a condition proposed in 


words. First, we must understand thoroughly the condition. Second, we must be 


familiar with the forms of mathematical expression. —George Polya (1887-1985) 


Use of the word set as a formal mathematical term was introduced in 1879 by Georg 
Cantor (1845-1918). For most mathematical purposes we can think of a set intuitively, as 
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Note The Z is the first 
letter of the German word 
for integers, Zahlen. It 
stands for the set of all 
integers and should not 
be used as a shorthand for 
the word integer. 

When the Symbols R, 
Q, and Z are handwrit- 
ten, they appear as R, Q, 
and Z. 
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Cantor did, simply as a collection of elements. For instance, if C is the set of all countries 
that are currently in the United Nations, then the United States is an element of C, and if J 
is the set of all integers from | to 100, then the number 57 is an element of J. 


Set-Roster Notation 


If S is a set, the notation x € § means that x is an element of S. The notation x € S 
means that x is not an element of S$. A set may be specified using the set-roster 
notation by writing all of its elements between braces. For example, {1, 2, 3} denotes 
the set whose elements are 1, 2, and 3. A variation of the notation is sometimes used 
to describe a very large set, as when we write {1, 2, 3, ..., 100} to refer to the set of 
all integers from 1 to 100. A similar notation can also describe an infinite set, as when 
we write {1, 2, 3, ...} to refer to the set of all positive integers. (The symbol ... is 
called an ellipsis and is read “and so forth.”) 


The axiom of extension says that a set is completely determined by what its elements 
are—not the order in which they might be listed or the fact that some elements might be 
listed more than once. 


Using the Set-Roster Notation 

a. Let A = {1, 2, 3}, B = {3, 1, 2}, and C = {1, 1, 2, 3, 3, 3}. What are the elements of 
A, B, and C? How are A, B, and C related? 

b. Is {0} = 0? 

c. How many elements are in the set {1, {1}}? 

d. For each nonnegative integer n, let U,, = {n, —n}. Find Uj, U5, and Uo. 


Solution 


a. A, B, and C have exactly the same three elements: 1, 2, and 3. Therefore, A, B, and C 
are simply different ways to represent the same set. 


b. {0} # 0 because {0} is a set with one element, namely 0, whereas 0 is just the symbol 
that represents the number zero. 


c. The set {1, {1}} has two elements: 1 and the set whose only element is 1. 
d. U, = {1, —1}, U2 = {2, —2}, Uo = (0, —0} = {0, 0} = {0}. E 


Certain sets of numbers are so frequently referred to that they are given special sym- 
bolic names. These are summarized in the following table. 


Symbol | Set 
R the set of all real numbers 
Z the set of all integers 
Q the set of all rational numbers, or quotients of integers 


Addition of a superscript + or — or the letters nonneg indicates that only the positive or 
negative or nonnegative elements of the set, respectively, are to be included. Thus R* 
denotes the set of positive real numbers, and Z”°””®S refers to the set of nonnegative 
integers: 0, 1, 2, 3, 4, and so forth. Some authors refer to the set of nonnegative integers 
as the set of natural numbers and denote it as N. Other authors call only the positive 
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integers natural numbers. To prevent confusion, we simply avoid using the phrase natural 
numbers in this book. 

The set of real numbers is usually pictured as the set of all points on a line, as shown 
below. The number 0 corresponds to a middle point, called the origin. A unit of distance is 
marked off, and each point to the right of the origin corresponds to a positive real number 
found by computing its distance from the origin. Each point to the left of the origin cor- 
responds to a negative real number, which is denoted by computing its distance from the 
origin and putting a minus sign in front of the resulting number. The set of real numbers is 
therefore divided into three parts: the set of positive real numbers, the set of negative real 
numbers, and the number 0. Note that 0 is neither positive nor negative. Labels are given 
for a few real numbers corresponding to points on the line shown below. 


3 2 1 0 1 2 3 
|e t t 


+- +- 
5 5 -0.8 
2 


wie @ 
2 

NI 
N 
lon 
i 


The real number line is called continuous because it is imagined to have no holes. The 
set of integers corresponds to a collection of points located at fixed intervals along the real 
number line. Thus every integer is a real number, and because the integers are all sepa- 
rated from each other, the set of integers is called discrete. The name discrete mathematics 
comes from the distinction between continuous and discrete mathematical objects. 

Another way to specify a set uses what is called the set-builder notation. 


ae. Sere eee Set-Builder Notation 


hand brace as “the set of 


all” and the vertical line Let S denote a set and let P(x) be a property that elements of S may or may not satisfy. 
as “such that.” In all oth- We may define a new set to be the set of all elements x in S such that P(x) is true. 
er mathematical contexts, We denote this set as follows: 

however, we do not use 

a vertical line to denote {xES| Pœ} 

the words “such that”; we A 

abbreviate “such that” as Me set airal ua 

“5: t. or sth or“ 


Occasionally we will write {x | P(x)} without being specific about where the element x 
comes from. It turns out that unrestricted use of this notation can lead to genuine contradic- 
tions in set theory. We will discuss one of these in Section 6.4 and will be careful to use this 
notation purely as a convenience in cases where the set S could be specified if necessary. 


CEAMSE] Using the Set-Builder Notation 


Given that R denotes the set of all real numbers, Z the set of all integers, and Z* the set of 
all positive integers, describe each of the following sets. 


a. {x ER|-2<x<5} 
b. {xEZ|-2<x< 5} 
c. {xEZ*|-2<x<5} 


Solution 


a. {x E R| —2 <x <5} is the open interval of real numbers (strictly) between —2 and 
5. It is pictured as follows: 


3 2-1 0 1 2 3 4 5 6 7 8 
<$} a E e a 
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b. {x E Z|—2 < x <5} is the set of all integers (strictly) between —2 and 5. It is equal 
to the set {—1, 0, 1, 2, 3, 4}. 


c. Since all the integers in Z” are positive, {x & Zt | —2 <x < 5}= {1,2,3,4}. E 


Subsets 


A basic relation between sets is that of subset. 


IfA and B are sets, then A is called a subset of B, written A C B, if, and only if, every 
element of A is also an element of B. 
Symbolically: 


ACB meansthat for every element x, if x E A then x E B. 


The phrases A is contained in B and B contains A are alternative ways of saying that 
A is a subset of B. 


It follows from the definition of subset that for a set A not to be a subset of a set B means 
that there is at least one element of A that is not an element of B. Symbolically: 


AGB meansthat there is at least one element x such that x € A and x € B. 


Definition 


Let A and B be sets. A is a proper subset of B if, and only if, every element of A is 
in B but there is at least one element of B that is not in A. 


[Example 1.2.3 Subsets 


Let A= Z*,B = {n€ Z|0 =n = 100}, and C = {100, 200, 300, 400, 500}. Evaluate 
the truth and falsity of each of the following statements. 


a BCA 

b. Cis a proper subset of A 

c. Cand B have at least one element in common 
d.CCB 

ê GGE 


Solution 
a. False. Zero is not a positive integer. Thus zero is in B but zero is not in A, and so B © A. 


b. True. Each element in C is a positive integer and, hence, is in A, but there are elements 
in A that are not in C. For instance, 1 is in A and not in C. 


c. True. For example, 100 is in both C and B. 
d. False. For example, 200 is in C but not in B. 


e. True. Every element in C is in C. In general, the definition of subset implies that all 
sets are subsets of themselves. E 
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Distinction between € and C 
Which of the following are true statements? 
a. 2€ {1, 2,3} b. {2} € {1, 2, 3} E 2:G41 223) 
d. {2}C{1,2,3} e. {2}C{{1}, {2}} £ {2}E({1}, {2} 


Solution Only (a), (d), and (f) are true. 

For (b) to be true, the set {1, 2, 3} would have to contain the element {2}. But the only 
elements of {1, 2, 3} are 1, 2, and 3, and 2 is not equal to {2}. Hence (b) is false. 

For (c) to be true, the number 2 would have to be a set and every element in the set 2 
would have to be an element of {1, 2, 3}. This is not the case, so (c) is false. 

For (e) to be true, every element in the set containing only the number 2 would have to 
be an element of the set whose elements are {1} and {2}. But 2 is not equal to either {1} or 
{2}, and so (e) is false. E 


Cartesian Products 


With the introduction of Georg Cantor’s set theory in the late nineteenth century, it 
began to seem possible to put mathematics on a firm logical foundation by developing 
all of its various branches from set theory and logic alone. A major stumbling block was 
how to use sets to define an ordered pair because the definition of a set is unaffected 
by the order in which its elements are listed. For example, {a, b} and {b, a} represent 
the same set, whereas in an ordered pair we want to be able to indicate which element 
comes first. 

In 1914 crucial breakthroughs were made by Norbert Wiener (1894-1964), a young 
American who had recently received his Ph.D. from Harvard, and the German mathemati- 
cian Felix Hausdorff (1868—1942). Both gave definitions showing that an ordered pair can 
be defined as a certain type of set, but both definitions were somewhat awkward. Finally, 
in 1921, the Polish mathematician Kazimierz Kuratowski (1896-1980) published the fol- 


ArchivePL/Alamy Stock Photo 


Kazimierz Kuratowski s Pia . i u 
(1896-1980) lowing definition, which has since become standard. It says that an ordered pair is a set of 


the form 


{{a}, {a, b}}. 


This set has elements, {a} and {a, b}. If a # b, then the two sets are distinct and a is in both 
sets whereas b is not. This allows us to distinguish between a and b and say that a is the 
first element of the ordered pair and b is the second element of the pair. If a = b, then we 
can simply say that a is both the first and the second element of the pair. In this case the set 
that defines the ordered pair becomes {{a}, {a, a}}, which equals {{a}}. 

However, it was only long after ordered pairs had been used extensively in mathematics 
that mathematicians realized that it was possible to define them entirely in terms of sets, 
and, in any case, the set notation would be cumbersome to use on a regular basis. The usual 
notation for ordered pairs refers to {{a}, {a, b}} more simply as (a, b). 


Given elements a and b, the symbol (a, b) denotes the ordered pair consisting of 
a and b together with the specification that a is the first element of the pair and b 
is the second element. Two ordered pairs (a, b) and (c, d) are equal if, and only if, 
a = c and b = d. Symbolically: 


(a,b) = (c,d) means that a= candb = d. 
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CASE 49 Ordered Pairs 
a. Is (1, 2) = (2, 1)? 
b. Is (3, 5) = (V9, 3)? 
c. What is the first element of (1, 1)? 


Solution 
a. No. By definition of equality of ordered pairs, 


(1, 2) = (2, 1) if, and only if, 1 = 2 and 2 = 1. 
But 1 # 2, and so the ordered pairs are not equal. 


b. Yes. By definition of equality of ordered pairs, 


5 1\. . 7 5 at 
(s. 5) = (va. 5) if, and only if, 3 = V9 and 10 2° 


Because these equations are both true, the ordered pairs are equal. 


c. In the ordered pair (1, 1), the first and the second elements are both 1. E 


The notation for an ordered n-tuple generalizes the notation for an ordered pair to a set 
with any finite number of elements. It also takes both order and multiplicity into account. 


Definition 


Let n be a positive integer and let x1, X2, . . . , x, be (not necessarily distinct) ele- 
ments. The ordered n-tuple, (x1, X2, ...,X,), consists of x), X2,..., x, together with 
the ordering: first xı, then x2, and so forth up to x,. An ordered 2-tuple is called an 
ordered pair, and an ordered 3-tuple is called an ordered triple. 

Two ordered n-tuples (x1, X2,..., X,) and (y1, Y2, - - - , Yn) are equal if, and only 
Iti, 28) = Veia = Macon, EMOL = Nye 

Symbolically: 


(X1, X2,--- Xn) = Oi, Yar---> Vn) > X= V1, X2 = Yo,--- Xp = Vn 


Ordered n-tuples 
a. Is (1, 2, 3, 4) = (1, 2, 4, 3)? 
b. Is (3, (-2), 5) = (V9, 4, 2)? 
Solution 
a. No. By definition of equality of ordered 4-tuples, 
(d, 2, 3,4) = (1,2,4,3) © 1=1,2 =2,3 =4,and4 =3 


But 3 # 4, and so the ordered 4-tuples are not equal. 
b. Yes. By definition of equality of ordered triples, 


(s. (C27, 5) = (va. 4, a) = 3=V9and(-2) = 4 and 5 = = 


Because these equations are all true, the two ordered triples are equal. E 
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Definition 


Given sets A,,A>,...,A,, the Cartesian product of Aj, A>,...,A,, denoted 
A, X A, X ++- X A,, is the set of all ordered n-tuples (a4, do,..., a„) where a, E Aj, 
w = Aon ncs oth, SA 

Symbolically: 


A, X Ap X*** XA, = {(Qj, Qy,..-, Gy) o E Aj, Ay E Ad, ..., An E Ap}. 


In particular, 


A, X Ag = {(a, a) | a; E A; and ay E A2} 


is the Cartesian product of A; and A3. 


Seow Cartesian Products 


Let A = {x, y}, B = {1, 2, 3}, and C = {a, b}. 
a. FindA X B. 


b. Find B X A. 

c. Find A X A. 

d. How many elements are in A X B, B X A, and A X A? 

e. Find (A X B) X C 

f. FindA X BX C 

g. Let R denote the set of all real numbers. Describe R X R. 
Solution 

a. AX B= {(x, 1), Q, D, @ 2), Q, D, @ 3), 0 3) 

b. BX A= {(1, x), CL y), 2, x), 2 y), G, x), G, V) 

c. AXA = {x x), œ y), OX) O, V 


Note This is why it d. A X B has 6 elements. Note that this is the number of elements in A times the number 
makes sense to call a of elements in B. B X A has 6 elements, the number of elements in B times the num- 
Cartesian product a ber of elements in A. A X A has 4 elements, the number of elements in A times the 
product! number of elements in A. 


e. The Cartesian product of A and B is a set, so it may be used as one of the sets making 
up another Cartesian product. This is the case for (A X B) X C. 


(AX B)XC {(u, v)|u EAX Band v € C} by definition of Cartesian product 
{((@x, 1), a), (x, 2), a), (œx, 3), a), (Q, 1), a), 
(Q, 2), a), (, 3), a), (x, 1), b), (Œ, 2), b), (Œ, 3), b), 


(O, 1), b), (O, 2), b), (Q, 3), b)} 


f. The Cartesian product A X B X C is superficially similar to but is not quite the same 
mathematical object as (A X B) X C. (A X B) X C is a set of ordered pairs of which 
one element is itself an ordered pair, whereas A X B X C is a set of ordered triples. By 
definition of Cartesian product, 
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1.2 THE LANGUAGE OF SETS 13 


AXBXC {(u, v, w)|u E A, v E B, and w E C} 
{(x, 1, a), (x, 2, a), (x, 3, a), Q, 1, a), Q, 2, a), Q, 3, a), Œ, 1, b), 


(x, 2, b), (x, 3, b), , 1, b), , 2, b), Y, 3, b)}. 


g. R X R is the set of all ordered pairs (x, y) where both x and y are real numbers. If 
horizontal and vertical axes are drawn on a plane and a unit length is marked off, then 
each ordered pair in R X R corresponds to a unique point in the plane, with the first 
and second elements of the pair indicating, respectively, the horizontal and vertical 
positions of the point. The term Cartesian plane is often used to refer to a plane with 
this coordinate system, as illustrated in Figure 1.2.1. 


YA 
gA: 
= 
Sr 24 
1+ e (2,1) 
t } t } } } t t > 
4 3 2 1 1 2 3 4 a 
ete 
(-2, -2) e -2 + e (1, —2) 
=34 
FIGURE 1.2.1 A Cartesian Plane E 


Another notation, which is important in both mathematics and computer science, denotes 
objects called strings.* 


Definition 


Let n be a positive integer. Given a finite set A, a string of length n over A is an or- 
dered n-tuple of elements of A written without parentheses or commas. The elements 
of A are called the characters of the string. The null string over A is defined to be 
the “string” with no characters. It is often denoted à and is said to have length 0. If 
A = {0, 1}, then a string over A is called a bit string. 


Example 1.2.8 Strings 


Let A = {a, b}. List all the strings of length 3 over A with at least two characters that are 
the same. 


Solution 


aab, aba, baa, aaa, bba, bab, abb, bbb 


In computer programming it is important to distinguish among different kinds of data 
structures and to respect the notations that are used for them. Similarly in mathematics, it 
is important to distinguish among, say, {a, b, c}, {{a, b}, c}, (a, b, c), (a, (b, ©), abc, and so 
forth, because these are all significantly different objects. E 


*A more formal definition of string, using recursion, is given in Section 5.9. 
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TEST YOURSELF 


1. When the elements of a set are given using the 
set-roster notation, the order in which they are 
listed 


. The symbol R denotes —____ 
. The symbol Z denotes ____ 


bh w N 


. The symbol Q denotes 


5. The notation {x| P(x)} is read 


EXERCISE SET 1.2 


. For a set A to be a subset of a set B means that 


. Given sets A and B, the Cartesian product A X B 


1S 


. Given sets A, B, and C, the Cartesian product 


AX BX Cis 


. A string of length n over a set S is an ordered n-tuple 


1. Which of the following sets are equal? 


A = {a,b,c,d} 
C= {d, b, a, c} D = {a, a, d, e, c, e} 


2. Write in words how to read each of the following 
out loud. 
a. {x ER 
b. {x E€R|xSO0orx=h} 
c. {n © Z| nisa factor of 6} 
d. {n E Z |n is a factor of 6} 


0<x<1} 


3. a. Is4 = {4}? 
b. How many elements are in the set {3, 4, 3, 5}? 
c. How many elements are in the set {1, {1}, {1, {1}}}? 


Is 2 E {2}? 

How many elements are in the set {2, 2, 2, 2}? 
How many elements are in the set {0, {0}}? 

. Is {0} E {{0}, {1}? 

Is 0 E {{O}, {1}}? 


pangs 


H5. Which of the following sets are equal? 
A = {0, 1, 2} 
B= {xER|-1 =x<3} 
C= {xER|-1<x<3} 
D={xEZ|-1<x< 3} 
E={x€Z*|-l<x<3} 

H 6. For each integer n, let T, = {n, n}. How many 


elements are in each of 75, T_3, T,, and Ty? Justify 
your answers. 


7. Use the set-roster notation to indicate the elements 
in each of the following sets. 
a. S= {n E€ Z |n = (-1), for some integer k}. 
b. T= {mE Z |m = 1+(— 1f, for some integer i}. 


of elements of S, written without or 
ce U={rEeZ|2=rs-2} 
_ d. V={sE€Z|s>2ors <3} 
PSG) e W={tEZ|1<r< -3} 
f. X= {uEZ|us4oru2= 1} 


. Let A = {c, d, f.g}, B = {fj}, and C = {d, g}. 


Answer each of the following questions. Give 
reasons for your answers. 


a. IsB CA? 
. IsC CA? 
IsC CC? 
© Is Ca proper subset of A? 


an g 


a. Is 3 € {1, 2, 3}? 
b. Is 1 C {1}? 

c. Is {2} E {1,2} 

d. Is {3} E {1, {2}, {3}}? 
e. Is1 € {1}? 

f. Is {2} C {1, {2}, {13}? 
g. Is {1} C {1,2} 

h. Is 1 € {{1}, 2}? 

i. Is {1} C {1, {2}? 

j. Is {1} C {1}? 


a. Is (D°, -23 = (—2?, (-2))? 
b. Is (5, —5) = (-5, 5)? 

c. Is(8 — 9,W—1) = (1, -1)? 
d. Is (Z, (-2)°) = (2, -8)9 


. Let A = {w, x, y, z} and B = {a, b}. Use the set- 


roster notation to write each of the following sets, and 
indicate the number of elements that are in each set. 
a AXB 

b. BXA 

c. AXA 

d. BXB 
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12. Let S = {2, 4, 6} and T = {1, 3, 5}. Use the set- 14. Let R = {a}, S = {x, y}, and T = {p, q, r}. Find 
roster notation to write each of the following sets, and each of the following sets. 
indicate the number of elements that are in each set. a. RX (S X T) 
a. SXT b. (RX S) XT 
b. TXS ce RXSXT 
c SXS 


d. TXT 


13. Let A = {1,2,3}, B = {u}, and C = {m, n}. Find 
each of the following sets. 


a. AX(BXC) 
b. (AX B)XC 


c AXB 


ANSWERS FOR TEST YOURSELF 


xC 


15. Let S = {0, 1}. List all the strings of length 4 over 
S that contain three or more 0’s. 


16. Let T = {x, y}. List all the strings of length 5 over 
T that have exactly one y. 


1. does not matter 2. the set of all real numbers 3. the set inB 7. the set of all ordered pairs (a, b) where a is in A and 
of all integers 4. the set of all rational numbers 5. the set bisinB 8. the set of ordered triples of the form (a, b, c) 
of all x such that P(x) 6. every element in A is an element where a E A,b E B,andc E C 9. parentheses; commas 


The Language of Relations and Functions 


Mathematics is a language. —Josiah Willard Gibbs (1839-1903) 


There are many kinds of relationships in the world. For instance, we say that two people 
are related by blood if they share a common ancestor and that they are related by marriage 
if one shares a common ancestor with the spouse of the other. We also speak of the rela- 
tionship between student and teacher, between people who work for the same employer, 
and between people who share a common ethnic background. 

Similarly, the objects of mathematics may be related in various ways. A set A may 
be said to be related to a set B if A is a subset of B, or if A is not a subset of B, or if A 
and B have at least one element in common. A number x may be said to be related to a 
number y if x < y, or if x is a factor of y, or if x + y = |. Two identifiers in a computer 
program may be said to be related if they have the same first eight characters, or if the 
same memory location is used to store their values when the program is executed. And 
the list could go on! 

Let A = {0, 1,2} and B = {1, 2, 3} and let us say that an element x in A is related to an 
element y in B if, and only if, x is less than y. Let us use the notation x R y as a shorthand 
for the sentence “x is related to y.” Then 


ORI since O<1, 
OR2 since 0<2, 
OR3 since 0<3, 
1 R2 since 1<2, 
1 R3 since 1<3, and 
2R3 since 2<23. 
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On the other hand, if the notation x Ky represents the sentence “x is not related to y,” then 


1R1 since 141, 
2R1 since 2 £1, and 
2R2 since 242. 


Recall that the Cartesian product of A and B, A X B, consists of all ordered pairs whose 
first element is in A and whose second element is in B: 


A X B= {(x, y) |x E A andy E B}. 
In this case, 
A X B = {(0, 1), (0, 2), (0, 3), (1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3)}. 


The elements of some ordered pairs in A X B are related, whereas the elements of other 
ordered pairs are not. Consider the set of all ordered pairs in A X B whose elements are 
related 


{(0, 1), (0, 2), (0, 3), (1, 2), CL, 3), (2, 3)}. 


Observe that knowing which ordered pairs lie in this set is equivalent to knowing which 
elements are related to which. The relation itself can therefore be thought of as the totality of 
ordered pairs whose elements are related by the given condition. The formal mathematical 
definition of relation, based on this idea, was introduced by the American mathematician 
and logician C. S. Peirce in the nineteenth century. 


Definition 


Let A and B be sets. A relation R from A to B is a subset of A X B. Given an ordered 
pair (x, y)in A X B, x is related to y by R, written x R y, if, and only if, (x, y) is in R. 
The set A is called the domain of R and the set B is called its co-domain. 

The notation for a relation R may be written symbolically as follows: 


xRy meansthat (x,y) ER. 
The notation xR y means that x is not related to y by R: 
xR y meansthat (x,y) ER. 


Genoceesm A Relation as a Subset 


Let A = {1, 2} and B = {1, 2, 3} and define a relation R from A to B as follows: Given any 
(x,y) EA X B, 


(x,y) ER means that 2 is an integer. 


a. State explicitly which ordered pairs are in A X B and which are in R. 
b. Is 1 R3?Is2 R3?Is2 R2? 


c. What are the domain and co-domain of R? 


Solution 


a. AX B= {(1, 1), (l, 2), (1, 3), (2, 1), (2, 2), (2, 3)}. To determine explicitly the compo- 
sition of R, examine each ordered pair in A X B to see whether its elements satisfy the 
defining condition for R. 
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(1, 1) E R because 1 = - = 0, which is an integer. 


(1, 2) € R because at = 4, which is not an integer. 


(1, 3) E R because 14 = > = —1, which is an integer. 
(2, 1) € R because zat = 5, which is not an integer. 

(2, 2) E R because me = 2 = 0, which is an integer. 

(2, 3) É R because 25 = Z, which is not an integer. 


Thus 


R = {(, D, (L, 3), (2, 2)} 


b. Yes, 1 R3 because (1, 3) E R. 
No, 2 R3 because (2, 3) € R. 
Yes, 2 R 2 because (2, 2) E R. 


c. The domain of R is {1, 2} and the co-domain is {1, 2, 3}. E 


ECAAER The Circle Relation 


Define a relation C from R to R as follows: For any (x, y) E R X R, 
(x,y) E C means that x +y = 1. 
1 V3 
a. Is (1, 0) E C? Is (0, 0) E C7 Is ( 5 5€ C? Is —2 C 0? Is 0 C (—1)? Is 1 C 1? 


b. What are the domain and co-domain of C? 


c. Draw a graph for C by plotting the points of C in the Cartesian plane. 


Solution 
a. Yes, (1, 0) E C because 1°+0° = 1. 


No, (0, 0) € C because 0° + 0° = 0 #1. 

Yes, (-5, v3) (= C because ( In 3p = 1+2 =1. 
No, —2 É 0 because (—2)? +0? = 4 # 1. 

Yes, 0 C (-1) because 0? + (—1)? = 1. 

No, 1 É 1 because 17+ 1° =2 #1. 


b. The domain and co-domain of C are both R, the set of all real numbers. 


=Y 
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Arrow Diagram of a Relation 


Suppose R is a relation from a set A to a set B. The arrow diagram for R is obtained as 
follows: 


1. Represent the elements of A as points in one region and the elements of B as points in 
another region. 


2. For each x in A and y in B, draw an arrow from x to y if, and only if, x is related to y by 
R. Symbolically: 


Draw an arrow from x to y 
if, and only if, xRy 
if, and only if, (x,y) ER. 


Gece Arrow Diagrams of Relations 


Let A = {1,2,3} and B = {1, 2, 3} and define relations S and T from A to B as follows: 
For every (x, y) E A X B, 


(x,y) ES meansthat x< y S is a “less than” relation. 
Draw arrow diagrams for S and T. 


T = {(2, 1), (2, 5)}. 
pee jes 


These example relations illustrate that it is possible to have several arrows coming out 
of the same element of A pointing in different directions. Also, it is quite possible to have 
an element of A that does not have an arrow coming out of it. E 


Solution 


Functions 


In Section 1.2 we showed that ordered pairs can be defined in terms of sets and we defined 
Cartesian products in terms of ordered pairs. In this section we introduced relations as subsets 
of Cartesian products. Thus we can now define functions in a way that depends only on the 
concept of set. Although this definition is not obviously related to the way we usually work 
with functions in mathematics, it is satisfying from a theoretical point of view, and computer 
scientists like it because it is particularly well suited for operating with functions on a computer. 


Definition 


A function F from a set A to a set B is a relation with domain A and co-domain B 
that satisfies the following two properties: 


1. For every element x in A, there is an element y in B such that (x, y) E F. 


2. For all elements x in A and y and z in B, 


if @,y)GFand@2EF, then y=z. 
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Properties (1) and (2) can be stated less formally as follows: A relation F from A to B is 
a function if, and only if: 


1. Every element of A is the first element of an ordered pair of F. 
2. No two distinct ordered pairs in F have the same first element. 


In most mathematical situations we think of a function as sending elements from one 
set, the domain, to elements of another set, the co-domain. Because of the definition of 
function, each element in the domain corresponds to one and only one element of the 
co-domain. 

More precisely, if F is a function from a set A to a set B, then given any element x in A, 
property (1) from the function definition guarantees that there is at least one element of B 
that is related to x by F and property (2) guarantees that there is at most one such element. 
This makes it possible to give the element that corresponds to x a special name. 


Function Notation 


If A and B are sets and F is a function from A to B, then given any element x in A, the 
unique element in B that is related to x by F is denoted F(x), which is read “F of x.” 


Functions and Relations on Finite Sets 
Let A = {2, 4, 6} and B = {1, 3, 5}. Which of the relations R, S, and T defined below are 
functions from A to B? 
a. R= {(2, 5), (4, 1), (4, 3), (6, 5)} 
b. For every (x, y) E A X B, (x, y) E S means that y = x+ 1. 


c. Tis defined by the arrow diagram 


Solution 

a. R is not a function because it does not satisfy property (2). The ordered pairs (4, 1) 
and (4, 3) have the same first element but different second elements. You can see this 
graphically if you draw the arrow diagram for R. There are two arrows coming out of 
4: One points to | and the other points to 3. 


b. Sis not a function because it does not satisfy property (1). It is not true that every 
element of A is the first element of an ordered pair in S. For example, 6 € A but there 
is no yin B such that y = 6+1 = 7. You can also see this graphically by drawing the 
arrow diagram for S. 
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Note In part (c), 

T(4) = T(6). This illustrates 
the fact that although no 
element of the domain of a 
function can be related to 
more than one element of 
the co-domain, several ele- 
ments in the domain can be 
related to the same element 
in the co-domain. 
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c. Tis a function: Each element in {2, 4, 6} is related to some element in {1, 3, 5}, and no 
element in {2, 4, 6} is related to more than one element in {1, 3, 5}. When these proper- 
ties are stated in terms of the arrow diagram, they become (1) there is an arrow coming 
out of each element of the domain, and (2) no element of the domain has more than one 
arrow coming out of it. So you can write 7(2) = 5, T(4) = 1, and 7(6) = 1. E 


Functions and Relations on Sets of Strings 


Let A = {a, b} and let S be the set of all strings over A. 


a. Define a relation L from S to Z”°””®® as follows: For every string s in S and for every 
nonnegative integer n, 


(s, n) E L means that the length of s is n. 


Observe that L is a function because every string in S has one and only one length. 
Find L(abaaba) and L(bbb). 


b. Define a relation C from S to S as follows: For all strings s and t in S, 


(s, t) E C means that t = as, 


where as is the string obtained by appending a on the left of the characters in s. (C is 
called concatenation by a on the left.) Observe that C is a function because every 
string in S consists entirely of a’s and b’s and adding an additional a on the left creates 
a new strong that also consists of a’s and b’s and thus is also in S. Find C(abaaba) and 
C(bbb). 


Solution 

a. L(abaaba) = 6 and L(bbb) = 3 

b. C(abaaba) = aabaaba and C(bbb) = abbb E 
Function Machines 


Another useful way to think of a function is as a machine. Suppose fis a function from X to 
Y and an input x of X is given. Imagine f to be a machine that processes x in a certain way 
to produce the output f(x). This is illustrated in Figure 1.3.1. 


function machine > 


fœ Output 


Input 


FIGURE 1.3.1 
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Seu wee Functions Defined by Formulas 


The squaring function f from R to R is defined by the formula f(x) = x for every real 
number x. This means that no matter what real number input is substituted for x, the output 
of f will be the square of that number. This ea can be represented by writing fO= 
In other words, f sends each real number x to x’, or, symbolically, f: x —> x’. Note that the 
variable x is a dummy variable; any other sviiibol could replace it, as long as the replace- 
ment is made everywhere the x appears. 

The successor function g from Z to Z is defined by the formula g(n) = n + 1. Thus, no mat- 
ter what integer is substituted for n, the output of g will be that number plus 1: g0) = U + 1. In 
other words, g sends each integer n to n + 1, or, symbolically, g: n > n + 1. 

An example of a constant function is the function h from Q to Z defined by the for- 
mula h(r) = 2 for all rational numbers r. This function sends each rational number r to 2. 
In other words, no matter what the input, the output is always 2: A(O) = 2 or h: r > 2. 

The functions f, g, and h are represented by the function machines in Figure 1.3.2. 


squaring successor constant 
function function function 


fx) =x? g(n)=n+1 h(r) =2 


(a) (b) (c) 
FIGURE 1.3.2 E 


A function is an entity in its own right. It can be thought of as a certain relationship 
between sets or as an input/output machine that operates according to a certain rule. This 
is the reason why a function is generally denoted by a single symbol or string of symbols, 
such as f, G, of log, or sin. 

A relation is a subset of a Cartesian product and a function is a special kind of relation. 
Specifically, if f and g are functions from a set A to a set B, then 


f={@%y EAXBly=f@} and g={(x,y) EA XB|y = g}. 
It follows that 


f equals g, writtenf =g, if, and only if, f(x) = g(x) for all x in A. 


Equality of Functions 


Define functions f and g from R to R by the following formulas: 
f(x) = |x| for every x E R. 


g(x) = Vea for every x E R. 


Does f = g? 
Solution 
Yes. Because the absolute value of any real number equals the square root of its square, 
|x| = yx for all x E R. Hence f = g. E 
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TEST YOURSELF 


1. Given sets A and B, a relation from A to B is 


b. for all elements x in A and y and z in B, if 
then 


2. A function F from A to B is a relation from A to B 
that satisfies the following two properties: 
a. for every element x of A, there is 


3. If Fis a function from A to B and x is an element 
of A, then F(x) is = 


EXERCISE SET 1.3 


1. Let A = {2, 3, 4} and B = {6, 8, 10} and define 


c. Write the domain and co-domain of V. 


arelation R from A to B as follows: For every 
(x,y) EA X B, 


(x,y) ER means that ~ is an integer. 


- Is4 R6? Is 4 R 8? Is (3, 8) E R? Is (2, 10) E R? 
. Write R as a set of ordered pairs. 

. Write the domain and co-domain of R. 

. Draw an arrow diagram for R. 


. Let C = D = {-3, —2, —1, 1, 2, 3} and define 
a relation S from C to D as follows: For every 
(x,y) E CXD, 


aA oy 


(x,y) ES means that L_ l is an integer. 


a. Is 2 S2? Is —1 S —1? Is (3, 3) E S? 
Is (3, —3) E $? 
b. Write S as a set of ordered pairs. 
c. Write the domain and co-domain of S. 
d. Draw an arrow diagram for S. 


. Let E = {1, 2,3} and F = {—2, —1, 0} and define 
a relation T from E to F as follows: For every 
(x,y) EE XF, 

(x,y) ET means that = is an integer. 
a. Is3 T0? Is 1 T(-1)? Is (2, —1) E T? 

Is (3, —2) ET? 

b. Write T as a set of ordered pairs. 
c. Write the domain and co-domain of T. 
d. Draw an arrow diagram for T. 


. Let G = {—2, 0, 2} and H = {4, 6, 8} and define a 
relation V from G to H as follows: For every 
(x,y) EG X A, 
(x,y) EV means that ae is an integer. 
a. Is 2 V6? Is (—2) V (8)? Is (0, 6) E V? 
Is (2,4) EV? 
b. Write V as a set of ordered pairs. 
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d. Draw an arrow diagram for V. 


. Define a relation S from R to R as follows: 


For every (x, y) E R X R, 
(x, y)ES means that x= y. 


a. Is (2, 1) E€ S? Is (2,2) E€ S? Is 2 8 3? 
Is (—1) S (-2)? 
b. Draw the graph of S in the Cartesian plane. 


. Define a relation R from R to R as follows: 


For every (x, y) E R X R, 
(x,y) ER meansthat y= x. 
a. Is (2,4) E R? Is (4, 2) E R? Is (—3) R 9? 
Is 9 R (—3)? 
b. Draw the graph of R in the Cartesian plane. 


. Let A = {4, 5, 6} and B = {5, 6, 7} and define 


relations R, S, and T from A to B as follows: 
For every (x, y) EA X B: 
(x,y) ER meansthat x= y. 


x-y. f 
(x, y)ES means that —~ is an integer. 


T = {(4, 7), (6, 5), (6, D}. 
a. Draw arrow diagrams for R, S, and T. 


b. Indicate whether any of the relations R, S, and 
T are functions. 


. Let A = {2, 4} and B = {1, 3, 5} and define rela- 


tions U, V, and W from A to B as follows: 

For every (x, y) E A X B: 
(x,y) E U means that y- x> 2. 
(x,y) EV meansthat y-1= z: 
W = {(2, 5), (4, 1), (2, 3)}. 

a. Draw arrow diagrams for U, V, and W. 


b. Indicate whether any of the relations U, V, and 
W are functions. 


9. 


10. 


1. 


12. 


13. 


14. 


15. 


a. Find all functions from {0, 1} to {1}. 
b. Find two relations from {0, 1} to {1} that are 
not functions. 


Find four relations from {a, b} to {x, y} that are 
not functions from {a, b} to {x, y}. 


Let A = {0, 1, 2} and let S be the set of all strings 
over A. Define a relation L from S to Z"?"" as 
follows: For every string s in S and every nonnega- 
tive integer n, 

(sn) EL means that 


Then L is a function because every string in S has 
one and only one length. Find L(0201) and L(12). 


the length of s is n. 


Let A = {x, y} and let S be the set of all strings 
over A. Define a relation C from S to S as follows: 
For all strings s and rin S, 


(s,f) © C meansthat t= ys. 


Then C is a function because every string in S 
consists entirely of x’s and y’s and adding an ad- 
ditional y on the left creates a single new string 
that consists of x’s and y’s and is, therefore, also in 
S. Find C(x) and C(yyxyx). 


Let A = {—1, 0, 1} and B = {t, u, v, w}. Define a 
function F: A — B by the following arrow diagram: 


We 


a. Write the domain and co-domain of F. 
b. Find F(—1), F(0), and F(1). 


Let C = {1, 2, 3,4} and D = {a, b, c, d}. Define 
a function G: C > D by the following arrow 
diagram: 


a. Write the domain and co-domain of G. 
b. Find G(1), G(2), G@), and G(4). 


Let X = {2,4,5} and Y = {1, 2, 4, 6}. Which of 
the following arrow diagrams determine functions 
from X to Y? 
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16. 


17. 


18. 


19. 


20. 


Let f be the squaring function defined in Example 
1.3.6. Find f(-1), f0), and f(3). 


Let g be the successor function defined in 
Example 1.3.6. Find g(—1000), g(0), and g(999). 


Let h be the constant function defined in 
Example 1.3.6. Find h(—"2), a$), and (75). 


Define functions f and g from R to R by the fol- 
lowing formulas: For every x € R, 

2x7 + 2x 

f@) = 2x and g(x) =—,——. 
x il 
Does f = g? Explain. 


Define functions H and K from R to R by the fol- 
lowing formulas: For every x € R, 


H(x) =(x-2) and K(x) =(x-D-3) 41. 
Does H = K? Explain. 
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ANSWERS FOR TEST YOURSELF 


1. a subset of the Cartesian product A X B 2. a. an element b. (x, y) E F and (x, z) E F; y =z _ 3. the unique element of B 
y of B such that (x, y) E F (e., such that x is related to y by F) that is related to x by F 


EEJ The Language of Graphs 


The whole of mathematics consists in the organization of a series of aids to the 
imagination in the process of reasoning. —Alfred North Whitehead, 1861-1947 


Imagine an organization that wants to set up teams of three to work on some projects. 
In order to maximize the number of people on each team who had previous experience 
working together successfully, the director asked the members to provide names of their 
previous partners. This information is displayed below both in a table and in a diagram. 


Name Previous Partners 
Ana Dan, Flo Ana 
Bev Cai, Flo, Hal Bev 
Cai Bev, Flo r 
Dan Ana, Ed 
Hal 

Ed Dan, Hal Cai 
Flo Cai, Bev, Ana ) 

. Gia 
Gia Hal Dan 
Hal Gia, Ed, Bev, Ira 
Ira Hal Flo Ed 


From the diagram, it is easy to see that Bev, Cai, and Flo are a group of three previous 
partners, and so it would be reasonable for them to form one of these teams. The drawing 
below shows the result when these three names are removed from the diagram. 


Ana 


Ira 


Hal 


Dan 


Ed 


This drawing shows that placing Hal on the same team as Ed would leave Gia and Ira on 
a team where they would not have a previous partner. However, if Hal is placed on a team 
with Gia and Ira, then the remaining team would consist of Ana, Dan, and Ed, and every- 
one on both teams would be working with a previous partner. 

Drawings such as these are illustrations of a structure known as a graph. The dots are 
called vertices (plural of vertex) and the line segments joining vertices are called edges. As 
you can see from the first drawing, it is possible for two edges to cross at a point that is not 
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a vertex. Note also that the type of graph described here is quite different from the “graph 
of an equation” or the “graph of a function.” 

In general, a graph consists of a set of vertices and a set of edges connecting various 
pairs of vertices. The edges may be straight or curved and should either connect one vertex 
to another or a vertex to itself, as shown below. 


Parallel edges Isolated vertex 
U3 
e3 
U5 
(5) ~ 
&4 
v2 
ey U7 
&6 
vi e5 


Loop 


In this drawing, the vertices are labeled with v’s and the edges with e’s. When an edge 
connects a vertex to itself (as e; does), it is called a Joop. When two edges connect the same 
pair of vertices (as e, and e; do), they are said to be parallel. It is quite possible for a vertex 
to be unconnected by an edge to any other vertex in the graph (as vs is), and in that case the 
vertex is said to be isolated. The formal definition of a graph follows. 


Definition 


A graph G consists of two finite sets: a nonempty set V(G) of vertices and a set E(G) 
of edges, where each edge is associated with a set consisting of either one or two 
vertices called its endpoints. The correspondence from edges to endpoints is called 
the edge-endpoint function. 

An edge with just one endpoint is called a loop, and two or more distinct edges 
with the same set of endpoints are said to be parallel. An edge is said to connect 
its endpoints; two vertices that are connected by an edge are called adjacent; and a 
vertex that is an endpoint of a loop is said to be adjacent to itself. 

An edge is said to be incident on each of its endpoints, and two edges incident 
on the same endpoint are called adjacent. A vertex on which no edges are incident 
is called isolated. 


Graphs have pictorial representations in which the vertices are represented by dots and 
the edges by line segments. A given pictorial representation uniquely determines a graph. 


Seem Terminology 


Consider the following graph: 
er 


vy 


v 
2 e4 U3 
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a. Write the vertex set and the edge set, and give a table showing the edge-endpoint function. 


b. Find all edges that are incident on vy, all vertices that are adjacent to v}, all edges that 
are adjacent to e4, all loops, all parallel edges, all vertices that are adjacent to them- 
selves, and all isolated vertices. 


Solution 

a. vertex set = {v], V2, V3, U4, Us, V6} 
edge set = {e}, €2, €3, €4, €5, €6, €7} 
edge-endpoint function: 


Edge Endpoints 


ey {vi, v2} 
e2 {vi, v3} 
e3 {v,, v3} 
e4 {v2, v3} 
es {U5, ve} 
& {us} 

e7 {U6} 


b. ej, e2, and e; are incident on vj. 
v, and v; are adjacent to vy. 
ez, €3, and e; are adjacent to e4. 
eç and e; are loops. 


Note The isolated vertex 
v4 does not appear in the 
table. Although each edge 
of a graph must have ei- 


ther one or two endpoints, e, and e; are parallel. 
a vertex need not be an v; and vę are adjacent to themselves. 
endpoint of an edge. v4 is an isolated vertex. E 


Although a given pictorial representation uniquely determines a graph, a given graph may 
have more than one pictorial representation. Such things as the lengths or curvatures of 
the edges and the relative position of the vertices on the page may vary from one pictorial 
representation to another. 


Drawing More Than One Picture for a Graph 
Consider the graph specified as follows: 


vertex set = {v], Vz, V3, V4} 
edge set = {e], €2, €3, eq} 


edge-endpoint function: 


Edge Endpoints 
ei {vi v3} 
er {v2, v4} 
e3 {v2, v4} 
e4 {v3} 
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Both drawings (a) and (b) shown below are pictorial representations of this graph. 


V4 e4 
U3 


ez e3 el 


Va vi 


(a) (b) E 


Labeling Drawings to Show They Represent the Same Graph 


Consider the two drawings shown in Figure 1.4.1. Label vertices and edges in such a way 
that both drawings represent the same graph. 


(a) (b) 
FIGURE 1.4.1 


Solution Imagine putting one end of a piece of string at the top vertex of Figure 1.4.1(a) 
(call this vertex vı), then laying the string to the next adjacent vertex on the lower right (call 
this vertex v2), then laying it to the next adjacent vertex on the upper left (v3), and so forth, 
returning finally to the top vertex vı. Call the first edge e4, the second e, and so forth, as 
shown below. 


Vi 
ei 
e 
v3 3 V4 
es e4 
ez 
v5 v2 


Now imagine picking up the piece of string, together with its labels, and repositioning it 
as follows: 


v4 3 v3 
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This is the same as Figure 1.4.1(b), so both drawings represent the graph with vertex set 
{VU}, V2, V3, V4, Us}, edge set {e1, er, €3, €4, €s}, and edge-endpoint function as follows: 


Edge Endpoints 
e {v,, v2} 
e {v2, V3} 
e3 {u3, v4} 
€4 {v4, Vs} 
es {Us, vi} El 


Examples of Graphs 


Graphs are a powerful problem-solving tool because they enable us to represent a complex 
situation with a single image that can be analyzed both visually and with the aid of a com- 
puter. A few examples follow, and others are included in the exercises. 


Using a Graph to Represent a Network 


Telephone, electric power, gas pipeline, and air transport systems can all be represented 
by graphs, as can computer networks—from small local area networks to the global In- 
ternet system that connects millions of computers worldwide. Questions that arise in the 
design of such systems involve choosing connecting edges to minimize cost, optimize a 
certain type of service, and so forth. A typical network, called a hub-and-spoke model, 
is shown below. 


Boston 


Denver 


San Francisco New York 


Washington 


Los Angeles 


Using a Graph to Represent the World Wide Web 


The World Wide Web, or Web, is a system of interlinked documents, or webpages, 
contained on the Internet. Users employing Web browsers, such as Internet Explorer, 
Chrome, Safari, and Firefox, can move quickly from one webpage to another by click- 
ing on hyperlinks, which use versions of software called hypertext transfer protocols 
(HTTPs). Individuals and individual companies create the pages, which they transmit 
to servers that contain software capable of delivering them to those who request them 
through a Web browser. Because the amount of information currently on the Web is so 
vast, search engines, such as Google, Yahoo, and Bing, have algorithms for finding in- 
formation very efficiently. 

The following picture shows a minute fraction of the hyperlink connections on the 
Internet that radiate in and out from the Wikipedia main page. 
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f ary 3: Country 


G } r - a 
(Dictionary.com) | ma (1918), 


q 
je be sar (= g 
(NewYork Pon) Cee : a 
Talomrash caa Search Home Page `) 4 
(CBSNews.com) Chie 3 q 7 CGleager.com) 
aaa ies Yahoo Viste) i 
man Wall 2 


ingapore Travel ani 
‘Minnesota Ene: 1 Maps al á 


= Weather - Travel -Hi 


Tevel Double-A Conformance to 
p | 


jance to Wel 


A directed graph is like an (undirected) graph except that each edge is associated with an 
ordered pair of vertices rather than a set of vertices. Thus each edge of a directed graph can 
be drawn as an arrow going from the first vertex to the second vertex of the ordered pair. 


Note Each directed Definition 
graph has an associated 
ordinary (undirected) 
graph, which is obtained 
by ignoring the directions 


A directed graph, or digraph, consists of two finite sets: a nonempty set V(G) of 
vertices and a set D(G) of directed edges, where each is associated with an ordered 
pair of vertices called its endpoints. If edge e is associated with the pair (v, w) of 
of the edges. vertices, then e is said to be the (directed) edge from v to w. 


Using a Graph to Represent Knowledge 


In many applications of artificial intelligence, a knowledge base of information is collected 
and represented inside a computer. Because of the way the knowledge is represented and 
because of the properties that govern the artificial intelligence program, the computer is 
not limited to retrieving data in the same form as it was entered; it can also derive new facts 
from the knowledge base by using certain built-in rules of inference. For example, from the 
knowledge that the Los Angeles Times is a big-city daily and that a big-city daily contains 
national news, an artificial intelligence program could infer that the Los Angeles Times 
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contains national news. The directed graph shown in Figure 1.4.2 is a pictorial representa- 
tion for a simplified knowledge base about periodical publications. 
According to this knowledge base, what paper finish does the New York Times use? 


made-of Periodical Da ee 
P gO OS aa PHB 
aper we 
ans , writing 
Motor Trend peat 
stance -of Sports 


ai Sports <——_ Illustrated 


w Scholarly 


oe journal Sen magazine 
Long Foy, Song , 
words Sy G “ns 
a y, Sports 


is-a is-a weekly is-a KA news 
G, 
, S Tosg, 
Literary Scientific ontains 
journal journal a ee National 
jee news 
instance-of Local 
news 
Poetry Los Angeles New York 
Magazine Times Times 
FIGURE 1.4.2 


Solution The arrow going from New York Times to big-city daily (labeled “instance- 
of”) shows that the New York Times is a big-city daily. The arrow going from big-city 
daily to newspaper (labeled “is-a”°) shows that a big-city daily is a newspaper. The arrow 
going from newspaper to matte (labeled “paper-finish’”) indicates that the paper finish on a 
newspaper is matte. Hence it can be inferred that the paper finish on the New York Times 
is matte. a 


Using a Graph to Solve a Problem: Vegetarians and Cannibals 


The following is a variation of a famous puzzle often used as an example in the study of 
artificial intelligence. It concerns an island on which all the people are of one of two types, 
either vegetarians or cannibals. Initially, two vegetarians and two cannibals are on the left 
bank of a river. With them is a boat that can hold a maximum of two people. The aim of 
the puzzle is to find a way to transport all the vegetarians and cannibals to the right bank of 
the river. What makes this difficult is that at no time can the number of cannibals on either 
bank outnumber the number of vegetarians. Otherwise, disaster befalls the vegetarians! 


Solution A systematic way to approach this problem is to introduce a notation that can 
indicate all possible arrangements of vegetarians, cannibals, and the boat on the banks of 
the river. For example, you could write (vuc / Bc) to indicate that there are two vegetarians 
and one cannibal on the left bank and one cannibal and the boat on the right bank. Then 
(vuccB /) would indicate the initial position in which both vegetarians, both cannibals, and 
the boat are on the left bank of the river. The aim of the puzzle is to figure out a sequence 
of moves to reach the position (/ Buucc) in which both vegetarians, both cannibals, and the 
boat are on the right bank of the river. 

Construct a graph whose vertices are the various arrangements that can be reached in a 
sequence of legal moves starting from the initial position. Connect vertex x to vertex y if it 
is possible to reach vertex y in one legal move from vertex x. For instance, from the initial 
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position there are four legal moves: one vegetarian and one cannibal can take the boat to the 
right bank; two cannibals can take the boat to the right bank; one cannibal can take the boat 
to the right bank; or two vegetarians can take the boat to the right bank. You can show these 
by drawing edges connecting vertex (vuccB /) to vertices (uc / Buc), (vu/ Bcc), (vuc / Bc), and 
(cc / Buv). (It might seem natural to draw directed edges rather than undirected edges from 
one vertex to another. The rationale for drawing undirected edges is that each legal move is 
reversible.) From the position (uc / Buc), the only legal moves are to go back to (vuccB /) or 
to go to (vucB/c). You can also show these by drawing in edges. Continue this process until 
finally you reach (/ Buucc). From Figure 1.4.3 it is apparent that one successful sequence of 
moves is (vuccB /) — (uc / Buc) — (vucB / c) = (c / Buvc) —> (ccB / vv) > (/ Buvcc). 


FIGURE 1.4.3 a 


Definition 


Let G be a graph and v a vertex of G. The degree of v, denoted deg(v), equals the 
number of edges that are incident on v, with an edge that is a loop counted twice. 


Since an edge that is a loop is counted twice, the degree of a vertex can be obtained 
from the drawing of a graph by counting how many end segments of edges are incident on 
the vertex. This is illustrated below. 


2 The degree of this 
Erer 
vertex equals 5. 


ZEWG S: Degree of a Vertex 


Find the degree of each vertex of the graph G shown below. 


U5 


U3 


E3 
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Solution deg(v,) = 0 since no edge is incident on v, (v; is isolated). 
deg(v2) = 2 since both e; and e, are incident on vp. 


deg(v3) = 4 since e; and e, are incident on v3 and the loop e; is also incident 
on v3 (and contributes 2 to the degree of v3). E 


Using a Graph to Color a Map 


Imagine that the diagram shown below is a map with countries labeled A-J. Show that you 
can color the map so that no two adjacent countries have the same color. 


Solution Notice that coloring the map does not depend on the sizes or shapes of the 
countries, but only on which countries are adjacent to which. So, to figure out a coloring, 
you can draw a graph, as shown below, where vertices represent countries and where edges 
are drawn between pairs of vertices that represent adjacent countries. Coloring the vertices 
of the graph will translate to coloring the countries on the map. 


As you assign colors to vertices, a relatively efficient strategy is, at each stage, to focus on 
an uncolored vertex that has maximum degree, in other words that is connected to a maxi- 
mum number of other uncolored vertices. If there is more than one such vertex, it does not 
matter which you choose because there are often several acceptable colorings for a given 
graph. For this graph, both C and H have maximum degree so you can choose one, say, C, 
and color it, say, blue. Now since A, F, I, and J are not connected to C, some of them may 
also be colored blue, and, because J is connected to a maximum number of others, you 
could start by coloring it blue. Then F is the only remaining vertex not connected to either 
C or J, so you can also color F blue. The drawing below shows the graph with vertices C, 
J, and F colored blue. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


1.4 THE LANGUAGE OF GRAPHS 33 


Since the vertices adjacent to C, J, and F cannot be colored blue, you can simplify the job 
of choosing additional colors by removing C, J, and F and the edges connecting them to 
adjacent vertices. The result is shown in Figure 1.4.4a. 


A D A D 


(a) (b) 
FIGURE 1.4.4 


In the simplified graph again choose a vertex that has a maximum degree, namely H, and 
give it a second color, say, gray. Since A, D, and E are not connected to H, some of them 
may also be colored gray, and, because E is connected to a maximum number of these 
vertices, you could start by coloring E gray. Then A is not connected to E, and so you can 
also color A gray. This is shown in Figure 1.4.4b. The drawing below shows the original 
graph with vertices C, J, and F colored blue, vertices H, A, and E, colored gray, and the 
remaining vertices colored black. You can check that no two adjacent vertices have the 
same color. 


Translating the graph coloring back to the original map gives the following picture in 
which no two adjacent countries have the same color. 


The final map in Example 1.4.9 was drawn with three colors. Two colors are not enough 
because, for example, since B, C, and H are all adjacent to each other, different colors must 
be used for all three. The following drawing shows a map of part of Central Africa that 
requires four colors. Take a moment to try to assign colors to the different countries so that 
you see why three colors are not enough. 
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Democratic Republic 


of the Congo Tanzania 


In the mid-1800s it was conjectured that any map, however complex, could be colored with 
just four colors with no two adjacent regions having the same color. The conjecture is now 
known as the four-color theorem because it was finally proved true in 1976 by Kenneth 
Appel and Wolfgang Haken, at the University of Illinois at Urbana-Champaign. They rep- 
resented maps as graphs and used an innovative and controversial technique that combined 
mathematical deduction with computer examination of almost 2000 special cases. 

In 1950 Edward Nelson, a university student, posed the following question: How many 
colors are needed to create a coloring for all the points in an ordinary (Euclidean) plane so 
that no two points separated by a unit distance have the same color? Nelson himself found 
that three colors are not enough, and a fellow student, John Isbell, developed an example 
showing that seven colors could be used. Thus the minimum number had to be 4, 5, or 6. 
Over the years a number of mathematicians tried to narrow the possibilities further, but it 
was not until 2018 that an English biologist and amateur mathematician, Aubrey de Grey, 
using a combination of ingenuity and computer calculations, created an example showing 
that four colors are not enough. As of the publication of this book, the complete answer to 
Nelson’s question is still unknown, but de Grey has now proved that it must be either 5 or 6. 

It turns out that a variety of problems can be modeled by representing their features 
with a graph and solved by finding a coloring for the vertices of the graph. For example, 
scheduling committee meetings when members serve on more than one committee but the 
meetings must be held during a fixed number of time slots or scheduling final exams for a 
group of courses so that no student has more than two exams on any one day. See exercises 
16 and 17 at the end of this section for details about these. 


TEST YOURSELF 


1. A graph consists of two finite sets: _____ 5. An edge is incident on 
and ___, where each edge is associated with a 
set consisting of 


6. Two edges incident on the same endpoint 
are 


ay ARLO A Staph ` 7. A vertex on which no edges are incident is 


3. Two distinct edges in a graph are parallel if, and 


anly it, 8. In a directed graph, each edge is associated 


with 
4. Two vertices are called adjacent if, and only 


if 9. The degree of a vertex in a graph is _ 
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EXERCISE SET 1.4 


1.4 THE LANGUAGE OF GRAPHS 35 


In 1 and 2, graphs are represented by drawings. Define each graph formally by specifying its vertex set, its edge set, anda 


table giving the edge-endpoint function. 


1. ži 


vi v2 V4 


U3 


e3 


In 3 and 4, draw pictures of the specified graphs. 


3. Graph G has vertex set {v], v2, v3, V4, vs} and edge 
set {€), €2, €3, €4}, with edge-endpoint function as 


follows: 

Edge | Endpoints 
ei {v,, V2} 
ez {vi, v2} 
e3 {v2, v3} 
e4 {v2} 


4. Graph H has vertex set {v,, v2, v3, V4, Us} and edge 
set {€), >, €3, e,} with edge-endpoint function as 


follows: 
Edge | Endpoints 
a {v} 
ez {U2, v3} 
e3 {Up, V3} 
e4 {vi Us} 


In 5-7, show that the two drawings represent the same 
graph by labeling the vertices and edges of the right-hand 
drawing to correspond to those of the left-hand drawing. 


(b) 


6. vı v2 


ez e3 


U4 


ei ez 
(a) 
U2 ey v3 
gi e3 
eg Y6 @4 
vı AN v4 
eg es 
e7 eş 
v7 Us 
(a) 
For each of the graphs in 8 and 9: 
(i) Find all edges that are incident on vj. 
(ii) Find all vertices that are adjacent to v3. 
(iii) Find all edges that are adjacent to e}. 
(iv) Find all loops. 
(v) Find all parallel edges. 


(vi) Find all isolated vertices. 
(vii) Find the degree of v3. 


U3 


(b) 
7. 


(b) 


evs 


10. Use the graph of Example 1.4.6 to determine 
a. whether Sports Illustrated contains printed 
writing; 
b. whether Poetry Magazine contains long words. 


11. Find three other winning sequences of moves for 
the vegetarians and the cannibals in Example 1.4.7. 


12. Another famous puzzle used as an example in the 
study of artificial intelligence seems first to have 
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appeared in a collection of problems, Problems for 
the Quickening of the Mind, which was compiled 
about A.D. 775. It involves a wolf, a goat, a bag of 
cabbage, and a ferryman. From an initial posi- 
tion on the left bank of a river, the ferryman is to 
transport the wolf, the goat, and the cabbage to 

the right bank. The difficulty is that the ferry- 
man’s boat is only big enough for him to transport 
one object at a time, other than himself. Yet, for 
obvious reasons, the wolf cannot be left alone with 
the goat, and the goat cannot be left alone with the 
cabbage. How should the ferryman proceed? 


13. Solve the vegetarians-and-cannibals puzzle for the 
case where there are three vegetarians and three 
cannibals to be transported from one side of a 
river to the other. 


H 14. Two jugs A and B have capacities of 3 liters and 
5 liters, respectively. Can you use the jugs to 
measure out exactly 1 liter of water, while obeying 
the following restrictions? You may fill either jug 
to capacity from a water tap; you may empty the 
contents of either jug into a drain; and you may 
pour water from either jug into the other. 


15. Imagine that the diagram shown below is a map 
with countries labeled a-g. Is it possible to color 
the map with only three colors so that no two 
adjacent countries have the same color? To answer 
this question, follow the procedure suggested by 
Example 1.4.9. Draw and analyze a graph in which 
each country is represented by a vertex and two 
vertices are connected by an edge if, and only if, 
the countries share a common border. 


H 16. In this exercise a graph is used to help solve a 
scheduling problem. Twelve faculty members in 


ANSWERS FOR TEST YOURSELF 


a mathematics department serve on the following 
committees: 


Undergraduate Education: Tenner, Peterson, 
Kashina, Degras 


Graduate Education: Hu, Ramsey, Degras, Bergen 
Colloquium: Carroll, Drupieski, Au-Yeung 
Library: Ugarcovici, Tenner, Carroll 

Hiring: Hu, Drupieski, Ramsey, Peterson 
Personnel: Ramsey, Wang, Ugarcovici 


The committees must all meet during the first 
week of classes, but there are only three time slots 
available. Find a schedule that will allow all fac- 
ulty members to attend the meetings of all com- 
mittees on which they serve. To do this, represent 
each committee as the vertex of a graph, and draw 
an edge between two vertices if the two commit- 
tees have a common member. Find a way to color 
the vertices using only three colors so that no two 
committees have the same color, and explain how 
to use the result to schedule the meetings. 


17. A department wants to schedule final exams so 
that no student has more than one exam on any 
given day. The vertices of the graph below show 
the courses that are being taken by more than one 
student, with an edge connecting two vertices if 
there is a student in both courses. Find a way to 
color the vertices of the graph with only four col- 
ors so that no two adjacent vertices have the same 
color and explain how to use the result to schedule 
the final exams. 


MCS101 MCS102 


MCS100 


MCS110 


MCS135 


MCS130 MCS 120 


1. a finite, nonempty set of vertices; a finite set of edges; 
one or two vertices called its endpoints 2. an edge with 
asingle endpoint 3. they have the same set of endpoints 
4. they are connected by an edge 5. each of its endpoints 
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6. adjacent 7. isolated 8. an ordered pair of vertices 
called its endpoints 9. the number of edges that are 
incident on the vertex, with an edge that is a loop 
counted twice 


THE LOGIC OF COMPOUND 
STATEMENTS 


The first great treatises on logic were written by the Greek philosopher Aristotle. They 
were a collection of rules for deductive reasoning that were intended to serve as a basis 
for the study of every branch of knowledge. In the seventeenth century, the German 
philosopher and mathematician Gottfried Leibniz conceived the idea of using symbols 
to mechanize the process of deductive reasoning in much the same way that algebraic 
notation had mechanized the process of reasoning about numbers and their relationships. 
Leibniz’s idea was realized in the nineteenth century by the English mathematicians 
George Boole and Augustus De Morgan, who founded the modern subject of symbolic 
logic. With research continuing to the present day, symbolic logic has provided, among 
other things, the theoretical basis for many areas of computer science such as digital logic 
circuit design (see Sections 2.4 and 2.5), relational database theory (see Section 8.1), 
automata theory and computability (see Section 7.4 and Chapter 12), and artificial intel- 


Aristotle ligence (see Sections 3.3, 10.1, and 10.5). 
(384 B.c.—322 B.C.) 


Mohamed Osama/Alamy Stock Photo 


YA Logical Form and Logical Equivalence 


Logic is a science of the necessary laws of thought, without which no employment 
of the understanding and the reason takes place. —Immanuel Kant, 1785 


An argument is a sequence of statements aimed at demonstrating the truth of an assertion. 
The assertion at the end of the sequence is called the conclusion, and the preceding state- 
ments are called premises. To have confidence in the conclusion that you draw from an 
argument, you must be sure that the premises are acceptable on their own merits or follow 
from other statements that are known to be true. 

In logic, the form of an argument is distinguished from its content. Logical analysis 
won't help you determine the intrinsic merit of an argument’s content, but it will help 
you analyze an argument’s form to determine whether the truth of the conclusion follows 
necessarily from the truth of the premises. For this reason logic is sometimes defined as 
the science of necessary inference or the science of reasoning. 

Consider the following two arguments. They have very different content but their logi- 
cal form is the same. To help make this clear, we use letters like p, g, and r to represent 
component sentences; we let the expression “not p” refer to the sentence “It is not the case 
that p”; and we let the symbol .°. stand for the word “therefore.” 


Argument | 


P q r 
A aa ——— 
If the bell rings or the flag drops, then the race is over. 
notr not p not q 


—_ —_ ——— 
«z. If the race is not over, then the bell hasn’t rung and the flag hasn’t dropped. 


37 
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Argument 2 
P q 4 
== p= GE 
If x = 2 orx = —2, thenx = 4. 
notr not p not q 


2 
<. If x # 4, then x # 2 and x # —2. 


The common form of the arguments is 
If p or q, then r. 
“. If not r, then not p and not q. 


In exercise 10 in Section 2.3 you will show that this form of argument is valid in the sense 
that if its assumptions are true, then its conclusion must also be true. 


CAMIFAR Identifying Logical Form 


Fill in the blanks below so that argument (b) has the same form as argument (a). Then rep- 
resent the common form of the arguments using letters to stand for component sentences. 


a. If Jane is a math major or Jane is a computer science major, then Jane will take 
Math 150. 
Jane is a computer science major. 
Therefore, Jane will take Math 150. 


b. If logic is easy or () , then (2) 
I will study hard. 
Therefore, I will get an A in this course. 


Solution 
1. I (will) study hard. 


2. I will get an A in this course. 


Common form: If p or q, then r. 


q. 
Therefore, r. E 


Statements 


Most of the definitions of formal logic have been developed so that they agree with the 
natural or intuitive logic used by people who have been educated to think clearly and use 
language carefully. The differences that exist between formal and intuitive logic are neces- 
sary to avoid ambiguity and obtain consistency. 

In any mathematical theory, new terms are defined by using those that have been 
previously defined. However, this process has to start somewhere. A few initial terms 
necessarily remain undefined. In logic, the words sentence, true, and false are the initial 
undefined terms. 


Definition 


A statement (or proposition) is a sentence that is true or false but not both. 


For example, “Two plus two equals four” and “Two plus two equals five” are both state- 
ments, the first because it is true and the second because it is false. On the other hand, the 
truth or falsity of 
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depends on the value of x. For some values of x, it is true (x = 3 and x = —3), whereas for 
other values it is false. Similarly, the truth or falsity of 


xty>0 
depends on the values of x and y. For instance, when x = —1 and y = 2 it is true, whereas 
when x = —1 and y = 1 it is false. In Section 3.1 we will discuss ways to transform sen- 


tences of these forms into statements. 


Compound Statements 


We now introduce three symbols that are used to build more complicated logical expres- 
sions out of simpler ones. The symbol ~ denotes not, ^ denotes and, and v denotes or. 
Given a statement p, the sentence “~p” is read “not p” or “It is not the case that p.” In some 

Note ~p means “not p” computer languages the symbol ~ is used in place of ~. Given another statement q, the 

p ^q means “p and q” sentence “p ^ q” is read “p and q.” The sentence “p v q” is read “p or q.” 

p V q means “p or q” In expressions that include the symbol ~ as well as ^or v, the order of opera- 
tions specifies that ~ is performed first. For instance, ~p ^ q = (~p) ^q. In logical 
expressions, as in ordinary algebraic expressions, the order of operations can be over- 
ridden through the use of parentheses. Thus ~(p ^ q) represents the negation of the 
conjunction of p and q. In this, as in most treatments of logic, the symbols ^ and v are 
considered coequal in order of operation, and an expression such as p ^q Vr is con- 
sidered ambiguous. This expression must be written as either (p \g) Vr or p^(qvr) 
to have meaning. 

A variety of English words translate into logic as A, v, or ~. For instance, the word 
but translates the same as and when it links two independent clauses, as in “Jim is tall 
but he is not heavy.” Generally, the word but is used in place of and when the part of 
the sentence that follows is, in some way, unexpected. Another example involves the 
words neither-nor. When Shakespeare wrote, “Neither a borrower nor a lender be,” he 
meant, “Do not be a borrower and do not be a lender.” So if p and q are statements, 
then 


p but g means p and q 


neither p nor q means ~p and ~q. 


CAMFA Translating from English to Symbols: But and Neither-Nor 


Write each of the following sentences symbolically, letting h = “It is hot” and s = “It is 
sunny.” 


a. It is not hot but it is sunny. 
b. It is neither hot nor sunny. 


Solution 


a. The given sentence is equivalent to “It is not hot and it is sunny,” which can be written 
symbolically as ~h/ s. 


b. To say it is neither hot nor sunny means that it is not hot and it is not sunny. Therefore, 
the given sentence can be written symbolically as ~h A ~s. E 
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The notation for inequalities involves and and or statements. For instance, if x, a, and b are 
particular real numbers, then 


3 i =a means x< or x=a 
Note The point of speci- id 
fying x, a, and b to be asx=b means a=x and xb. 
particular real numbers 
is to ensure that sentences 
such as “x < a” and Note that the inequality 2 = x = 1 is not satisfied by any real numbers because 
“x = b” are either true or 
false and hence that they 
are statements, 25451 means 2% and x= 1, 


and this is false no matter what number x happens to be. 


CASA Ana, Or, and Inequalities 


Suppose x is a particular real number. Let p, q, and r symbolize “0 < x,” “x < 3,’ and “x = 3,” 
respectively. Write the following inequalities symbolically: 


ax=3 bO0<x<3 c 0O<x33 
Solution 
aqvr b. pAq c. pA(qvVr) E 


Truth Values 


In Examples 2.1.2 and 2.1.3 we built compound sentences out of component statements 
and the terms not, and, and or. If such sentences are to be statements, however, they must 
have well-defined truth values—they must be either true or false. We now define such 
compound sentences as statements by specifying their truth values in terms of the state- 
ments that compose them. 


Note Think of negation 

like this: Definition 
The negation of a 
statement is a state- 


If p is a statement variable, the negation of p is “not p” or “It is not the case that p” 


ment that exactly ex- and is denoted ~p. It has opposite truth value from p: if p is true, ~p is false; if p is 
presses what it would false, ~p is true. 

mean for the statement 

to be false. 


The truth values for negation are summarized in a truth table. 


Truth Table for ~p 
P | œp 
T 
F T 


In ordinary language the sentence “It is hot and it is sunny” is understood to be true 
when both conditions—being hot and being sunny—are satisfied. If it is hot but not 
sunny, or sunny but not hot, or neither hot nor sunny, the sentence is understood to be 
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Note The only way for 
an and statement to be 
true is for both compo- 
nents to be true. So in 
the truth table for an and 
statement the first row is 
the only row with a T. 


Note The statement 

“2 = 2” means that 2 is 

less than 2 or 2 equals 2. 
It is true because 2 = 2. 
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false. The formal definition of truth values for an and statement agrees with this gen- 
eral understanding. 


Definition 


If p and q are statement variables, the conjunction of p and q is “p and q,” denoted 
p ^q. It is true when, and only when, both p and q are true. If either p or q is false, 
or if both are false, p ^ q is false. 


The truth values for conjunction can also be summarized in a truth table. The table is 
obtained by considering the four possible combinations of truth values for p and q. Each 
combination is displayed in one row of the table; the corresponding truth value for the whole 
statement is placed in the right-most column of that row. Note that the only row containing 
a T is the first one because an and statement is true only when both components are true. 


Truth Table for p ^ q 
P q PAG 
T T T 
T F F 
F T F 
F F F 


The order of truth values for p and q in the table above is TT, TF, FT, FF. It is not abso- 
lutely necessary to write the truth values in this order, although it is customary to do so. So 
please use this order for all truth tables involving two statement variables. Example 2.1.5 
shows the standard order for truth tables that involve three statement variables. 

In the case of disjunction—statements of the form “p or q’—intuitive logic offers two 
alternative interpretations. In ordinary language or is sometimes used in an exclusive sense 
(p or q but not both) and sometimes in an inclusive sense (p or q or both). A waiter who says 
you may have “coffee, tea, or milk” uses the word or in an exclusive sense: Extra payment 
is generally required if you want more than one beverage. On the other hand, a waiter who 
offers “cream or sugar” uses the word or in an inclusive sense: You are entitled to both 
cream and sugar if you wish to have them. 

Mathematicians and logicians avoid possible ambiguity about the meaning of the word 
or by understanding it to mean the inclusive “and/or.” The symbol v comes from the Latin 
word vel, which means or in its inclusive sense. To express the exclusive or, the phrase p 
or q but not both is used. 


Definition 


If p and q are statement variables, the disjunction of p and q is “p or q,’ denoted 
p V q. It is true when either p is true, or q is true, or both p and q are true; it is false 
only when both p and q are false. 
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Here is the truth table for disjunction: 


Note The only way for Truth Table for p V q 
an or statement to be false 
is for both components to P q Pd 
be false. So in the truth T T T 
table for an or statement 
the last row is the only T F T 
row with an F. 

F T T 

F F F 


Evaluating the Truth of More General Compound Statements 


Note Java, C, and C++ Now that truth values have been assigned to ~p, p ^ q, and p V q, consider the question of 


use the following assigning truth values to more complicated expressions suchas ~p V q, (pV Q \~(D ^q), 
notations: and (p ^q)v r. Such expressions are called statement forms (or propositional forms). 
za ! The close relationship between statement forms and Boolean expressions is discussed in 
Section 2.4. 
A && 


M | Definition 


A statement form (or propositional form) is an expression made up of statement 
variables (such as p, q, and r) and logical connectives (such as ~, ^, and v) that 
becomes a statement when actual statements are substituted for the component state- 
ment variables. The truth table for a given statement form displays the truth values 
that correspond to all possible combinations of truth values for its component state- 
ment variables. 


To compute the truth values for a statement form, follow rules similar to those used 
to evaluate algebraic expressions. For each combination of truth values for the statement 
variables, first evaluate the expressions within the innermost parentheses, then evaluate 
the expressions within the next innermost set of parentheses, and so forth, until you have 
the truth values for the complete expression. 


Truth Table for Exclusive Or 


Note Exclusive or is Construct the truth table for the statement form (p v q) ^ ~(p ^ q). Note that when or is 
often symbolized as used in its exclusive sense, the statement “p or g” means “p or q but not both” or “p or q and 
p ® q or p XOR q. not both p and q,” which translates into symbols as (p v q) \ ~(p ^ q). 


Solution Set up columns labeled p, q, p V q,p ^q, ~(p ^q), and (py) \~(p Aq). 
Fill in the p and q columns with all the logically possible combinations of T’s and F’s. 
Then use the truth tables for v and ^ to fill in the p v q and p ^ q columns with the ap- 
propriate truth values. Next fill in the ~(p ^ q) column by taking the opposites of the truth 
values for p A q. For example, the entry for ~(p ^ q) in the first row is F because in the 
first row the truth value of p ^ q is T. Finally, fill in the (p v q) \ ~(p ^ q) column by con- 
sidering the truth values for an and statement together with the truth values for p V q and 
~(p ^ q). Since an and statement is true only when both components are true and since 
rows 2 and 3 are the only two rows where both p v q and ~(p ^ q) are true, put T in rows 
2 and 3 and F in the remaining rows. 
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Note To fill out a truth 
table for an and state- 
ment, first put a T in each 
row where both com- 
ponents are true; then 

put an F in each of the 
remaining rows. 


Note To fill out a truth 
table for an or statement, 
first put an F in each row 
where both components 
are false; then put a T in 
each of the remaining 
rows. 
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Truth Table for Exclusive Or: (p V Q A ~(p \q) 


P 4 |PVI PAT ~PAD | CPVODA~PAD 

T T T T F F 

T F T F T T 

F T T F T T 

F F F F T F n 
Truth Table for (p a q) v ~r 


Construct a truth table for the statement form (p ^ q) V ~r. 


Solution Make columns headed p, q, r, p ^ q, ~r, and (p A q) v ~r. Enter the eight logi- 
cally possible combinations of truth values for p, q, and r in the three left-most columns. 
Then fill in the truth values for p ^ q and for ~r. Complete the table by considering the 
truth values for (p ^ q) and for ~r and the definition of an or statement. Since an or state- 
ment is false only when both components are false, the only rows in which the entry is F 
are the third, fifth, and seventh rows because those are the only rows in which the expres- 
sions p ^q and ~r are both false. The entry for all the other rows is T. 


™ 


P pavor 


Q 
l 
` 


amlhafm|/a)alalal/als 
amlhaljalalulaialalos 
mlalm)/alalalafa 
alhafm| al aiaiHalal> 
Alalala ala} sa|o 
alalama] 


The essential point about assigning truth values to compound statements is that it allows 
you—using logic alone—to judge the truth of a compound statement on the basis of your 
knowledge of the truth of its component parts. Logic does not help you determine the truth 
or falsity of the component statements. Rather, logic helps link these separate pieces of 
information together into a coherent whole. 


Logical Equivalence 


The statements 

6 is greater than2 and 2 is less than 6 
are two different ways of saying the same thing. Why? Because of the definition of the 
phrases greater than and less than. By contrast, although the statements 


(1) Dogs bark and cats meow and (2) Cats meow and dogs bark 
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are also two different ways of saying the same thing, the reason has nothing to do with 
the definition of the words. It has to do with the logical form of the statements. Any two 
statements whose logical forms are related in the same way as (1) and (2) would either 
both be true or both be false. You can see this by examining the following truth table, 
where the statement variables p and q are substituted for the component statements “Dogs 
bark” and “Cats meow,” respectively. The table shows that for each combination of truth 
values for p and q, p ^q is true when, and only when, q ^ p is true. In such a case, the 
statement forms are called logically equivalent, and we say that (1) and (2) are logically 
equivalent statements. 


“imaja 
mM Alnmiais 


. 


p ^q and q ^ p always 
have the same truth 
values, so they are 
logically equivalent 


Definition 


Two statement forms are called logically equivalent if, and only if, they have iden- 
tical truth values for each possible substitution of statements for their statement 
variables. The logical equivalence of statement forms P and Q is denoted by writing 
P= Q. 

Two statements are called logically equivalent if, and only if, they have logi- 
cally equivalent forms when identical component statement variables are used to 
replace identical component statements. 


Testing Whether Two Statement Forms P and Q Are Logically Equivalent 


1. Construct a truth table with one column for the truth values of P and another 
column for the truth values of Q. 

2. Check each combination of truth values of the statement variables to see whether 
the truth value of P is the same as the truth value of Q. 


a. If in each row the truth value of P is the same as the truth value of Q, then P 
and Q are logically equivalent. 


b. Ifin some row P has a different truth value from Q, then P and Q are not logi- 
cally equivalent. 


Double Negative Property: ~(~ p) = p 


Construct a truth table to show that the negation of the negation of a statement is logically 
equivalent to the statement, annotating the table with a sentence of explanation. 
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Solution 
p =p = =p) 
T F T 
F T F 
t ij 


p and ~(~p) always have 
the same truth values, so they 
are logically equivalent 


There are two ways to show that statement forms P and Q are not logically equivalent. 
As indicated previously, one is to use a truth table to find rows for which their truth values 
differ. The other way is to find concrete statements for each of the two forms, one of which 
is true and the other of which is false. The next example illustrates both of these ways. 


Se yaw Showing Nonequivalence 


Show that the statement forms ~(p ^ q) and ~p ^ ~q are not logically equivalent. 


Solution 
a. This method uses a truth table annotated with a sentence of explanation. 
P 4) ~P ~q PM “PAD ~PAN~4 
T T F F T F F 
T F F T F T F F 
F T T F F T F F 
F F T T F T T 
t J 


~(p \q) and ~p A ~q have 
different truth values in rows 2 and 3, 
so they are not logically equivalent 


b. This method uses an example to show that ~(p ^ q) and ~p ^ ~4q are not logically 
equivalent. Let p be the statement “0 < 1” and let q be the statement “1 < 0.” Then 


~(pAq) is “Itis not the case that both O < 1 and 1 < 0,” 
which is true. On the other hand, 
~pA\~q is “O41 and 140, 


which is false. This example shows that there are concrete statements you can substi- 
tute for p and q to make one of the statement forms true and the other false. Therefore, 
the statement forms are not logically equivalent. E 


ZCWJGYAR:J Negations of And and Or: De Morgan’s Laws 


For the statement “John is tall and Jim is redheaded” to be true, both components must be 
true. So for the statement to be false, one or both components must be false. Thus the nega- 
tion can be written as “John is not tall or Jim is not redheaded.” In general, the negation 
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of the conjunction of two statements is logically equivalent to the disjunction of their ne- 
gations. That is, statements of the forms ~(p ^ q) and ~p V ~q are logically equivalent. 
Check this using truth tables. 


Solution 
P â 4| ~p ~q PAW) ~PAD ~PV~E 
T T F F T F F 
T F F T F T T 
F T T F F T T 
F F T T F T T 
DO, 
~(p ^q) and ~p V ~q always 
have the same truth values, so they 
are logically equivalent 
Symbolically, 


“pAg= ~p~. 


In the exercises at the end of this section you are asked to show the analogous law that 
the negation of the disjunction of two statements is logically equivalent to the conjunction 
of their negations: 


~(PVQ)=~P\~4. 


Paul Fearn/Alamy Stock Photo 


The two logical equivalences of Example 2.1.8 are known as De Morgan’s laws of 
logic in honor of Augustus De Morgan, who was the first to state them in formal math- 
ematical terms. 


Augustus De Morgan 
(1806-1871) 


De Morgan’s Laws 


The negation of an and statement is logically equivalent to the or statement in which 
each component is negated. 

The negation of an or statement is logically equivalent to the and statement in 
which each component is negated. 


Applying De Morgan’s Laws 


Write negations for each of the following statements: 

a. John is 180 centimeters tall and he weighs at least 90 kilograms. 

b. The bus was late or Tom’s watch was slow. 

Solution 

a. John is not 180 centimeters tall or he weighs less than 90 kilograms. 


b. The bus was not late and Tom’s watch was not slow. 


Since the statement “neither p nor q”? means the same as “~p and ~q,” an alternative 
answer for (b) is “Neither was the bus late nor was Tom’s watch slow.” E 
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Example 2.1.10 


A 


Caution! The negation of 
=] <x= 4 is. not 

—1 x #4. It is also not 
=| >a, 


Example 2.1.11 
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If x is a particular real number, saying that x is not less than 2 (x £ 2) means that x does 
not lie to the left of 2 on the number line. This is equivalent to saying that either x = 2 or 
x lies to the right of 2 on the number line (x = 2 or x > 2). Hence, 


x £2 isequivalentto x= 2. 
Pictorially, 
-2 -1 0 1 2 a 4 5 
t t t t t 
If x £ 2, then x lies 
in the shaded region. 
Similarly, 
x2 isequivalentto x=2, 
x2 isequivalentto x > 2, and 
x#2 isequivalentto x <2. 


Inequalities and De Morgan’s Laws 
Use De Morgan’s laws to write the negation of —1 < x = 4. 
Solution The given statement is equivalent to 

-l<x and x=4. 
By De Morgan’s laws, the negation is 

—l¢<x or x #4, 
which is equivalent to 

-l2x or x>4. 


Pictorially, if — 1 = x or x > 4, then x lies in the shaded region of the number line, as 
shown below. 


-2 Aa 0 1 2 3 2 4 5 6 
-~ ee = 


De Morgan’s laws are frequently used in writing computer programs. For instance, sup- 
pose you want your program to delete all files modified outside a certain range of dates, 
say from date | through date 2 inclusive. You would use the fact that 


~(datel = file_modification_date = date?) 
is equivalent to 


(file_modification_date < datel) or (date2 < file_modification_date). 


A Cautionary Example 
According to De Morgan’s laws, the negation of 


p: Jim is tall and Jim is thin 


is ~p: Jim is not tall or Jim is not thin 


because the negation of an and statement is the or statement in which the two components 
are negated. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


48 CHAPTER2 THELOGIC OF COMPOUND STATEMENTS 


Unfortunately, a potentially confusing aspect of the English language can arise 
when you are taking negations of this kind. Note that statement p can be written more 
compactly as 


p": Jim is tall and thin. 
When it is so written, another way to negate it is 


A ~(p’): Jim is not tall and thin. 


Caution! Although the But in this form the negation looks like an and statement. Doesn’t that violate De Morgan’s 


laws of logic are extremely laws? 

useful, they should be Actually no violation occurs. The reason is that in formal logic the words and and or 
used as an aid to think- are allowed only between complete statements, not between sentence fragments. So when 
ing, not as a mechanical you apply De Morgan’s laws, you must have complete statements on either side of each and 
substitute for it. and on either side of each or. a 


Tautologies and Contradictions 


It has been said that all of mathematics reduces to tautologies. Although this is formally 
true, most working mathematicians think of their subject as having substance as well as 
form. Nonetheless, an intuitive grasp of basic logical tautologies is part of the equipment 
of anyone who reasons with mathematics. 


Definition 


A tautology is a statement form that is always true regardless of the truth values of 
the individual statements substituted for its statement variables. A statement whose 
form is a tautology is a tautological statement. 

A contradication is a statement form that is always false regardless of the truth 
values of the individual statements substituted for its statement variables. A state- 
ment whose form is a contradiction is a contradictory statement. 


According to this definition, the truth of a tautological statement and the falsity of a 
contradictory statement are due to the logical structure of the statements themselves and 
are independent of the meanings of the statements. 


EU weave Tautologies and Contradictions 


Show that the statement form p V ~p is a tautology and that the statement form p ^ ~p is 
a contradiction. 


Solution 


~P PVP P NTP 


T T F 
T F 
all T’s, so all F’s, so 
pepis pA\~pisa 
a tautology contradiction E 
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Example 2.1.13 


2.1 


LOGICAL FORM AND LOGICAL EQUIVALENCE 


Logical Equivalence Involving Tautologies and Contradictions 


If t is a tautology and c is a contradiction, show that p ^ t = pandpAc=c. 


Solution 


t pt P c prc 
T T T T F F 
T F F F F 
t ii i i} 
same truth same truth 
values, so values, so 
pAt=p pve=e 


Summary of Logical Equivalences 


49 


Knowledge of logically equivalent statements is very useful for constructing arguments. 
It often happens that it is difficult to see how a conclusion follows from one form of a 
statement, whereas it is easy to see how it follows from a logically equivalent form of 
the statement. A number of logical equivalences are summarized in Theorem 2.1.1 for 
future reference. 


— 
— 


NO Go Sl ae go SS > 


> 


Commutative laws: 
Associative laws: 
Distributive laws: 
Identity laws: 
Negation laws: 
Double negative law: 


Idempotent laws: 


Universal bound laws: 


De Morgan’s laws: 


Absorption laws: 


. Negations of t and c: 


Theorem 2.1.1 Logical Equivalences 


p^q=q^p 


(DAQAr=PpAQAr) 


PAGVN=CAQDV(PAL) 


p^t=p 
py~p=t 

~(~p) =p 
P\p=pPp 

pvt=t 
~(D\q)=~PpV~q 
PV(PAQ =P 
~t=c 


Given any statement variables p, q, and r, a tautology t and a contradiction c, the following logical equivalences 
hold. 


tl, 


PV4=4VP 

(PV qQ)Vr=pVQqvnr) 
PVGAN=CVQDACVN) 
pVc=p 

p\~p=e 


DN DEH 
pAc=c 
~pVg=~prA~q 
P\(PVQ)=P 
~c=t 


The proofs of laws 4 and 6, the first parts of laws | and 5, and the second part of law 9 


have already been given as examples in the text. Proofs of the other parts of the theorem 
are left as exercises. In fact, it can be shown that the first five laws of Theorem 2.1.1 form 
a core from which the other laws can be derived. The first five laws are the axioms for a 
mathematical structure known as a Boolean algebra, which is discussed in Section 6.4. 
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The equivalences of Theorem 2.1.1 are general laws of thought that occur in all areas of 
human endeavor. They can also be used in a formal way to rewrite complicated statement 
forms more simply. 


Simplifying Statement Forms 


Use Theorem 2.1.1 to verify the logical equivalence 
~(PADNPVD =P. 


Solution Use the laws of Theorem 2.1.1 to replace sections of the statement form on 
the left by logically equivalent expressions. Each time you do this, you obtain a logically 
equivalent statement form. Continue making replacements until you obtain the statement 
form on the right. 


~CPADNPVD = CCOPV~DACV® by De Morgan’s laws 
= (PV~QAPVG by the double negative law 
= (pv(-qAq) by the distributive law 
= pV (q\~q by the commutative law for /\ 
= pve by the negation law 
= p by the identity law E 


Skill in simplifying statement forms is useful in constructing logically efficient computer 
programs and in designing digital logic circuits. 

Although the properties in Theorem 2.1.1 can be used to prove the logical equivalence 
of two statement forms, they cannot be used to prove that statement forms are not logically 
equivalent. On the other hand, truth tables can always be used to determine both equiva- 
lence and nonequivalence, and truth tables are easy to program on a computer. When 
truth tables are used, however, checking for equivalence always requires 2” steps, where 
n is the number of variables. Sometimes you can quickly see that two statement forms are 
equivalent by Theorem 2.1.1, whereas it would take quite a bit of calculating to show their 
equivalence using truth tables. For instance, it follows immediately from the associative 
law for ^ that p ^ (~q A ~r) = (p A ~q) ^ ~r, whereas a truth table verification requires 
constructing a table with eight rows. 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. An and statement is true when, and only when, statement in which each component is __, and 
both components are___. (2) that the negation of an or statement is logically 
equivalent to the ______ statement in which each 


2. Anor statement is false when, and only when, : 
component is 

both components are 

i ; 5. A tautology is a statement that is always 

3. Two statement forms are logically equivalent 8y y 


when, and only when, they always have —— 6. A contradiction is a statement that is always 


4. De Morgan’s laws say (1) that the negation of an 
and statement is logically equivalent to the 
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EXERCISE SET 2.1* 


2.1 LOGICAL FORM AND LOGICAL EQUIVALENCE 51 


In each of 1-4 represent the common form of each argu- 
ment using letters to stand for component sentences, and 
fill in the blanks so that the argument in part (b) has the 
same logical form as the argument in part (a). 


1. 


a. If all integers are rational, then the number 1 
is rational. 
All integers are rational. 
Therefore, the number | is rational. 

b. If all algebraic expressions can be written in 
prefix notation, then 


Therefore, (a + 2b)\(a? — b) can be written in 
prefix notation. 


. a. If all computer programs contain errors, then 


this program contains an error. 
This program does not contain an error. 
Therefore, it is not the case that all computer 
programs contain errors. 

b. If _____, then ______. 
2 is not odd. 
Therefore, it is not the case that all prime 
numbers are odd. 


. a. This number is even or this number is odd. 


This number is not even. 
Therefore, this number is odd. 

b. _____ or logic is confusing. 
My mind is not shot. 
Therefore, 


a. Ifthe program syntax is faulty, then the com- 
puter will generate an error message. 

If the computer generates an error message, 
then the program will not run. 

Therefore, if the program syntax is faulty, then 
the program will not run. 

b. If this simple graph ______, then it is complete. 
If this graph , then any two of its verti- 
ces can be joined by a path. 

Therefore, if this simple graph has 4 vertices 
and 6 edges, then _____. 


Indicate which of the following sentences are 

statements. 

a. 1,024 is the smallest four-digit number that is a 
perfect square. 

b. She is a mathematics major. 


c. 128 = 2° 
d. x=2° 


Write the statements in 6-9 in symbolic form using the 
symbols ~, V, and A and the indicated letters to represent 
component statements. 


6. 


10. 


Tl. 


Let s = “stocks are increasing” and i = “interest 

rates are steady.” 

a. Stocks are increasing but interest rates are steady. 

b. Neither are stocks increasing nor are interest 
rates steady. 


Juan is a math major but not a computer science 
major. (m = “Juan is a math major,” c = “Juan is a 
computer science major”) 


. Let h = “John is healthy,’ w = “John is wealthy,” 


and s = “John is wise.” 

a. John is healthy and wealthy but not wise. 

b. John is not wealthy but he is healthy and wise. 

c. John is neither healthy, wealthy, nor wise. 

d. John is neither wealthy nor wise, but he is healthy. 
e. John is wealthy, but he is not both healthy and wise. 


Let p = “x > 57 q = “x = 5 andr = “10 > x.” 
a x25 

b. 10 >x>5 

c. 1O>x=5 


Let p be the statement “DATAENDFLAG is off,” 

q the statement “ERROR equals 0,” and r the 

statement “SUM is less than 1,000.” Express the 

following sentences in symbolic notation. 

a. DATAENDFLAG is off, ERROR equals 0, and 
SUM is less than 1,000. 

b. DATAENDFLAG is off but ERROR is not 
equal to 0. 

c. DATAENDFLAG is off; however, ERROR is 
not 0 or SUM is greater than or equal to 1,000. 

d. DATAENDFLAG is on and ERROR equals 0 
but SUM is greater than or equal to 1,000. 

e. Either DATAENDFLAG is on or it is the case 
that both ERROR equals 0 and SUM is less 
than 1,000. 


In the following sentence, is the word or used in 
its inclusive or exclusive sense? A team wins the 
playoffs if it wins two games in a row or a total of 
three games. 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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Write truth tables for the statement forms in 12-15. 


B. ~PAQV (PV) 
15. DA(~qvVr) 


12. ~p ^q 
14. p^ lq ^r) 
Determine whether the statement forms in 16-24 are 
logically equivalent. In each case, construct a truth table 
and include a sentence justifying your answer. Your sen- 


tence should show that you understand the meaning of 
logical equivalence. 


16. pV (p Aq) and p 

17. ~(p ^q) and ~p ^ ~q 

18. pvtandt 

19. p\tandp 

20. p\candpve 

21. (pAg A^Arandp^r lq ^r) 

22. p\ (qv rjand(pAqg)V parn 
23. (pAgVrandpA(qvn 


24. (pVgV(pAnand(pvgAr 


Use De Morgan’s laws to write negations for the state- 
ments in 25-30. 


25. Hal is a math major and Hal’s sister is a computer 
science major. 


26. Sam is an orange belt and Kate is a red belt. 


27. The connector is loose or the machine is 
unplugged. 


28. The train is late or my watch is fast. 


29. This computer program has a logical error in the 
first ten lines or it is being run with an incomplete 
data set. 


30. The dollar is at an all-time high and the stock 
market is at a record low. 


31. Let s be a string of length 2 with characters from 
{0, 1, 2}, and define statements a, b, c, and d as 
follows: 


a = “the first character of s is 0” 
b = “the first character of s is 1” 
c = “the second character of s is 1” 


d = “the second character of s is 2”. 
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Describe the set of all strings for which each of 
the following is true. 


a. (aVb)A(cV ad) 
b. (~avb)) A(cvV da) 
c. (~a) v b) A (cv (~d) 


Assume x is a particular real number and use De Morgan’s 
laws to write negations for the statements in 32-37. 


32. -2 <x <7 33. -—10<x<2 


34. x<2orx>5 35. x=-lorx>1 


36. l1>x=-3 37. 0 > x 2-7 


In 38 and 39, imagine that num_orders and num_instock 
are particular values, such as might occur during execu- 
tion of a computer program. Write negations for the 
following statements. 


38. (num_orders > 100 and num_instock = 500) or 
num_instock < 200 


39. (num_orders < 50 and num_instock > 300) or 
(50 < num_orders < 75 and num_instock > 500) 


Use truth tables to establish which of the statement 
forms in 40-43 are tautologies and which are 
contradictions. 


40. (PAD V EPV (PA ~9) 
41. (pA ~q) A(~pv gq) 
42. (~PADAGAN)A~¢ 


43. (~pV QV (PA ~@) 


44. Recall that a < x < b means that a < x and 
x < b. Also a = b means that a < bora = b. 
Find all real numbers that satisfy the following 
inequalities. 


a. 2<x<0 b. l=x<-l 


45. Determine whether the statements in (a) and (b) 
are logically equivalent. 

a. Bob is both a math and computer science 
major and Ann is a math major, but Ann is not 
both a math and computer science major. 

b. It is not the case that both Bob and Ann are 
both math and computer science majors, 
but it is the case that Ann is a math major 
and Bob is both a math and computer 
science major. 
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*46. Let the symbol Ð denote exclusive or; so In 48 and 49 below, a logical equivalence is derived from 
pBq=(PVqQgA~(pAQq. Hence the truth Theorem 2.1.1. Supply a reason for each step. 


table for p © q is as follows: 
re 48. (PA~QV(PAD=PACqV@ by@ 


P 4 | pq =pA@v~q) oe 
=p^t c 
T T F i vO 
=p by (d) 
T F T Therefore, (p \ ~q) V (p ^Q =p. 
F T T 49. (pV ~q) \(~pV ~g) 
F F F =(~qvp)\(~qv~p) bya 
= ~q V (p\~p) by (b) 
a. Find simpler statement forms that are logically =~qVve by ©) 
equivalent to p © p and (p ® p) @ p. =~q by @ 
b. Is (pq) r= p Og GP)? Justify your Therefore, (p V ~q) \(~p V ~q) = ~q. 


ASRS Use Theorem 2.1.1 to verify the logical equivalences in 


c. Is(p DQ) Ar= (p^r) q^ r)? Justify 50-54. Supply a reason for each step. 
your answer. 


50. (pA~qg) Vp =p 51. pA\(~qVp)=P 
*47. In logic and in standard English, a double negative 


is equivalent to a positive. There is one fairly com- 52. ~(pV ~g)V (~p\~q) = ~p 

mon English usage in which a “double positive” is = 
53. ~(~pA ph ^q) = 

equivalent to a negative. What is it? Can you think CP AKOVE?P DV OAD) =p 


of others? 54. (DA(~(~PV@)) V(PAQ =P 


ANSWERS FOR TEST YOURSELF 


1. true 2.false 3.the same truth values 4. or; negated; and; negated 5.true 6. false 


Conditional Statements 


... hypothetical reasoning implies the subordination of the real to the realm of the 
possible ... —Jean Piaget, 1972 


When you make a logical inference or deduction, you reason from a hypothesis to a con- 
clusion. Your aim is to be able to say, “Zf such and such is known, then something or other 
must be the case.” 

Let p and q be statements. A sentence of the form “If p then q” is denoted symbolically 
by “p — q”; p is called the hypothesis and q is called the conclusion. For instance, consider 
the following statement: 


If 4,686 is divisible by 6, then 4,686 is divisible by 3 
hypothesis conclusion 
Such a sentence is called conditional because the truth of statement q is conditioned on the 
truth of statement p. 

The notation p — q indicates that — is a connective, like ^ or v, which can be used 
to join statements to create new statements. To define p — q as a statement, therefore, 
we must specify the truth values for p — q as we specified truth values for p ^ q and for 
p V q. As is the case with the other connectives, the formal definition of truth values 
for — (if-then) is based on its everyday, intuitive meaning. Consider an example. 
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Suppose you go to interview for a job at a store and the owner of the 
store makes you the following promise: 


If you show up for work Monday morning, then you will get the job. 


Under what circumstances are you justified in saying the owner spoke 
falsely? That is, under what circumstances is the above sentence false? 
The answer is: You do show up for work Monday morning and you do 
not get the job. 


After all, the owner’s promise only says you will get the job if a certain 
condition (showing up for work Monday morning) is met; it says noth- 
ing about what will happen if the condition is not met. So if the condi- 
tion is not met, you cannot in fairness say the promise is false regard- 
less of whether or not you get the job. 


The above example was intended to convince you that the only combination of circum- 
stances in which you would call a conditional sentence false occurs when the hypothesis 
is true and the conclusion is false. In all other cases, you would not call the sentence false. 
This implies that the only row of the truth table for p — q that should be filled in with an 
F is the row where p is T and q is F. No other row should contain an F. But each row of a 
truth table must be filled in with either a T or an F. Thus all other rows of the truth table 
for p — q must be filled in with T’s. 


Truth Table for p > q 
P 4 | P>4 
T T T 
T F F 
F T T 
F F T 


Definition 


If p and q are statement variables, the conditional of q by p is “If p then q” or 
“p implies q” and is denoted p — q. It is false when p is true and q is false; other- 
wise it is true. We call p the hypothesis (or antecedent) of the conditional and q the 
conclusion (or consequent). 


A conditional statement that is true by virtue of the fact that its hypothesis is false is of- 
ten called vacuously true or true by default. Thus the statement “If you show up for work 
Monday morning, then you will get the job” is vacuously true if you do not show up for 
work Monday morning. In general, when the “if” part of an if-then statement is false, the 
statement as a whole is said to be true, regardless of whether the conclusion is true or false. 


CANIF] A Conditional Statement with a False Hypothesis 
Consider the statement 
If 0 = 1 then 1 = 2. 


As strange as it may seem, since the hypothesis of this statement is false, the statement 
as a whole is true. a 
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Note For example, if you 
hypothesize that 0 = 1, 
then, by adding 1 to both 
sides of the equation, you 
can deduce that 1 = 2. 


Note The only rows 

in which the hypothesis 

p v ~q is true and the 
conclusion ~p is false are 
the first and second rows. 
So you put F’s in those 
two rows and T’s in the 
other two rows. 
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The philosopher Willard Van Orman Quine advises against using the phrase “p implies 
q” to mean “p — q” because the word implies suggests that q can be logically deduced 
from p and this is often not the case. Nonetheless, the phrase is used by many people, prob- 
ably because it is a convenient replacement for the — symbol. And, of course, in many 
cases a conclusion can be deduced from a hypothesis, even when the hypothesis is false. 

In expressions that include — as well as other logical operators such as A, v, and ~, 
the order of operations is that — is performed last. Thus, according to the specification 
of order of operations in Section 2.1, ~ is performed first, then ^ and v, and finally —. 


Truth Table for p v ~q —> ~p 
Construct a truth table for the statement form p V ~q > ~p. 


Solution By the order of operations given above, the following two expressions are 
equivalent: p v ~q —> ~p and (p v (~q)) —> (~p), and this order governs the construc- 
tion of the truth table. First fill in the four possible combinations of truth values for p and 
q, and then enter the truth values for ~p and ~q using the definition of negation. Next fill 
in the p v ~q column using the definition of v. Finally, fill in the p V ~q —> ~p column 
using the definition of —. 


conclusion hypothesis 
P 4q ~p ~q | PV~d PY~I>~P 
T T F F T F 
T F F T T F 
F T T F F T 
F F T T T T 


Logical Equivalences Involving — 


Imagine that you are trying to solve a problem involving three statements: p, g, and r. Sup- 
pose you know that the truth of r follows from the truth of p and also that the truth of r 
follows from the truth of q. Then no matter whether p or q is the case, the truth of r must 
follow. The division-into-cases method of analysis is based on this idea. 


Division into Cases: Showing That p Vq—>r=(p—nA(q—n) 


Use truth tables to show the logical equivalence of the statement forms p V q — r and 
(p > r) ^ (q — r). Annotate the table with a sentence of explanation. 


Solution First fill in the eight possible combinations of truth values for p, q, and r. Then 
fill in the columns for p V q, p —> r, and q — r using the definitions of or and if-then. 
For instance, the p — r column has F’s in the second and fourth rows because these are 
the rows in which p is true and r is false. Next fill in the p V q —> r column using the 
definition of if-then. The rows in which the hypothesis p v q is true and the conclusion r is 
false are the second, fourth, and sixth. So F’s go in these rows and T’s in all the others. The 
complete table shows that p V q — r and (p —> r) ^ (q — r) have the same truth values 
for each combination of truth values of p, g, and r. Hence the two statement forms are 
logically equivalent. 
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P q r PV por q>r pvq->r (pP>nNAQ—r) 
T T T T T T T T 
T T F T F F F F 
T F T T ab T T T 
Ï F F T F T F F 
F T T T T T T T 
F T F T T F F F 
F F T F T T T T 
F F F F T T T T 
t t 


PpVq-—rand(p—>r)A(q—>n) 
always have the same truth values, 
so they are logically equivalent E 


Representation of If-Then as Or 
In exercise 13(a) at the end of this section you are asked to use truth tables to show that 
P?qd>=~PVG. 


The logical equivalence of “if p then q” and “not p or q” is occasionally used in every- 
day speech. Here is one instance. 


SEWISE Application of the Equivalence between ~p v q and p—>q 
Rewrite the following statement in if-then form. 


Either you get to work on time or you are fired. 


Solution Let ~p be 
You get to work on time. 

and q be 
You are fired. 

Then the given statement is ~p v q. Also p is 
You do not get to work on time. 

So the equivalent if-then version, p — 4, is 


If you do not get to work on time, then you are fired. a 


The Negation of a Conditional Statement 


By definition, p — q is false if, and only if, its hypothesis, p, is true and its conclusion, q, 
is false. It follows that 


The negation of “if p then q” is logically equivalent to “p and not q.” 
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A 


Caution! Remember that 
the negation of an 

if-then statement does 
not start with the word if. 
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This can be restated symbolically as follows: 


~p >q) =p\~q 


To obtain this result you can also start from the logical equivalence p > q = ~p V q. Take 
the negation of both sides to obtain 
~P>9=~CpVva) 
= ~(~p) /\ (~q) by De Morgan’s laws 


PA=g by the double negative law. 


Yet another way to derive this result is to construct truth tables for ~(p — q) and for 
p^ ~q and to check that they have the same truth values. (See exercise 13(b) at the end 
of this section.) 


Negations of If-Then Statements 

Write negations for each of the following statements: 

a. If my car is in the repair shop, then I cannot get to class. 
b. If Sara lives in Athens, then she lives in Greece. 
Solution 

a. My car is in the repair shop and I can get to class. 


b. Sara lives in Athens and she does not live in Greece. (Sara might live in Athens, 
Georgia; Athens, Ohio; or Athens, Wisconsin.) | 


It is tempting to write the negation of an if-then statement as another if-then statement. 
Please resist that temptation! 


The Contrapositive of a Conditional Statement 


One of the most fundamental laws of logic is the equivalence between a conditional state- 
ment and its contrapositive. 


Definition 


The contrapositive of a conditional statement of the form “If p then q” is 
If ~q then ~p. 
Symbolically, 


The contrapositive of p — q is ~q — ~p. 


The fact is that 


A conditional statement is logically equivalent to its contrapositive. 


You are asked to establish this equivalence in exercise 26 at the end of this section. 
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Example 2.2.6 


Example 2.2.7 


A 


Caution! Many people 
believe that if a condition- 
al statement is true, then 
its converse and inverse 
must also be true. This is 
not correct! The converse 
might be true, but it does 
not have to be true. 


Writing the Contrapositive 

Write each of the following statements in its equivalent contrapositive form: 

a. If Howard can swim across the lake, then Howard can swim to the island. 

b. If today is Easter, then tomorrow is Monday. 

Solution 

a. If Howard cannot swim to the island, then Howard cannot swim across the lake. 


b. If tomorrow is not Monday, then today is not Easter. E 


When you are trying to solve certain problems, you may find that the contrapositive 
form of a conditional statement is easier to work with than the original statement. Replac- 
ing a statement by its contrapositive may give the extra push that helps you over the top 
in your search for a solution. This logical equivalence is also the basis for one of the most 
important laws of deduction, modus tollens (to be explained in Section 2.3), and for the 
contrapositive method of proof (to be explained in Section 4.7). 


The Converse and Inverse of a Conditional Statement 


The fact that a conditional statement and its contrapositive are logically equivalent is very 
important and has wide application. Two other variants of a conditional statement are not 
logically equivalent to the statement. 


Definition 
Suppose a conditional statement of the form “If p then q” is given. 


1. The converse is “If q then p.” 
2. The inverse is “If ~p then ~q.” 


Symbolically, 
The converse of p—q is q—> p, 
and 


The inverse of p—>q is ~p > ~q. 


Writing the Converse and the Inverse 
Write the converse and inverse of each of the following statements: 
a. If Howard can swim across the lake, then Howard can swim to the island. 


b. If today is Easter, then tomorrow is Monday. 
Solution 


a. Converse: If Howard can swim to the island, then Howard can swim across the lake. 


Inverse: If Howard cannot swim across the lake, then Howard cannot swim to 
the island. 


b. Converse: If tomorrow is Monday, then today is Easter. 


Inverse: If today is not Easter, then tomorrow is not Monday. E 
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Note that while the statement “If today is Easter, then tomorrow is Monday” is always 
true, both its converse and inverse are false on every Sunday except Easter. 


1. A conditional statement and its converse are not logically equivalent. 
2. A conditional statement and its inverse are not logically equivalent. 


3. The converse and the inverse of a conditional statement are logically equivalent 
to each other. 


In exercises 24, 25, and 27 at the end of this section, you are asked to use truth tables 
to verify the statements in the box above. Note that the truth of statement 3 also follows 
from the observation that the inverse of a conditional statement is the contrapositive of 
its converse. 


Only If and the Biconditional 


To say “p only if q” means that p can take place only if q takes place also. That is, if 
q does not take place, then p cannot take place. Another way to say this is that if p 
occurs, then g must also occur (by the logical equivalence between a statement and its 
contrapositive). 


Definition 


If p and q are statements, 
ponlyifg means “if not q then not p,” 
or, equivalently, 


“if p then q.” 


Converting Only If to If-Then 


Rewrite the following statement in if-then form in two ways, one of which is the contra- 
positive of the other. 


John will break the world’s record for the 1,000-meter run only if he 
runs the 1,000 meters in under 2.5 minutes. 


Solution Version 1: If John does not run the 1,000 meters in under 2.5 minutes, then 
he will not break the world’s record. 
Version 2: If John breaks the world’s record, then he will have run the 


À 1,000 meters in under 2.5 minutes. E 


Caution! “p only if q” Note that it is possible for “p only if q” to be true at the same time that “p if q” is false. 

does not mean “p if q.” For instance, to say that John will break the world’s record only if he runs the 1,000 meters 
in under 2.5 minutes does not mean that John will break the world’s record if he runs the 
1,000 meters in under 2.5 minutes. His time could be under 2.5 minutes but still not be fast 
enough to break the record. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


60 


CHAPTER2 THELOGIC OF COMPOUND STATEMENTS 


Definition 


Given statement variables p and q, the biconditional of p and q is “p if, and only if, 
q” and is denoted p <+> q. It is true if both p and q have the same truth values and is 
false if p and g have opposite truth values. The words if and only if are sometimes 
abbreviated iff. 


The biconditional has the following truth table: 


Truth Table for p +> q 
P q Peq 
T T 

T F F 

F T F 

F F T 


In order of operations <> is coequal with —. As with ^ and v , the only way to indicate 
precedence between them is to use parentheses. The full hierarchy of operations for the 
five logical operators is shown below. 


Order of Operations for Logical Operators 


1 Evaluate negations first. 

2. A,V Evaluate ^ and v second. When both are present, parentheses may be 
needed. 

3. —,<> Evaluate — and <> third. When both are present, parentheses may 
be needed. 


According to the separate definitions of if and only if, saying “p if, and only if, q” should 
mean the same as saying both “p if q” and “p only if q.” The following annotated truth table 
shows that this is the case: 


Truth Table Showing That p> q = (p > q) ^ (q4 > p) 


PpP q4 p>q q4>p peq p>^q>p) 
T T T T T T 
T F F T F F 
F T T F F F 
F F T T T T 


t t 


p <> q and (p > q) ^ (q > p) 
always have the same truth values, 
so they are logically equivalent 
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If and Only if 


Rewrite the following statement as a conjunction of two if-then statements: 
This computer program is correct if, and only if, it produces correct 


answers for all possible sets of input data. 


Solution If this program is correct, then it produces the correct answers for all possible 
sets of input data; and if this program produces the correct answers for all possible sets of 
input data, then it is correct. E 


Necessary and Sufficient Conditions 


The phrases necessary condition and sufficient condition, as used in formal English, cor- 
respond exactly to their definitions in logic. 


Definition 


If r and s are statements: 


r is a sufficient condition fors means “ifr then s.” 


r is a necessary condition fors means “if not r then not s.” 


In other words, to say “r is a sufficient condition for s” means that the occurrence of r 
is sufficient to guarantee the occurrence of s. On the other hand, to say “r is a necessary 
condition for s” means that if r does not occur, then s cannot occur either: 


The occurrence of r is necessary to obtain the occurrence of s. Note that 
because of the equivalence between a statement and its contrapositive, 


ris a necessary condition for s also means “ifs then r.” 


Consequently, 


r is a necessary and sufficient condition for s means “rif, and only if, s.” 


Interpreting Necessary and Sufficient Conditions 


Consider the statement “If John is eligible to vote, then he is at least 18 years old.” The 
truth of the condition “John is eligible to vote” is sufficient to ensure the truth of the condi- 
tion “John is at least 18 years old.” In addition, the condition “John is at least 18 years old” 
is necessary for the condition “John is eligible to vote” to be true. If John were younger 
than 18, then he would not be eligible to vote. E 


ZEWG YAL Converting a Sufficient Condition to If-Then Form 
Rewrite the following statement in the form “If A then B”: 


Pia’s birth on U.S. soil is a sufficient condition 
for her to be a U.S. citizen. 


Solution If Pia was born on U.S. soil, then she is a U.S. citizen. 
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ZSWIGR TAr Converting a Necessary Condition to If-Then Form 
Use the contrapositive to rewrite the following statement in two ways: 


George’s attaining age 35 is a necessary condition for his being presi- 
dent of the United States. 


Solution Version 1: If George has not attained the age of 35, then he cannot be presi- 
dent of the United States. 

Version 2: If George can be president of the United States, then he has 

attained the age of 35. a 


Remarks 


1. In logic, a hypothesis and conclusion are not required to have related subject matters. 

In ordinary speech we never say things like “If computers are machines, then Babe 
Ruth was a baseball player” or “If 2 +2 = 5, then Mickey Mouse is president of the 
United States.” We formulate a sentence like “If p then q” only if there is some connec- 
tion of content between p and q. 

In logic, however, the two parts of a conditional statement need not have related 
meanings. The reason? If there were such a requirement, who would enforce it? What 
one person perceives as two unrelated clauses may seem related to someone else. 
There would have to be a central arbiter to check each conditional sentence before 
anyone could use it, to be sure its clauses were in proper relation. This is impractical, 
to say the least! 

Thus a statement like “if computers are machines, then Babe Ruth was a baseball 
player” is allowed, and it is even called true because both its hypothesis and its conclu- 
sion are true. Similarly, the statement “If 2 +2 = 5, then Mickey Mouse is president 
of the United States” is allowed and is called true because its hypothesis is false, even 
though doing so may seem ridiculous. 

In mathematics it often happens that a carefully formulated definition that success- 
fully covers the situations for which it was primarily intended is later seen to be satis- 
fied by some extreme cases that the formulator did not have in mind. But those are the 
breaks, and it is important to get into the habit of exploring definitions fully to seek out 
and understand all their instances, even the unusual ones. 


2. In informal language, simple conditionals are often used to mean biconditionals. 

The formal statement “p if, and only if, q” is seldom used in ordinary language. 
Frequently, when people intend the biconditional they leave out either the and only if or 
the if and. That is, they say either “p if q” or “p only if q? when they really mean “p if, 
and only if, q.” For example, consider the statement “You will get dessert if, and only 
if, you eat your dinner.” Logically, this is equivalent to the conjunction of the following 
two statements. 


Statement 1: If you eat your dinner, then you will get dessert. 
Statement 2: You will get dessert only if you eat your dinner. 
or 
If you do not eat your dinner, then you will not get dessert. 


Now how many parents in the history of the world have said to their children “You 
will get dessert if, and only if, you eat your dinner”? Not many! Most say either “If you 
eat your dinner, you will get dessert” (these take the positive approach—they empha- 
size the reward) or “You will get dessert only if you eat your dinner” (these take the 
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1. 
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EXERCISE SET 2.2 
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negative approach—they emphasize the punishment). Yet the parents who promise the 
reward intend to suggest the punishment as well, and those who threaten the punish- 
ment will certainly give the reward if it is earned. Both sets of parents expect that their 
conditional statements will be interpreted as biconditionals. 

Since we often (correctly) interpret conditional statements as biconditionals, it is 
not surprising that we may come to believe (mistakenly) that conditional statements are 
always logically equivalent to their inverses and converses. In formal settings, however, 
statements must have unambiguous interpretations. If-then statements can’t sometimes 
mean “if-then” and other times mean “if and only if.” When using language in math- 
ematics, science, or other situations where precision is important, it is essential to inter- 
pret if-then statements according to the formal definition and not to confuse them with 
their converses and inverses. 


An if-then statement is false if, and only if, the hy- 
pothesis is and the conclusion is 


The negation of “if p then q” is ____. 
The converse of “if p then q” is 

The contrapositive of “if p then q” is 
The inverse of “if p then q” is 


A conditional statement and its contrapositive 
are 


10. 


A conditional statement and its converse are 
not 


. “R is a sufficient condition for $” means “if 


then ___..” 


“R is a necessary condition for S” means “if 
then ___..” 


“R only if S” means “if ______ then 


Rewrite the statements in 1-4 in if-then form. 


1. 


2. 
3. 
4. 


This loop will repeat exactly N times if it does not 
contain a stop or a go to. 


I am on time for work if I catch the 8:05 bus. 
Freeze or Pll shoot. 


Fix my ceiling or I won’t pay my rent. 


Construct truth tables for the statement forms in 5-11. 


12. 


as a a alas) 6. (PVQV(-PAD—>4 
P\~q->>r 8 ~pVq-r 


p\N~raqvr 10. (p—>r) < (q—>r) 


-(p7eqG7>n)eWMAQD-?N 


Use the logical equivalence established in 
Example 2.2.3, pv q >r = (p—>r)^ lq >r), 


13. 


H 14. 


to rewrite the following statement. (Assume that x 
represents a fixed real number.) 


Ifx>2orx< —2, then x° > 4. 
Use truth tables to verify the following logical 


equivalences. Include a few words of explanation 
with your answers. 


a p>q=~pVvq 

b. ~p >90) =PpA~4. 

a. Show that the following statement forms are all 
logically equivalent: 


p>qVvr, pAv~q->r and pA~r—q 


b. Use the logical equivalences established in part 
(a) to rewrite the following sentence in two dif- 
ferent ways. (Assume that n represents a fixed 
integer.) 


If n is prime, then n is odd or n is 2. 
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15. Determine whether the following statement forms 
are logically equivalent: 


p->@q-n and (p>qg-r 


In 16 and 17, write each of the two statements in symbolic 
form and determine whether they are logically equiva- 
lent. Include a truth table and a few words of explanation 
to show that you understand what it means for state- 
ments to be logically equivalent. 


16. If you paid full price, you didn’t buy it at Crown 
Books. You didn’t buy it at Crown Books or you 
paid full price. 


17. If 2 is a factor of n and 3 is a factor of n, then 6 is 
a factor of n. 2 is not a factor of n or 3 is not a fac- 
tor of n or 6 is a factor of n. 


18. Write each of the following three statements in 
symbolic form and determine which pairs are 
logically equivalent. Include truth tables and a few 
words of explanation. 


If it walks like a duck and it talks like a duck, then 
it is a duck. 


Either it does not walk like a duck or it does not 
talk like a duck, or it is a duck. 


If it does not walk like a duck and it does not talk 
like a duck, then it is not a duck. 


19. True or false? The negation of “If Sue is Luiz’s 
mother, then Ali is his cousin” is “If Sue is Luiz’s 
mother, then Ali is not his cousin.” 


20. Write negations for each of the following state- 
ments. (Assume that all variables represent fixed 
quantities or entities, as appropriate.) 

a. If P is a square, then P is a rectangle. 

b. If today is New Year’s Eve, then tomorrow is 
January. 

c. If the decimal expansion of r is terminating, 
then r is rational. 

d. Ifnis prime, then n is odd or n is 2. 

e. If x is nonnegative, then x is positive or x is 0. 

f. If Tom is Ann’s father, then Jim is her uncle 
and Sue is her aunt. 

g- Ifnis divisible by 6, then n is divisible by 2 
and n is divisible by 3. 


21. Suppose that p and q are statements so that 
p — q is false. Find the truth values of each of 
the following: 
a ~p—>q 


bpVq ¢«q—p 
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H 22. Write contrapositives for the statements of 


exercise 20. 


H 23. Write the converse and inverse for each statement 


of exercise 20. 
Use truth tables to establish the truth of each statement 
in 24-27. 


24. A conditional statement is not logically equivalent 
to its converse. 


25. A conditional statement is not logically equivalent 
to its inverse. 


26. A conditional statement and its contrapositive are 
logically equivalent to each other. 


27. The converse and inverse of a conditional state- 
ment are logically equivalent to each other. 


H 28. “Do you mean that you think you can find out the 


answer to it?” said the March Hare. 


“Exactly so,” said Alice. 
“Then you should say what you mean,” the 
March Hare went on. 


“I do,” Alice hastily replied; “at least—at least 
I mean what I say—that’s the same thing, you 
know.” 

“Not the same thing a bit!” said the Hatter. 
“Why, you might just as well say that ‘I see what I 
eat’ is the same thing as ‘I eat what I see’!” 

—from “A Mad Tea-Party” in Alice in 

Wonderland, by Lewis Carroll 


The Hatter is right. “I say what I mean” is not the 
same thing as “I mean what I say.” Rewrite each 
of these two sentences in if-then form and explain 
the logical relation between them. (This exercise is 
referred to in the introduction to Chapter 4.) 
If statement forms P and Q are logically equivalent, then 
P <> Q is a tautology. Conversely, if P <> Q is a tautology, 
then P and Q are logically equivalent. Use <> to convert 
each of the logical equivalences in 29-31 to a tautology. 
Then use a truth table to verify each tautology. 


29. p> @vn=—PA~g->r 


30. PAGVr)=(PAQVPAN 


31. p> G@>n=(PAQD-r 
Rewrite each of the statements in 32 and 33 as a conjunc- 
tion of two if-then statements. 


32. This quadratic equation has two distinct real roots if, 
and only if, its discriminant is greater than zero. 


33. This integer is even if, and only if, it equals twice 
some integer. 


Rewrite the statements in 34 and 35 in if-then form in two 
ways, one of which is the contrapositive of the other. Use 
the formal definition of “only if.” 


34. The Cubs will win the pennant only if they win 
tomorrow’s game. 


35. Sam will be allowed on Signe’s racing boat only if 
he is an expert sailor. 


36. Taking the long view on your education, you go to 
the Prestige Corporation and ask what you should 
do in college to be hired when you graduate. The 
personnel director replies that you will be hired 
only if you major in mathematics or computer sci- 
ence, get a B average or better, and take account- 
ing. You do, in fact, become a math major, get a 
B+ average, and take accounting. You return to 
Prestige Corporation, make a formal application, 
and are turned down. Did the personnel director 
lie to you? 


Some programming languages use statements of the form 
“r unless s” to mean that as long as s does not happen, 
then r will happen. More formally: 


Definition: If r and s are statements, 


runlesss means if ~s then r. 


In 37-39, rewrite the statements in if-then form. 


37. Payment will be made on the fifth unless a new 
hearing is granted. 


38. Ann will go unless it rains. 


39. This door will not open unless a security code is 
entered. 


Rewrite the statements in 40 and 41 in if-then form. 


40. Catching the 8:05 bus is a sufficient condition for 
my being on time for work. 


41. Having two 45° angles is a sufficient condition for 
this triangle to be a right triangle. 


Use the contrapositive to rewrite the statements in 
42 and 43 in if-then form in two ways. 


42. Being divisible by 3 is a necessary condition for 
this number to be divisible by 9. 
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43. Doing homework regularly is a necessary condi- 
tion for Jim to pass the course. 


Note that “a sufficient condition for s is r” means r 
is a sufficient condition for s and that “a necessary 
condition for s is r” means r is a necessary condition 
for s. Rewrite the statements in 44 and 45 in if- 
then form. 


44. A sufficient condition for Jon’s team to win 
the championship is that it win the rest of 
its games. 


45. A necessary condition for this computer program 
to be correct is that it not produce error messages 
during translation. 


46. “If compound X is boiling, then its temperature 
must be at least 150°C.” Assuming that this 
statement is true, which of the following must 
also be true? 

a. Ifthe temperature of compound X is at least 
150°C, then compound X is boiling. 

b. Ifthe temperature of compound X is less than 
150°C, then compound X is not boiling. 

c. Compound X will boil only if its temperature 
is at least 150°C. 

d. If compound X is not boiling, then its tempera- 
ture is less than 150°C. 

e. A necessary condition for compound X to boil 
is that its temperature be at least 150°C. 

f. A sufficient condition for compound X to boil 
is that its temperature be at least 150°C. 


In 47-50 (a) use the logical equivalences p—q = ~p vq 
and p<>q = (~p v q) ^ (~q v p) to rewrite the given 
statement forms without using the symbol — or <, 
and (b) use the logical equivalence p v q = ~(~p A ~q) 
to rewrite each statement form using only ^ and ~. 

47. p\~q-r 

48. pv ~q>rvq 

49. p >r) e> q >r) 


50. p> Q> Ne (WAD>” 
51. Given any statement form, is it possible to find a 


logically equivalent form that uses only ~ and ^? 
Justify your answer. 
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ANSWERS FOR TEST YOURSELF 


1. true; false 2.p/A~q 3.ifgthenp 4.if~qthen~p 5.if~pthen~g 6. logically equivalent 7. logically equivalent 
8.R;S 9.8;R 10.R;S 


BX Valid and Invalid Arguments 


“Contrariwise,” continued Tweedledee, “if it was so, it might be; and if it were so, it 
would be; but as it isn’t, it ain’t. That’s logic.” —Lewis Carroll, Through the Looking Glass 


In mathematics and logic an argument is not a dispute. It is simply a sequence of state- 
ments ending in a conclusion. In this section we show how to determine whether an ar- 
gument is valid—that is, whether the conclusion follows necessarily from the preceding 
statements. We will show that this determination depends only on the form of an argu- 
ment, not on its content. 

It was shown in Section 2.1 that the logical form of an argument can be abstracted from 
its content. For example, the argument 


If Socrates is a man, then Socrates is mortal. 


Socrates is a man. 
<. Socrates is mortal. 
has the abstract form 
If p then q 
P 
~q 
When considering the abstract form of an argument, think of p and q as variables for 
which statements may be substituted. An argument form is called valid if, and only if, 


whenever statements are substituted that make all the premises true, the conclusion is 
also true. 


Definition 


An argument is a sequence of statements, and an argument form is a sequence 
of statement forms. All statements in an argument and all statement forms in an 
argument form, except for the final one, are called premises (or assumptions or 
hypotheses). The final statement or statement form is called the conclusion. The 
symbol .., which is read “therefore,” is normally placed just before the conclusion. 

To say that an argument form is valid means that no matter what particular 
statements are substituted for the statement variables in its premises, if the resulting 
premises are all true, then the conclusion is also true. To say that an argument is 
valid means that its form is valid. 


The crucial fact about a valid argument is that the truth of its conclusion follows neces- 
sarily or inescapably or by logical form alone from the truth of its premises. It is impos- 
sible to have a valid argument with all true premises and a false conclusion. When an 
argument is valid and its premises are true, the truth of the conclusion is said to be inferred 
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or deduced from the truth of the premises. If a conclusion “ain’t necessarily so,” then it 
isn’t a valid deduction. 


Testing an Argument Form for Validity 


1. Identify the premises and conclusion of the argument form. 

2. Construct a truth table showing the truth values of all the premises and the 
conclusion. 

3. A row of the truth table in which all the premises are true is called a critical row. 
If there is a critical row in which the conclusion is false, then it is possible for an 
argument of the given form to have true premises and a false conclusion, and so 
the argument form is invalid. If the conclusion in every critical row is true, then 
the argument form is valid. 


CANIF] Determining Validity or Invalidity 


Determine whether the following argument form is valid or invalid by drawing a truth ta- 
ble, indicating which columns represent the premises and which represent the conclusion, 
and annotating the table with a sentence of explanation. When you fill in the table, you 
only need to indicate the truth values for the conclusion in the rows where all the premises 
are true (the critical rows) because the truth values of the conclusion in the other rows are 
irrelevant to the validity or invalidity of the argument. 


A p—qVv~r 
q—>p^r 
Caution! If at least one 
premise of an argument 
is false, then we have no 


information about the 
conclusion: It might be 


por 


Solution The truth table shows that even though there are several situations in which the 
premises and the conclusion are all true (rows 1, 7, and 8), there is one situation (row 4) 


true or it might be false. where the premises are true and the conclusion is false. 
premises conclusion 

P q r ~r |qV-~r p^r poaqVvVrr q—p/r p-r 

T T T F T T JE T T 

T T F T T F T F This row shows that an 
argument of this form 

I F a F F T F 1 can have true premises 

T F F T T F T T F and a false conclusion. 
Hence this form of 

F T T F T F T F argument is invalid. 

F T F T T F T F 

F F T F F F T T 

F F F T T F T T 
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Modus Ponens and Modus Tollens 


An argument form consisting of two premises and a conclusion is called a syllogism. 
The first and second premises are called the major premise and minor premise, re- 
spectively. The most famous form of syllogism in logic is called modus ponens. It has 
the following form: 


If p then q. 
P 
~q 


Here is an argument of this form: 


If the sum of the digits of 371,487 is divisible by 3, 
then 371,487 is divisible by 3. 


The sum of the digits of 371,487 is divisible by 3. 
*, 371,487 is divisible by 3. 


The term modus ponens is Latin meaning “method of affirming” (the conclusion is an 
affirmation). Long before you saw your first truth table, you were undoubtedly being con- 
vinced by arguments of this form. Nevertheless, it is instructive to prove that modus po- 
nens is a valid form of argument, if for no other reason than to confirm the agreement 
between the formal definition of validity and the intuitive concept. To do so, we construct 
a truth table for the premises and conclusion. 


premises conclusion 

P q p>q Pp 

T T JẸ T I < critical row 
T F F T 

F T T F 

F F T F 


The first row is the only one in which both premises are true, and the conclusion in that 
row is also true. Hence the argument form is valid. 


Now consider another valid argument form called modus tollens. It has the following 
form: 


If p then q. 
~q 
-~p 


Here is an example of modus tollens: 


If Zeus is human, then Zeus is mortal. 
Zeus is not mortal. 


.. Zeus is not human. 
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An intuitive explanation for the validity of modus tollens uses proof by contradiction. 
It goes like this: 
Suppose 


(1) If Zeus is human, then Zeus is mortal; and 


(2) Zeus is not mortal. 


Must Zeus necessarily be nonhuman? 
Yes! 


Because, if Zeus were human, then by (1) he would be mortal. 
But by (2) he is not mortal. 


Hence, Zeus cannot be human. 


Modus tollens is Latin meaning “method of denying” (the conclusion is a denial). The 
validity of modus tollens can be shown to follow from modus ponens together with the 
fact that a conditional statement is logically equivalent to its contrapositive. Or it can be 
established formally by using a truth table. (See exercise 13.) 

Studies by cognitive psychologists have shown that although nearly 100% of college 
students have a solid, intuitive understanding of modus ponens, less than 60% are able 
to apply modus tollens correctly.* Yet in mathematical reasoning, modus tollens is used 
almost as often as modus ponens. Thus it is important to study the form of modus tollens 
carefully to learn to use it effectively. 


CAIFE Recognizing Modus Ponens and Modus Tollens 


Use modus ponens or modus tollens to fill in the blanks of the following arguments so that 
they become valid inferences. 


a. If there are more pigeons than there are pigeonholes, then at least two pigeons roost in 
the same hole. 
There are more pigeons than there are pigeonholes. 


b. If 870,232 is divisible by 6, then it is divisible by 3. 
870,232 is not divisible by 3. 


Solution 
a. At least two pigeons roost in the same hole. by modus ponens 
b. 870,232 is not divisible by 6. by modus tollens a 


Additional Valid Argument Forms: Rules of Inference 


A rule of inference is a form of argument that is valid. Thus modus ponens and modus 
tollens are both rules of inference. The following are additional examples of rules of infer- 
ence that are frequently used in deductive reasoning. 


Geese) Generalization 


The following argument forms are valid: 


a p b. q 
pva pva 


*Cognitive Psychology and Its Implications, 3d ed. by John R. Anderson (New York: Freeman, 1990), pp. 292-297. 
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These argument forms are used for making generalizations. For instance, according to 
the first, if p is true, then, more generally, “p or q” is true for any other statement q. As an 
example, suppose you are given the job of counting the upperclassmen at your school. You 
ask what class Anton is in and are told he is a junior. 

You reason as follows: 


Anton is a junior. 
.. (more generally) Anton is a junior or Anton is a senior. 


Knowing that upperclassman means junior or senior, you add Anton to your list. a 


Selves Specialization 


The following argument forms are valid: 


a prq b p^q 
Ap fag 

These argument forms are used for specializing. When classifying objects according to 
some property, you often know much more about them than whether they do or do not have 
that property. When this happens, you discard extraneous information as you concentrate 
on the particular property of interest. 

For instance, suppose you are looking for a person who knows graph algorithms to work 
with you on a project. You discover that Ana knows both numerical analysis and graph 
algorithms. You reason as follows: 


Ana knows numerical analysis and Ana knows graph algorithms. 
.. (in particular) Ana knows graph algorithms. 


Accordingly, you invite her to work with you on your project. a 


Both generalization and specialization are used frequently in mathematics to tailor facts 
to fit into hypotheses of known theorems in order to draw further conclusions. Elimination, 
transitivity, and proof by division into cases are also widely used tools. 


CAEH Elimination 


The following argument forms are valid: 


a pvq b. pvq 
~q ~p 
rap wg 


These argument forms say that when you have only two possibilities and you can rule 
one out, the other must be the case. For instance, suppose you know that for a particular 
number x, 
x—3=0 or x+2=0. 
If you also know that x is not negative, then x # —2, so 
x+2 +0. 
By elimination, you can then conclude that 


.x-3=0. E 
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Example 2.3.6 Transitivity 


The following argument form is valid: 
pq 
q7>r 
“por 
Many arguments in mathematics contain chains of if-then statements. From the fact that 


one statement implies a second and the second implies a third, you can conclude that the 
first statement implies the third. In the example below suppose n is a particular integer. 


If n is divisible by 18, then n is divisible by 9. 
If n is divisible by 9, then the sum of the digits of n is divisible by 9. 
.. If n is divisible by 18, then the sum of the digits of n is divisible by 9. E 


Sen wewe Proof by Division into Cases 


The following argument form is valid: 
PV 
pr 
q>r 
BF 
It often happens that you know one thing or another is true. If you can show that in either case 
a certain conclusion follows, then this conclusion must also be true. For instance, suppose you 
know that x is a particular nonzero real number that is not zero. The trichotomy property of 
the real numbers says that any real number is positive, negative, or zero. Thus (by elimination) 
you know that x is positive or x is negative. You can deduce that X >0 by arguing as follows: 
x is positive or x is negative. 
If x is positive, then x>0. 
If x is negative, then xr >o. 
Aa SO, a 


The rules of valid inference are used constantly in problem solving. Here is an example 
from everyday life. 


Application: A More Complex Deduction 


You are about to leave for class in the morning and discover that you don’t have your 
glasses. You know the following statements are true: 


a. If I was reading my class notes in the kitchen, then my glasses are on the kitchen table. 
b. If my glasses are on the kitchen table, then I saw them at breakfast. 
c. I did not see my glasses at breakfast. 


d. I was reading my class notes in the living room or I was reading my class notes in the 
kitchen. 


e. If I was reading my class notes in the living room then my glasses are on the coffee table. 


Where are the glasses? 
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Solution Let RK =I was reading my class notes in the kitchen. 
GK = My glasses are on the kitchen table. 
SB = I saw my glasses at breakfast. 
RL = I was reading my class notes in the living room. 


GC = My glasses are on the coffee table. 


Here is a sequence of steps you might use to reach the answer, together with the rules of 
inference that allow you to draw the conclusion of each step: 


ils RK — GK by (a) 
GK — SB by (b) 
*.RK—SB by transitivity 
2. RK—SB by the conclusion of (1) 
~SB by (c) 
“RK, by modus tollens 
3. RLV RK by (d) 
~RK by the conclusion of (2) 
RL by elimination 
4. RL— GC by (e) 
RL by the conclusion of (3) 
GC by modus ponens 
Thus the glasses are on the coffee table. E 
Fallacies 


A fallacy is an error in reasoning that results in an invalid argument. Three common fal- 
lacies are using ambiguous premises, and treating them as if they were unambiguous, 
circular reasoning (assuming what is to be proved without having derived it from the 
premises), and jumping to a conclusion (without adequate grounds). In this section we 
discuss two other fallacies, called converse error and inverse error, which give rise to 
arguments that superficially resemble those that are valid by modus ponens and modus 
tollens but are not, in fact, valid. 

As in previous examples, you can show that an argument is invalid by constructing 
a truth table for the argument form and finding at least one critical row in which all the 
premises are true but the conclusion is false. Another way is to find an argument of the 
same form with true premises and a false conclusion. 


For an argument to be valid, every argument of the same form whose premises are 
all true must have a true conclusion. It follows that for an argument to be invalid 


means that there is an argument of that form whose premises are all true and whose 
conclusion is false. 
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elu wees Converse Error 


Show that the following argument is invalid: 


If Zeke is a cheater, then Zeke sits in the back row. 
Zeke sits in the back row. 


.. Zeke is a cheater. 


Solution Many people recognize the invalidity of the above argument intuitively, rea- 
soning something like this: The first premise gives information about Zeke if it is known 
he is a cheater. It doesn’t give any information about him if it is not already known that he 
is a cheater. One can certainly imagine a person who is not a cheater but happens to sit in 
the back row. Then if that person’s name is substituted for Zeke, the first premise is true by 
default and the second premise is also true but the conclusion is false. 

The general form of the previous argument is as follows: 


Pd 
q 
aP 
In exercise 12(a) at the end of this section you are asked to use a truth table to show that 
this form of argument is invalid. E 


The fallacy underlying this invalid argument form is called the converse error because 
the conclusion of the argument would follow from the premises if the premise p — q were 
replaced by its converse. Such a replacement is not allowed, however, because a conditional 
statement is not logically equivalent to its converse. Converse error is also known as the 
fallacy of affirming the consequent. 

A related common reasoning error is shown in the next example. 


SCW EAL Inverse Error 


Consider the following argument: 


If these two vertices are adjacent, then they do not have the same color. 
These two vertices are not adjacent. 


.. These two vertices have the same color. 


Note that this argument has the following form: 


pd 
=p 


A sag 


: . You are asked to give a truth table verification of the invalidity of this argument form in 
Caution! In logic, the ; ; : 
s exercise 12(b) at the end of this section. 
words true and valid have 7 ae . A E 
: p The fallacy underlying this invalid argument form is called the inverse error because 
very different meanings. A i , , í 
valid argument may have the conclusion of the argument would follow from the premises if the premise p —> q were 


a false conclusion, and replaced by its inverse. Such a replacement is not allowed, however, because a conditional 
an invalid argument may statement is not logically equivalent to its inverse. Inverse error is also known as the fallacy 
have a true conclusion. of denying the antecedent. E 
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Sometimes people lump together the ideas of validity and truth. If an argument seems 
valid, they accept the conclusion as true. And if an argument seems fishy (really a slang 
expression for invalid), they think the conclusion must be false. This is not correct! 


ZSWIG EAI A Valid Argument with a False Premise and a False Conclusion 


The argument below is valid by modus ponens. But its major premise is false, and so is its 
conclusion. 


If Canada is north of the United States, then temperatures in Canada never rise above 
freezing. 


Canada is north of the United States. 


"`. Temperatures in Canada never rise above freezing. E 


ZEWG EAr An Invalid Argument with True Premises and a True Conclusion 


The argument below is invalid by the converse error, but it has a true conclusion. 


If New York is a big city, then New York has tall buildings. 
New York has tall buildings. 
.. New York is a big city. E 


Definition 


An argument is called sound if, and only if, it is valid and all its premises are true. 
An argument that is not sound is called unsound. 


The important thing to note is that validity is a property of argument forms: If an argu- 
ment is valid, then so is every other argument that has the same form. Similarly, if an argu- 
ment is invalid, then so is every other argument that has the same form. What characterizes a 
valid argument is that no argument whose form is valid can have all true premises and a false 
conclusion. For each valid argument, there are arguments of that form with all true prem- 
ises and a true conclusion, with at least one false premise and a true conclusion, and with at 
least one false premise and a false conclusion. On the other hand, for each invalid argument, 
there are arguments of that form with every combination of truth values for the premises and 
conclusion, including all true premises and a false conclusion. The bottom line is that we can 
only be sure that the conclusion of an argument is true when we know that the argument is 
sound, that is, when we know both that the argument is valid and that it has all true premises. 


Contradictions and Valid Arguments 


The concept of logical contradiction can be used to make inferences through a technique 
of reasoning called the contradiction rule. Suppose p is some statement whose truth you 
wish to deduce. 


Contradiction Rule 


If you can show that the supposition that statement p is false leads logically to a 
contradiction, then you can conclude that p is true. 
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Example 2.3.14 


Eddie Hausner/The New York Times/Redux 


Raymond Smullyan 
(1919-2017) 
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Contradiction Rule 
Show that the following argument form is valid: 
~p — c, where c is a contradiction 
~p 
Solution Construct a truth table for the premise and the conclusion of this argument. 


premises conclusion 


There is only one critical 
~P . ee aa 4 row in which the premise 
is true, and in this row the 
T F F ut T conclusion is also true. 
Hence this form of argu- 
T F F ment is valid. E 


The contradiction rule is the logical heart of the method of proof by contradiction. A slight 
variation also provides the basis for solving many logical puzzles by eliminating contradic- 
tory answers: If an assumption leads to a contradiction, then that assumption must be false. 


Knights and Knaves 


The logician Raymond Smullyan describes an island containing two types of people: 
knights who always tell the truth and knaves who always lie.* You visit the island and are 
approached by two natives who speak to you as follows: 


A says: B is a knight. 
B says: A and I are of opposite type. 


What are A and B? 
Solution A and B are both knaves. To see this, reason as follows: 


Suppose A is a knight. 


*, What A says is true. by definition of knight 
*, B is also a knight. That’s what A said. 
*, What B says is true. by definition of knight 


'. A and B are of opposite types. That’s what B said. 


*, We have arrived at the following contradiction: A and B are both knights 
and A and B are of opposite type. 


*. The supposition is false. by the contradiction rule 
"A is not a knight. negation of supposition 
*,Ais a knave. by elimination: It’s given that all inhabitants are knights 


or knaves, so since A is not a knight, A is a knave. 
*, What A says is false. 
*, B is not a knight. 


.. Bis also a knave. by elimination 


*Raymond Smullyan has written a delightful series of whimsical yet profound books of logical puzzles start- 
ing with What Is the Name of This Book? (Englewood Cliffs, New Jersey: Prentice-Hall, 1978). Other good 
sources of logical puzzles are the many excellent books of Martin Gardner, such as Aha! Insight and Aha! 
Gotcha (New York: W. H. Freeman, 1978, 1982). 
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This reasoning shows that if the problem has a solution at all, then A and B must both be 
knaves. It is conceivable, however, that the problem has no solution. The problem statement 
could be inherently contradictory. If you look back at the solution, though, you can see that 
it does work out for both A and B to be knaves. | 
Summary of Rules of Inference 


Table 2.3.1 summarizes some of the most important rules of inference. 


TABLE 2.3.1 Valid Argument Forms 


Modus Ponens p>q Elimination a pvq b. pvq 

P ~q ~p 

q <P q 
Modus Tollens p>q Transitivity p>q 

“q q—>r 
Generalization a p b. q Proof by DV 

pva ..p\/q_ | Division into Cases por 

Specialization a prq b. p^q : 1 a 

p q l 
Conjunction p Contradiction Rule ~p>c 

q <P 

p^q 


TEST YOURSELF 


1. For an argument to be valid means that every 3. For an argument to be sound means that it 
argument of the same form whose premises —___ is _____ and its premises 
has a 


. In this case 
conclusion. we can be sure that its conclusion 


2. For an argument to be invalid means that there 
is an argument of the same form whose premises 
—— and whose conclusion ____ 


EXERCISE SET 2.3 


Use modus ponens or modus tollens to fill in the blanks in 3. If logic is easy, then I am a monkey’s uncle. 
the arguments of 1-5 so as to produce valid inferences. I am not a monkey’s uncle. 
1. If V2 is rational, then V2 = a/b for some 
integers a and b. 4. If this graph can be colored with three colors, 
It is not true that V2 = a/b for some integers a then it can colored with four colors. 
and b. This graph cannot be colored with four colors. 
2. If 1—0.99999...is less than every positive real 5. If they were unsure of the address, then they 
number, then it equals zero. would have telephoned. 
*, The number 1 — 0.99999 .. . equals zero. .. They were sure of the address. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Use truth tables to determine whether the argument forms 
in 6-11 are valid. Indicate which columns represent the 
premises and which represent the conclusion, and include 
a sentence explaining how the truth table supports your 
answer. Your explanation should show that you understand 
what it means for a form of argument to be valid or invalid. 


6 p—>q 7 p 
q4 >P Pq 
“pVg ~qvr 
AT 
8& pvdq 9. pAq->-~r 
porn pV ~q 
p>r ~4—>P 
ar tees fi 


10. pVq-r 
“rT ~pDAN~| 
(This is the form of argument shown on pages 37 
and 38.) 


11. pmaqvr 
~qV~r 
per 
12. Use truth tables to show that the following forms 
of argument are invalid. 


a. pq b. p>q 
q SD 
AP MO" 


(converse error) (inverse error) 


Use truth tables to show that the argument forms referred 
to in 13-21 are valid. Indicate which columns represent the 
premises and which represent the conclusion, and include 
a sentence explaining how the truth table supports your 
answer. Your explanation should show that you understand 
what it means for a form of argument to be valid. 


13. Modus tollens: 

pd 

~q 

-~p 

14. Example 2.3.3(a) 
15. Example 2.3.3(b) 
16. Example 2.3.4(a) 
17. Example 2.3.4(b) 
18. Example 2.3.5(a) 
19. Example 2.3.5(b) 
20. Example 2.3.6 


21. Example 2.3.7 
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Use symbols to write the logical form of each argument 

in 22 and 23, and then use a truth table to test the argu- 
ment for validity. Indicate which columns represent the 
premises and which represent the conclusion, and include 
a few words of explanation showing that you understand 
the meaning of validity. 


22. If Tom is not on team A, then Hua is on team B. 
If Hua is not on team B, then Tom is on team A. 
*, Tom is not on team A or Hua is not on team B. 


23. Oleg is a math major or Oleg is an economics 
major. 
If Oleg is a math major, then Oleg is required to 
take Math 362. 
*, Oleg is an economics major or Oleg is not 
required to take Math 362. 
Some of the arguments in 24-32 are valid, whereas others 
exhibit the converse or the inverse error. Use symbols to 
write the logical form of each argument. If the argument 
is valid, identify the rule of inference that guarantees its 
validity. Otherwise, state whether the converse or the 
inverse error is made. 


24. If Jules solved this problem correctly, then Jules 
obtained the answer 2. 
Jules obtained the answer 2. 
*, Jules solved this problem correctly. 


25. This real number is rational or it is irrational. 
This real number is not rational. 
*. This real number is irrational. 


26. IfI go to the movies, I won’t finish my homework. 
If I don’t finish my homework, I won’t do well 
on the exam tomorrow. 

‘, If I go to the movies, I won’t do well on the 
exam tomorrow. 


27. If this number is larger than 2, then its square is 
larger than 4. 
This number is not larger than 2. 
*, The square of this number is not larger than 4. 


28. ‘If there are as many rational numbers as there 
are irrational numbers, then the set of all irratio- 
nal numbers is infinite. 

The set of all irrational numbers is infinite. 
‘, There are as many rational numbers as there are 
irrational numbers. 


29. Ifat least one of these two numbers is divisible 
by 6, then the product of these two numbers is 
divisible by 6. 

Neither of these two numbers is divisible by 6. 
.. The product of these two numbers is not divisible 
by 6. 
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30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 
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If this computer program is correct, then it pro- 
duces the correct output when run with the test 
data my teacher gave me. 
This computer program produces the correct output 
when run with the test data my teacher gave me. 

*, This computer program is correct. 


Sandra knows Java and Sandra knows C+ +. 
*, Sandra knows C++. 


If I get a Christmas bonus, I'll buy a stereo. 
If I sell my motorcycle, I'll buy a stereo. 

*, If I get a Christmas bonus or I sell my motor- 
cycle, then I’ll buy a stereo. 


Give an example (other than Example 2.3.11) of a 
valid argument with a false conclusion. 


Give an example (other than Example 2.3.12) of an 
invalid argument with a true conclusion. 


Explain in your own words what distinguishes a 
valid form of argument from an invalid one. 


Given the following information about a computer 

program, find the mistake in the program. 

a. There is an undeclared variable or there is a 
syntax error in the first five lines. 

b. If there is a syntax error in the first five lines, 
then there is a missing semicolon or a variable 
name is misspelled. 

c. There is not a missing semicolon. 

d. There is not a misspelled variable name. 


In the back of an old cupboard you discover a note 

signed by a pirate famous for his bizarre sense of 

humor and love of logical puzzles. In the note he 

wrote that he had hidden treasure somewhere on 

the property. He listed five true statements (a—e 

below) and challenged the reader to use them to 

figure out the location of the treasure. 

a. If this house is next to a lake, then the treasure 
is not in the kitchen. 

b. If the tree in the front yard is an elm, then the 
treasure is in the kitchen. 

c. This house is next to a lake. 

d. The tree in the front yard is an elm or the trea- 
sure is buried under the flagpole. 

e. If the tree in the back yard is an oak, then the 
treasure is in the garage. 


Where is the treasure hidden? 
You are visiting the island described in Example 


2.3.14 and have the following encounters with 
natives. 
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H d. 


39. 


40. 


a. Two natives A and B address you as follows: 
A says: Both of us are knights. 
B says: A is a knave. 
What are A and B? 
b. Another two natives C and D approach you but 
only C speaks. 
C says: Both of us are knaves. 
What are C and D? 
c. You then encounter natives E and F. 
E says: F is a knave. 
F says: E is a knave. 
How many knaves are there? 
Finally, you meet a group of six natives, U, V, 
W, X, Y, and Z, who speak to you as follows: 
U says: None of us is a knight. 
V says: At least three of us are knights. 
W says: At most three of us are knights. 
X says: Exactly five of us are knights. 
Y says: Exactly two of us are knights. 
Z says: Exactly one of us is a knight. 
Which are knights and which are knaves? 


The famous detective Percule Hoirot was called in 
to solve a baffling murder mystery. He determined 
the following facts: 

a. Lord Hazelton, the murdered man, was killed 
by a blow on the head with a brass candlestick. 

b. Either Lady Hazelton or a maid, Sara, was in 
the dining room at the time of the murder. 

c. Ifthe cook was in the kitchen at the time of the 
murder, then the butler killed Lord Hazelton 
with a fatal dose of strychnine. 

d. If Lady Hazelton was in the dining room at the 
time of the murder, then the chauffeur killed 
Lord Hazelton. 

e. Ifthe cook was not in the kitchen at the time 
of the murder, then Sara was not in the dining 
room when the murder was committed. 

f. If Sara was in the dining room at the time the 
murder was committed, then the wine steward 
killed Lord Hazelton. 


Is it possible for the detective to deduce the iden- 
tity of the murderer from these facts? If so, who 
did murder Lord Hazelton? (Assume there was 
only one cause of death.) 


Sharky, a leader of the underworld, was killed by 
one of his own band of four henchmen. Detective 
Sharp interviewed the men and determined that all 
were lying except for one. He deduced who killed 
Sharky on the basis of the following statements: 

a. Socko: Lefty killed Sharky. 

b. Fats: Muscles didn’t kill Sharky. 
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c. Lefty: Muscles was shooting craps with Socko 42.a. pvq 
when Sharky was knocked off. b. q—>r 
d. Muscles: Lefty didn’t kill Sharky. & p^s>t 
Who did kill Sharky? d =f 
In 41-44 a set of premises and a conclusion are given. Use e. ~q>Uuns 
the valid argument forms listed in Table 2.3.1 to deduce the fi wit 
conclusion from the premises, giving a reason for each step as 
in Example 2.3.8. Assume all variables are statement variables. 43. a. ~p—r\~s 44. a. p—>q 
aal ~pvqor b. t—>s b rvs 
b. sv~q c. u —> ~p c. ~s>~t 
E =i d ~w d. ~qvs 
d. p>t e uvw e os 
e pAr f. o~t f. ~p^r—>u 
fo ~q g. wvVt 
h. o.uAw 


ANSWERS FOR TEST YOURSELF 


1. are all true; true 2. are all true; is false 3. valid; are all true; is true 


EJ Application: Digital Logic Circuits 


Only connect! —E. M. Forster, Howards End 


In the late 1930s, a young M.LT. graduate student named Claude Shannon noticed an analogy 
between the operations of switching devices, such as telephone switching circuits, and the op- 
erations of logical connectives. He used this analogy with striking success to solve problems 
of circuit design and wrote up his results in his master’s thesis, which was published in 1938. 
The drawing in Figure 2.4.1(a) shows the appearance of the two positions of a simple 
switch. When the switch is closed, current can flow from one terminal to the other; when 
„ it is open, current cannot flow. Imagine that such a switch is part of the circuit shown in 
È Figure 2.4.1(b). The light bulb turns on if, and only if, current flows through it. And this 
happens if, and only if, the switch is closed. 


The symbol 
denotes a batte: 


the symbol 


and 


a 


Alfred Eisenstaedt/Getty | 


© 


—eo 


Claude Shannon Open Closed denotes a light bulb. 
(1916-2001) (a) (b) 


FIGURE 2.4.1 


Now consider the more complicated circuits of Figures 2.4.2(a) and 2.4.2(b). 


Switches “in series” Switches “in parallel” 
(a) (b) 
FIGURE 2.4.2 
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In the circuit of Figure 2.4.2(a) current flows and the light bulb turns on if, and only if, 
both switches P and Q are closed. The switches in this circuit are said to be in series. In the 
circuit of Figure 2.4.2(b) current flows and the light bulb turns on if, and only if, at least 
one of the switches P or Q is closed. The switches in this circuit are said to be in parallel. 
All possible behaviors of these circuits are described by Table 2.4.1. 


(a) Switches in Series (b) Switches in Parallel 
P Q State P Q State 
closed closed on closed closed on 
closed open off closed open on 
open closed off open closed on 
open open off open open off 


Observe that if the words closed and on are replaced by T and open and off are replaced 
by F, Table 2.4.1(a) becomes the truth table for and and Table 2.4.1(b) becomes the truth 
table for or. Consequently, the switching circuit of Figure 2.4.2(a) is said to correspond to 
the logical expression P ^ Q, and that of Figure 2.4.2(b) is said to correspond to P v Q. 

More complicated circuits correspond to more complicated logical expressions. This 
correspondence has been used extensively in the design and study of circuits. 

In the 1940s and 1950s, switches were replaced by electronic devices, with the physical 
states of closed and open corresponding to electronic states such as high and low voltages. 


The Intel 4004, introduced 
in 1971, is generally 
considered to be the first 
commercially viable 
microprocessor or central 
processing unit (CPU) 
contained on a chip about 
the size of a fingernail. 

It consisted of 2,300 
transistors and could 
execute 70,000 instructions 
per second, essentially the 
same computing power 

as the first electronic 
computer, the ENIAC, 
built in 1946, which filled 
an entire room. Modern 
microprocessors consist 
of several CPUs on one 
chip, contain close to a 
billion transistors and 
many hundreds of millions 
of logic circuits, and can 
compute hundreds of 
millions of instructions per 
second. 


Tim McNerney 
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The new electronic technology led to the development of modern digital systems such as 
electronic computers, electronic telephone switching systems, traffic light controls, elec- 
tronic calculators, and the control mechanisms used in hundreds of other types of electron- 
ic equipment. The basic electronic components of a digital system are called digital logic 
circuits. The word logic indicates the important role of logic in the design of such circuits, 
and the word digital indicates that the circuits process discrete, or separate, signals as op- 
posed to continuous ones. 

Electrical engineers continue to use the language of logic when they refer to values of 
signals produced by an electronic switch as being “true” or “false.” But they generally use 
the symbols | and 0 rather than T and F to denote these values. The symbols 0 and 1 are 
called bits, short for binary digits. This terminology was introduced in 1946 by the statisti- 
cian John Tukey. 


Alfred Eisenstaedt/Time Life Pictures/Getty Images 


John W. Tukey 
(1915-2000) 


Black Boxes and Gates 


Combinations of signal bits (1’s and 0’s) can be transformed into other combinations of sig- 
nal bits (1’s and 0’s) by means of various circuits. Because a variety of different technolo- 
gies are used in circuit construction, computer engineers and digital system designers find 
it useful to think of certain basic circuits as black boxes. The inside of a black box contains 
the detailed implementation of the circuit and is often ignored while attention is focused 
on the relation between the input and the output signals. 


P 
Input 
signals : black box S Output signal 


The operation of a black box is completely specified by constructing an input/output 
table that lists all its possible input signals together with their corresponding output sig- 
nals. For example, the black box pictured above has three input signals. Since each of these 
signals can take the value | or 0, there are eight possible combinations of input signals. One 
possible correspondence of input to output signals is as follows: 


An Input/Output Table 
Input Output 

P Q R S 
1 1 1 1 
1 1 0 0 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 1 
0 0 1 1 
0 0 0 0 


The third row, for instance, indicates that for inputs P = 1, Q = 0, and R = 1, the output 
Sis 0. 
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An efficient method for designing more complicated circuits is to build them by con- 
necting less complicated black box circuits. Three such circuits are known as NOT-, AND-, 
and OR-gates. 

A NOT-gate (or inverter) is a circuit with one input signal and one output signal. If 
the input signal is 1, the output signal is 0. Conversely, if the input signal is 0, then the 
output signal is 1. An AND-gate is a circuit with two input signals and one output signal. 
If both input signals are 1, then the output signal is 1. Otherwise, the output signal is 0. 
An OR-gate also has two input signals and one output signal. If both input signals are 0, 
then the output signal is 0. Otherwise, the output signal is 1. 

The actions of NOT-, AND-, and OR-gates are summarized in Figure 2.4.3, where P 
and Q represent input signals and R represents the output signal. It should be clear from 
Figure 2.4.3 that the actions of the NOT-, AND-, and OR-gates on signals correspond 
exactly to those of the logical connectives ~, A, and v on statements, if the symbol 1 is 
identified with T and the symbol 0 is identified with F. 

Gates can be combined into circuits in a variety of ways. If the rules shown at the 
bottom of the page are obeyed, the result is a combinational circuit, one whose output at 
any time is determined entirely by its input at that time without regard to previous inputs. 


Type of Gate Symbolic Representation Action 


NOT e—a e 


P Q 
Q 1 0 
0 1 
0 0 


Input Output 


a 


a | > 


FIGURE 2.4.3 


— 


oloj=|=|®™ 
oj=/loj= 0 
m 


o 


Rules for a Combinational Circuit 


Never combine two input wires. 2.4.1 
A single input wire can be split partway and used as input for 

two separate gates. 2.4.2 
An output wire can be used as input. 2.4.3 
No output of a gate can eventually feed back into that gate. 2.4.4 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


2.4 APPLICATION: DIGITAL LOGIC CIRCUITS 83 


Rule (2.4.4) is violated in more complex circuits, called sequential circuits, whose 
output at any given time depends both on the input at that time and also on previous inputs. 
These circuits are discussed in Section 12.2. 


The Input/Output Table for a Circuit 


If you are given a set of input signals for a circuit, you can find its output by tracing through 
the circuit gate by gate. 


Determining Output for a Given Input 


Indicate the output of the circuits shown below for the given input signals. 


a. Input signals: P = 0 and Q = 1 
P O 
AND R 


Input signals: P= 1,0 =0,R=1 


v 
\/ 


fas] 


Solution 


a. Move from left to right through the diagram, tracing the action of each gate on the in- 
put signals. The NOT-gate changes P = 0 to a 1, so both inputs to the AND-gate are 1; 
hence the output R is 1. This is illustrated by annotating the diagram as shown below. 


b. The output of the OR-gate is 1 since one of the input signals, P, is 1. The NOT-gate 
changes this | into a 0, so the two inputs to the AND-gate are 0 and R = 1. Hence the 
output S is 0. The trace is shown below. 


R | 


To construct the entire input/output table for a circuit, trace through the circuit to find 
the corresponding output signals for each possible combination of input signals. 


Constructing the Input/Output Table for a Circuit 


Construct the input/output table for the following circuit. 


ae 
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Solution List the four possible combinations of input signals, and find the output for 
each by tracing through the circuit. 


Input Output 


g p P Q R 

S 7 1 1 1 

=e 1 0 1 

ge 0 1 0 
George Boole 0 0 1 T 
(1815-1864) 

The Boolean Expression Corresponding to a Circuit 

Note Strictly speak- In logic, variables such as p, q, and r represent statements, and a statement can have one 
ing, only meaningful of only two truth values: T (true) or F (false). A statement form is an expression, such as 
expressions such as p ^ (~q V r), composed of statement variables and logical connectives. 
(~p ^q) V (pr) and As noted earlier, one of the founders of symbolic logic was the English mathematician 
~~ ^q) V r) are George Boole. In his honor, any variable, such as a statement variable or an input signal, 


allowed as Boolean, not 
meaningless ones like 
p~q (rs V \q~. We use 
recursion to give a careful 
definition of Boolean ex- 
pressions in Section 5.9. 


that can take one of only two values is called a Boolean variable. An expression composed 
of Boolean variables and the connectives ~, ^, and v is called a Boolean expression. 

Given a circuit consisting of combined NOT-, AND-, and OR-gates, a correspond- 
ing Boolean expression can be obtained by tracing the actions of the gates on the input 
variables. 


Scu SJ Finding a Boolean Expression for a Circuit 


Find the Boolean expressions that correspond to the circuits shown below. A black dot 
indicates a soldering of two wires; wires that cross without a dot are assumed not to touch. 


Solution 


a. Trace through the circuit from left to right, indicating the output of each gate symboli- 


cally, as shown below. 
jap Jeanne se 


The final expression obtained, (P v Q) ^A ~(P ^ Q), is the expression for exclusive or: 
P or Q but not both. 


b. The Boolean expression corresponding to the circuit is (P ^A Q) ^A ~R, as shown on 
the next page. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


2.4 APPLICATION: DIGITAL LOGIC CIRCUITS 85 


(PAQ)A-R 


r : 


Observe that the output of the circuit shown in Example 2.4.3(b) is 1 for exactly one 
combination of inputs (P = 1, Q = 1, and R = 0) and is 0 for all other combinations of 
inputs. For this reason, the circuit can be said to “recognize” one particular combination 
of inputs. The output column of the input/output table has a 1 in exactly one row and 0’s 
in all other rows. 


Definition 


A recognizer is a circuit that outputs a | for exactly one particular combination of 
input signals and outputs 0’s for all other combinations. 


Input/Output Table for a Recognizer 


The Circuit Corresponding to a Boolean Expression 


The preceding examples showed how to find a Boolean expression corresponding to a cir- 
cuit. The following example shows how to construct a circuit corresponding to a Boolean 
expression. The strategy is to work from the outermost part of the Boolean expression to 
the innermost part, adding logic gates that correspond to the operations in the expression 
as you move from right to left in the circuit diagram. 


Constructing Circuits for Boolean Expressions 
Construct circuits for the following Boolean expressions. 
a. (~PAQ)V~@Q b. (PA Q)A(RAS) AT 
Solution 


a. Write the input variables in a column on the left side of the diagram. Since the last op- 
eration executed when evaluating (~P ^ Q) v ~Q is v, put an OR-gate at the extreme 
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right of the diagram. One input to this gate is ~P ^ Q, so draw an AND-gate to the 
left of the OR-gate and show its output coming into the OR-gate. Since one input to 
the AND-gate is ~P, draw a line from P to a NOT-gate and from there to the AND- 
gate. Since the other input to the AND-gate is Q, draw a line from Q directly to the 
AND-gate. The other input to the OR-gate is ~Q, so draw a line from Q to a NOT- 
gate and from the NOT-gate to the OR-gate. The circuit you obtain is shown below. 


CPAQ)V~Q 


b. To start constructing this circuit, put one AND-gate at the extreme right to correspond 
to the A, which is the final operation between ((P ^ Q) A (R ^ S)) and T. To the left of 
that gate put the AND-gate corresponding to the ^ between P ^ Q and R ^ S. To the 
left of that gate put the two AND-gates corresponding to the ^’s between P and Q and 
between R and S. The circuit is shown in Figure 2.4.4. 


(PAQ)A(RAS) 


(P AQ) A(RAS)) AT 


FIGURE 2.4.4 a 


It follows from Theorem 2.1.1 that all the ways of adding parentheses to P \OARASAT 
give logically equivalent results. Thus, for example, 
(PAQ) ARRAS) ATS PAQAR) ASAT), 


and hence the circuit in Figure 2.4.5, which corresponds to (PA (QA R)) A(S AT), 
has the same input/output table as the circuit in Figure 2.4.4, which corresponds to 
(PA Q)A(RAS)) AT. 


(PA(QAR)) A(SAT) 


> © > 


nA 


I 


FIGURE 2.4.5 


It follows that the circuits in Figures 2.4.4 and 2.4.5 are both implementations of the 
expression PA QA RAS AT. Such a circuit is called a multiple-input AND-gate and 
is represented by the diagram shown in Figure 2.4.6. Multiple-input OR-gates are con- 
structed similarly. 
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P 
— 
R AND 
5 — 
T 


FIGURE 2.4.6 


Finding a Circuit That Corresponds to a Given Input/Output Table 


To this point, we have discussed how to construct the input/output table for a circuit, how 
to find the Boolean expression corresponding to a given circuit, and how to construct the 
circuit corresponding to a given Boolean expression. Now we address the question of how 
to design a circuit (or find a Boolean expression) corresponding to a given input/output 
table. The way to do this is to put several recognizers together in parallel. 


Designing a Circuit for a Given Input/Output Table 


Design a circuit for the following input/output table: 


Input Output 
P Q R S 
1 1 1 1 
1 1 0 0 
1 0 1 1 
1 0 0 1 
0 1 1 0 
0 1 0 0 
0 0 1 0 
0 0 0 0 


Solution First construct a Boolean expression with this table as its truth table. To do this, 
identify each row for which the output is 1—in this case, the rows 1, 3, and 4. For each such 
row, construct an and expression that produces a | (or true) for the exact combination of 
input values for that row and a 0 (or false) for all other combinations of input values. 

For example, the expression for row 1 is P AQ AR because P ^A Q ^ Ris 1 if P = 1 and 
Q = | and R = 1, and it is 0 for all other values of P, Q, and R. The expression for row 3 
is P A ~Q AR because P A ~Q ^ Ris 1 if P = 1 and Q = Oand R = 1, and it is 0 for all 
other values of P, Q, and R. Similarly, the expression for row 4 is PA ~Q A ~R. 

Now any Boolean expression with the given table as its truth table has the value 1 in 
case PA OAR = 1, or incase PA ~Q AR = 1, or incase PA ~Q A ~R = 1, and in no 
other cases. It follows that a Boolean expression with the given truth table is 


(PAQAR)V(PA~OQAR)V (PA ~QA ~R). ZAS 


The circuit corresponding to this expression has the diagram shown in Figure 2.4.7. Ob- 
serve that expression (2.4.5) is a disjunction of terms that are themselves conjunctions in 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


88 CHAPTER2 THE LOGIC OF COMPOUND STATEMENTS 


which one of P or ~P, one of Q or ~Q, and one of R or ~R all appear. Such expressions 
are said to be in disjunctive normal form or sum-of-products form. 


P 
Q + AND 
R 


FIGURE 2.4.7 E 


Simplifying Combinational Circuits 


Consider the two combinational circuits shown in Figure 2.4.8. 


araro 
HS TD 


(b) 
FIGURE 2.4.8 


If you trace through circuit (a), you will find that its input/output table is 
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which is the same as the input/output table for circuit (b). Thus these two circuits do the same 
job in the sense that they transform the same combinations of input signals into the same 
output signals. Yet circuit (b) is simpler than circuit (a) in that it contains many fewer logic 
gates. Thus, as part of an integrated circuit, it would take less space and require less power. 


Definition 


Two digital logic circuits are equivalent if, and only if, their input/output tables 
are identical. 


Since logically equivalent statement forms have identical truth tables, you can deter- 
mine that two circuits are equivalent by finding the Boolean expressions corresponding 
to the circuits and showing that these expressions, regarded as statement forms, are logi- 
cally equivalent. Example 2.4.6 shows how this procedure works for circuits (a) and (b) 
in Figure 2.4.8. 


Showing That Two Circuits Are Equivalent 


Find the Boolean expressions for each circuit in Figure 2.4.8. Use Theorem 2.1.1 to show 
that these expressions are logically equivalent when regarded as statement forms. 


Solution The Boolean expressions that correspond to circuits (a) and (b) are 
(PA ~Q)V(PAQ)) AQ and P ^Q, respectively. By Theorem 2.1.1, 


(PA~QVPAQ)AA 


=(PA(~QVQ)AQ by the distributive law 
=(PA(QV~Q)AQ by the commutative law for V 
=(PAKHAQ by the negation law 
=PAQ by the identity law. 


It follows that the truth tables for ((P A ~Q) V(PAQ))AQ and P/Q are the same. 
Hence the input/output tables for the circuits corresponding to these expressions are also 
the same, and so the circuits are equivalent. E 


In general, you can simplify a combinational circuit by finding the corresponding Bool- 
ean expression, using the properties listed in Theorem 2.1.1 to find a Boolean expression 
that is shorter and logically equivalent to it (when both are regarded as statement forms), 
and constructing the circuit corresponding to this shorter Boolean expression. 


NAND and NOR Gates 


Another way to simplify a circuit is to find an equivalent circuit that uses the least number 
of different kinds of logic gates. Two gates not previously introduced are particularly use- 
= ful for this: NAND-gates and NOR-gates. A NAND-gate is a single gate that acts like an 

AND-gate followed by a NOT-gate. A NOR-gate acts like an OR-gate followed by a NOT- 
gate. Thus the output signal of a NAND-gate is 0 when, and only when, both input signals 
are 1, and the output signal for a NOR-gate is 1 when, and only when, both input signals 
are 0. The logical symbols corresponding to these gates are | (for NAND) and | (for NOR), 
here | is called a Sheffer stroke (after H. M. Sheffer, 1882-1964) and | is called a Peirce 
arrow (after C. S. Peirce, 1839-1914; see page 110). Thus 


rchives 


g 


Ive 


< 


HUD 305.25. Harvard Uni 


H. M. Sheffer 
(1882-1964) P|Q=~(PAQ0) and P} Q=~(Pv Q). 
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The table below summarizes the actions of NAND and NOR gates. 


Type of Gate Symbolic Representation Action 
Input Output 
P Q R=P\Q 
P 1 1 0 
NAND R 
Q 1 0 1 
0 1 1 
0 0 1 
Input Output 


P 1 1 0 
NOR ) sor) R 
Q 1 0 0 


It can be shown that any Boolean expression is equivalent to one written entirely with 
Sheffer strokes or entirely with Peirce arrows. Thus any digital logic circuit is equivalent 
to one that uses only NAND-gates or only NOR-gates. Example 2.4.7 develops part of the 
derivation of this result; the rest is left for the exercises. 


Rewriting Expressions Using the Sheffer Stroke 


Use Theorem 2.1.1 and the definition of Sheffer stroke to show that 
a. ~P=P|P and b. PVQ=(P|P)|(Q|Q). 


Solution 
a. ~P=~(PAP) by the idempotent law for ^ 
= P|P by definition of l. 
b. PVO=~(~(PV Q) by the double negative law 
=~(~PA~Q) by De Morgan’s laws 
= ~((P|P) \(Q|Q)) by part (a) 
= (P|P) | (Q| Q) by definition of I. E 


TEST YOURSELF 


1. The input/output table for a digital logic circuit is 3. A recognizer is a digital logic circuit that 
a table that shows 


4. Two digital logic circuits are equivalent if, and 
2. The Boolean expression that corresponds to a only if, 
digital logic circuit is 
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5. A NAND-gate is constructed by placing a 
gate immediately following an gate. 


EXERCISE SET 2.4 


2.4 APPLICATION: DIGITAL LOGIC CIRCUITS 91 


6. A NOR-gate is constructed by placing a 
gate immediately following an 


gate. 


Give the output signals for the circuits in 1-4 if the input 
signals are as indicated. 


eae 


input signals: P=1 and Q=1 


input signals: P=1 and Q=0 


3. P 
p> 
R 


input signals: P=1, Q=0, R=0 


ea oe 
TS 


input signals: P=0, Q=0, R=0 


In 5-8, write an input/output table for the circuit in the 


referenced exercise. 
5. Exercise 1 6. Exercise 2 
7. Exercise 3 8. Exercise 4 


In 9-12, find the Boolean expression that corresponds to 
the circuit in the referenced exercise. 

9. Exercise 1 10. Exercise 2 
11. Exercise 3 12. Exercise 4 
Construct circuits for the Boolean expressions in 13-17. 
14. ~(Pv Q) 


16. (PAQ)V~R 


133. ~PVQ 
15. PV (~PA~Q) 
17. (PA ~Q)V(~PAR) 


For each of the tables in 18-21, construct (a) a Boolean 
expression having the given table as its truth table and (b) 
a circuit having the given table as its input/output table. 


138. P Q R|S 
1 1 140 
1 1 Of] 1 
0 141] 0 
1 0 oO]| 0 
1 1 


0 1 

0 1 0 0 

0 0 1 0 

0 0 0 0 
o| p o rls 

1 1 1 

1 

1 

1 

0 

0 

0 0 1 (0) 

0 0 0 0 
20.[> o Rls 

1 1 fi -41 

1 1 0 0 

1 0 i) 

1 0 0 0 

0 1 1 0 

0 1 0 0 

0 0 1 0 

0 0 0 1 
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21. 


P Q R/S 
1 1 #140 
1 t oļi 
1 (0) 1 0 
1 0 0 0 
Oo 1 dig 
0 1 (0) 1 
0 0 1 0 
0 0 0 0 


22. Design a circuit to take input signals P, Q, and R 
and output a | if, and only if, P and Q have the 
same value and Q and R have opposite values. 


23. Design a circuit to take input signals P, Q, and R 
and output a | if, and only if, all three of P, Q, and 
R have the same value. 


24. The lights in a classroom are controlled by two 
switches: one at the back of the room and one at 
the front. Moving either switch to the opposite 
position turns the lights off if they are on and on if 
they are off. Assume the lights have been installed 
so that when both switches are in the down posi- 
tion, the lights are off. Design a circuit to control 
the switches. 


AND 

25. An alarm system has three different control panels “> 

in three different locations. To enable the system, 

switches in at least two of the panels must be in b. P 

the on position. If fewer than two are in the on 

position, the system is disabled. Design a circuit to Q 

control the switches. For the circuits corresponding to the Boolean expressions 
Use the properties listed in Theorem 2.1.1 to show that in each of 30 and 31 there is an equivalent circuit with at 
each pair of circuits in 26-29 have the same input/output most two logic gates. Find such a circuit. 
table. (Find the Boolean expressions for the circuits and 30. (PA 

ls V(~PAQ)V(~PA~ 
show that they are logically equivalent when regarded as ( Q)v( Q)v( Q) 
statement forms.) 31. (~PA ~Q)V(~PAQ)V (PA ~Q) 
26. a p 32. The Boolean expression for the circuit in 
a a i] Dar Example 2.4.5 is 
(PAQAR)V(PA~QAR)V (PA~QOA~R) 
b. P (a disjunctive normal form). Find a circuit with at 


o Da AND most three logic gates that is equivalent to this circuit. 

33. a. Show that for the Sheffer stroke 
PAQ=(P|Q)|(P|Q). 

b. Use the results of Example 2.4.7 and part (a) 


above to write P ^A (~Q v R) using only 
Sheffer strokes. 


` 
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34. Show that the following logical equivalences hold c PAQ=(P\P)J\(Q)Q) 
for the Peirce arrow J, where P | Q = ~(P v Q). H d. Write P — Q using Peirce arrows only. 
a ~P=P}]P e. Write P <> Q using Peirce arrows only. 


b. PvQ=(P)Q)1(PJ 9) 


ANSWERS FOR TEST YOURSELF 


1. the output signal(s) that correspond to all possible the input signals 3. outputs a 1 for exactly one particular 
combinations of input signals to the circuit 2. a Boolean combination of input signals and outputs 0’s for all other 
expression that represents the input signals as variables combinations 4. they have the same input/output table 
and indicates the successive actions of the logic gates on 5. NOT; AND 6. NOT; OR 


Application: Number Systems 
and Circuits for Addition 


Counting in binary is just like counting in decimal if you are all thumbs. —Glaser and Way 


In elementary school, you learned the meaning of decimal notation: that to interpret a 
string of decimal digits as a number, you mentally multiply each digit by its place value. 
For instance, 5,049 has a 5 in the thousands place, a 0 in the hundreds place, a 4 in the tens 
place, and a 9 in the ones place. Thus 


5,049 = 5-(1,000) + 0-(100) + 4-(10) + 9-(1). 
Using exponential notation, this equation can be rewritten as 
5,049 = 5-10° +. 0-10° + 4-10! + 9-10°. 


More generally, decimal notation is based on the fact that any positive integer can be writ- 
ten uniquely as a sum of products of the form 


d-10", 


where each n is a nonnegative integer and each d is one of the decimal digits 0, 1, 2, 3, 4, 
5, 6, 7, 8, or 9. The word decimal comes from the Latin root deci, meaning “ten.” Decimal 
(or base 10) notation expresses a number as a string of digits in which each digit’s posi- 
tion indicates the power of 10 by which it is multiplied. The right-most position is the ones 
place (or 10° place), to the left of that is the tens place (or 10' place), to the left of that is the 
hundreds place (or 107 place), and so forth, as illustrated below. 


uiti 10° 10° 10! 10° 
thousands hundreds tens ones 
Decimal Digit 5 0 4 9 


Binary Representation of Numbers 


There is nothing sacred about the number 10; we use 10 as a base for our usual number 
system because we happen to have ten fingers. In fact, any integer greater than 1 can serve 
as a base for a number system. In computer science, base 2 notation, or binary notation, 
is of special importance because the signals used in modern electronics are always in one 
of only two states. (The Latin root bi means “two.”) 
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In Section 5.4, we show that any integer can be represented uniquely as a sum of prod- 
ucts of the form 


d: gi 
where each n is an integer and each d is one of the binary digits (or bits) 0 or 1. For example, 
27 = 16+8+2+1 
= 1-24+1-27+0-27+1-2'+1-2" 


In binary notation, as in decimal notation, we write just the binary digits, and not the 
powers of the base. In binary notation, then, 


1-241-24+0-2?4+ 1-2'41- 2° 


= 


STan = 1101 15 


where the subscripts indicate the base, whether 10 or 2, in which the number is written. 
The places in binary notation correspond to the various powers of 2. The right-most posi- 
tion is the ones place (or 3? place), to the left of that is the twos place (or yy place), to the 
left of that is the fours place (or 2? place), and so forth, as illustrated below. 


> 2 2? 2! 2 
Place : z 
sixteens eights fours twos ones 
Binary Digit 1 1 0 1 1 


As in the decimal notation, leading zeros may be added or dropped as desired. For 
example, 


00319 = 319 = 1-2'+1-2° = 11, = 011). 


CASA Binary Notation for Integers from 1to 9 


Derive the binary notation for the integers from | to 9. 


Solution 1,9 = res b 
l= 1-2'+0-2°= 10, 
Sa 1:2!+1-2?= 11, 
4ig= 1-2?+0-2!+0-2°= 100, 
= 1-2°?+0-2'+1-2°= 101, 
Gy = 1-2°>4+1-2'+0-2°= 110, 
Tq = 1:-27+1-2'4+1-2°= 111, 


81o = 1:2°+0-27+0-2'+0-2° = 1000, 


919 = 1:2°+0-27+0-2'+1-2° = 1001, = 
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A list of powers of 2 is useful for doing binary-to-decimal and decimal-to-binary con- 
versions. See Table 2.5.1. 


TABLE 2.5.1 Powers of 2 
Power of 2 aa) 22 I sat 27 Ve? ca a a | Id | a 
Decimal Form | 1024 | 512 | 256 | 128 | 64 | 32) 16/8 /41]21]1 


CAE Converting a Binary to a Decimal Number 
Represent 110101, in decimal notation. 
Solution 110101, = 1-2°+1-27+0-23+1-27+0-2'+1-2° 
= 32+16+4+1 
= 5319 


Alternatively, the schema below may be used. 


CAUSE Converting a Decimal to a Binary Number 


Represent 209 in binary notation. 


Solution Use Table 2.5.1 to write 209 as a sum of powers of 2, starting with the highest 
power of 2 that is less than 209 and continuing to lower powers. 
Since 209 is between 128 and 256, the highest power of 2 that is less than 209 is 128. Hence 


209 |, = 128 + a smaller number. 


Now 209 — 128 = 81, and 81 is between 64 and 128, so the highest power of 2 that is less 
than 81 is 64. Hence 


209,9 = 128 + 64 + a smaller number. 
Continuing in this way, you obtain 
209,9 = 128 +64+16+1 
= 1-274+1-2°+0-2°+1-24+0-2°+0-27+0-2'+1-2° 


For each power of 2 that occurs in the sum, there is a 1 in the corresponding position 
of the binary number. For each power of 2 that is missing from the sum, there is a O in the 
corresponding position of the binary number. Thus 


209,) = 11010001, E 


Another procedure for converting from decimal to binary notation is discussed in 
Section 5.1. 
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A Binary Addition and Subtraction 


The computational methods of binary arithmetic are analogous to those of decimal arith- 
metic. In binary arithmetic the number 2 (which equals 10, in binary notation) plays a role 
similar to that of the number 10 in decimal arithmetic. 


Caution! Do not read 
10, as “ten”; it is the num- 
ber two. Read 10, as “one 
oh base two.” 


Addition in Binary Notation 


Add 1101, and 111, using binary notation. 
Solution Because 2,) = 10, and 1;ọ = 1,, the translation of 1;ọ + 1iọ = 2)) to binary 
notation is 
l2 
+ 1, 
10, 


It follows that adding two 1’s together results in a carry of 1 when binary notation is used. 
Adding three 1’s together also results in a carry of 1 since 3;) = 11, (“one one base two”). 


<— carry row 


ee 
ole 0e 
= 
N 


CES Subtraction in Binary Notation 


Subtract 1011, from 11000, using binary notation. 


Solution In decimal subtraction the fact that 10; ) — 119 = 949 is used to borrow across 
several columns. For example, consider the following: 


2 9 


1 <— borrow row 
sab O10 
= 5 819 
9 4 210 


In binary subtraction it may also be necessary to borrow across more than one column. 
But when you borrow a 1, from 10>, what remains is 15. 
10, 


— 1 2 
l2 
Thus the subtraction can be performed as follows: 


Oi J 
(48 6 6 <— borrow row 
2 


- 1011, 
i: 1 Ot E 
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Circuits for Computer Addition 


Consider the question of designing a circuit to produce the sum of two binary digits P 
and Q. Both P and Q can be either 0 or 1. And the following facts are known: 


eae = 10,, 
1,+0, =1,=01,, 
0,+1,=1, =01,, 
0 +0) = 0; = 00>. 


It follows that the circuit must have two outputs—one for the left binary digit (this 
is called the carry) and one for the right binary digit (this is called the sum). The carry 
output is 1 if both P and Q are 1; it is O otherwise. Thus the carry can be produced using 
the AND-gate circuit that corresponds to the Boolean expression P \ Q. The sum output 
is 1 if either P or Q, but not both, is 1. The sum can, therefore, be produced using a circuit 
that corresponds to the Boolean expression for exclusive or: (P VQ) \ ~(P ^ Q). (See 
Example 2.4.3(a).) Hence, a circuit to add two binary digits P and Q can be constructed as 
in Figure 2.5.1. This circuit is called a half-adder. 


HALF-ADDER 
Circuit Input/OutputTable 
P Q Carry | Sum 
1 1 1 0 
sum 1 o| o 1 
0 1 0 1 
oe 0 | o | 0 | 0 


FIGURE 2.5.1 Circuit to Add P + Q, Where P and Q Are Binary Digits 


Now consider the question of how to construct a circuit to add two binary integers, each 
with more than one digit. Because the addition of two binary digits may result in a carry to 
the next column to the left, it may be necessary to add three binary digits at certain points. 
In the following example, the sum in the right column is the sum of two binary digits, and, 
because of the carry, the sum in the left column is the sum of three binary digits. 

1 | <carry row 
1, 
+ 11, 
110, 


Thus, in order to construct a circuit that will add multidigit binary numbers, it is neces- 
sary to incorporate a circuit that will compute the sum of three binary digits. Such a circuit 
is called a full-adder. Consider a general addition of three binary digits P, Q, and R that 
results in a carry (or left-most digit) C and a sum (or right-most digit) S. 

P 

+ Q 
+R 
CS 

The operation of the full-adder is based on the fact that addition is a binary operation: 
Only two numbers can be added at one time. Thus P is first added to Q and then the result 
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is added to R. For instance, consider the following addition: 


1, 


‘hit, = Ols hi. + 1, = 10; 


+b 
10, 


The process illustrated here can be broken down into steps that use half-adder circuits. 


Step 1: Add P and Q using a half-adder to obtain a binary number with two digits. 
P 
+ Q 
Cis 


Step 2: Add R to the sum C, S, of P and Q. 


CS; 
+ R 


To do this, proceed as follows: 
Step 2a: Add R to S; using a half-adder to obtain the two-digit number C35. 
Si 
+ R 
CS 


Then S is the right-most digit of the entire sum of P, Q, and R. 


Step 2b: Determine the left-most digit, C, of the entire sum as follows: First note that it 
is impossible for both C; and C; to be 1’s. For if C, = 1, then P and Q are both 
1, and so Sı = 0. Consequently, the addition of S; and R gives a binary number 
C,S, where C, = 0. Next observe that C will be a 1 in the case that the addition 
of P and Q gives a carry of 1 or in the case that the addition of S; (the right-most 
digit of P+ Q) and R gives a carry of 1. In other words, C = 1 if, and only if, 
Cı = 1 or C, = 1. It follows that the circuit shown in Figure 2.5.2 will compute 
the sum of three binary digits. 


FULL-ADDER 
Circuit Input/Output Table 
P P|Q|R/|C\S 
e filia 1 
Q 

C, 1}1;0;)1 70 
TOTII O 

half-adder #2 
R s LOO OL 
Oy Ly Lyd 
Oo;1;,0);04; 1 
0;oO;1)04; 1 
0;0;0);04)0 


FIGURE 2.5.2 Circuit to Add P+ Q + R, Where P, Q, and R Are Binary Digits 
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Two full-adders and one half-adder can be used together to build a circuit that will add 
two three-digit binary numbers PQR and STU to obtain the sum WXYZ. This is illustrated 
in Figure 2.5.3. Such a circuit is called a parallel adder. Parallel adders can be constructed 
to add binary numbers of any finite length. 


R Sp=Z 
half-adder 
U 1 
S,=Y 
full-adder 
T 2 
S3=X 
P full-adder 
C;=W 


FIGURE 2.5.3 A Parallel Adder to Add PQR and STU to Obtain WXYZ 


Two’s Complements and the Computer Representation 
of Signed Integers 


Typically a fixed number of bits is used to represent integers on a computer. One way to 
do this is to select a particular bit, normally the left-most, to indicate the sign of the inte- 
ger, and to use the remaining bits for its absolute value in binary notation. The problem 
with this approach is that the procedures for adding the resulting numbers are somewhat 
complicated and the representation of 0 is not unique. A more common approach is to use 
“two’s complements,” which makes it possible to add integers quite easily and results in a 
unique representation for 0. Bit lengths of 64 and (sometimes) 32 are most often used in 
practice, but, for simplicity and because the principles are the same for all bit lengths, this 
discussion will focus on a bit length of 8. 

We will show how to use eight bits to represent the 256 integers from —128 through 127 
and how to perform additions and subtractions within this system of numbers. When the 
more realistic 32-bit two’s complements system is used, more than 4 billion integers can 
be represented. 


Definition 


The 8-bit two’s complement for an integer a between —128 and 127 is the 8-bit 


ifa=O0 


a 
bi tation fi 
inary representation for ie a 


Thus the 8-bit representation for a nonnegative integer is the same as its 8-bit binary 
representation. As a concrete example for the negative integer —46, observe that 


(2®—|—46])19 = (256 — 46)i9 = 21019 = (128 +64 + 16 + 2)19 = 11010010». 


and so the 8-bit two’s complement for —46 is 11010010. 
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For negative integers, however, there is a more convenient way to compute two’s com- 
plements, which involves less arithmetic than applying the definition directly. 


The 8-Bit Two’s Complement for a Negative Integer 


The 8-bit two’s complement for a negative integer a that is at least —128 can be 
obtained as follows: 


e Write the 8-bit binary representation for |a]. 


e Switch all the 1’s to 0’s and all the 0’s to 1’s. (This is called flipping, or comple- 
menting, the bits.) 


e Add | in binary notation. 


Selves Finding a Two’s Complement 


Use the method described above to find the 8-bit two’s complement for —46. 


Solution Write the 8-bit binary representation for |—46| (=46), switch all the 1’s to 0’s 
and all the 0’s to 1’s, and then add 1. 


|-46| 19 = 4619 = 82+8 +4 + 2) = 00101110, ŽPS 11010001 “““S 11010010. 


Note that this is the same result as was obtained directly from the definition. E 


The fact that the method for finding 8-bit two’s complements works in general depends 
on the following facts: 


1. The binary representation of 28 — 1 is 11111111). 
2. Subtracting an 8-bit binary number a from 11111111, switches all the 1’s to 0’s 


and all the 0’s to I’s. 
3. 23 — |a| = [(2°-1)—|a|]+1 for any number a. 


Here is how the facts are used when a = —46: 


[ala jalijil iJi] ies 


[oļoļiıfoļiıļı[|ı[|o]e1-%i 


0’s and 1’s are 7 
switched 


nonan ass 


1 is added + 0/00] 010/0 0J]-1 |F 


CEE] e*s 


Because 127 is the largest integer represented in the 8-bit two’s complement system and 
because 127:9 = O1111111,, all the 8-bit two’s complements for nonnegative integers have 
a leading bit of 0. Moreover, because the bits are switched, the leading bit for all the nega- 
tive integers is 1. Table 2.5.2 illustrates the 8-bit two’s complement representations for the 
integers from —128 through 127. 
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TABLE 2.5.2 
Decimal Form of 
8-Bit Two’s Two’s Complement for 
Integer Complement Negative Integers 
127 01111111 
126 01111110 
2 00000010 
00000001 
0 00000000 
= 11111111 84 
=2 11111110 28—272 
=3 11111101 28-3 
=127 10000001 28 — 127 
—128 10000000 78 _ 128 


Observe that if the two’s complement procedure is used on 11010010, which is the two’s 
complement for —46, the result is 
11010010 22> 0101101 — > 00101110, 
which is the two’s complement for 46. In general, if the two’s complement procedure is 
applied to a positive or negative integer in two’s complement form, the result is the negative 
(or opposite) of that integer. The only exception is the number —128. (See exercise 37a.) 


To find the decimal representation of the negative integer with a given 8-bit two’s 
complement: 


e Apply the two’s complement procedure to the given two’s complement. 


e Write the decimal equivalent of the result. 


SeluyeewA Finding a Number with a Given Two’s Complement 
What is the decimal representation for the integer with two’s complement 10101001? 
Solution Since the left-most digit is 1, the integer is negative. Applying the two’s com- 
plement procedure gives the following result: 


10101001 PEE. 01010110 — > 01010111; 


=(64+16+4+2+ 1) = 8719 = |- 87li0. 


So the answer is —87. You can check its correctness by deriving the two’s complement of 
—87 directly from the definition: 


(28 — |—87|)19 = (256 — 87)19 = 16949 = (128 +32+8+1),)=10101001, E 
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Addition and Subtraction with Integers 
in Two’s Complement Form 


The main advantage of a two’s complement representation for integers is that the same 
computer circuits used to add nonnegative integers in binary notation can be used for both 
additions and subtractions of integers in a two’s complement system of numeration. First 
note that because of the algebraic identity 


a—b = a+ (—b) for all real numbers, 


any subtraction problem can be changed into an addition one. For example, suppose you 
want to compute 78 — 46. This equals 78 + (—46), which should give an answer of 32. To 
see what happens when you add the numbers in their two’s complement forms, observe 
that the 8-bit two’s complement for 78 is the same as the ordinary binary representation 
for 78, which is 01001110 because 78 = 64+ 8 + 4 + 2, and, as previously shown, the 8-bit 
two’s complement for —46 is 11010010. Adding the numbers using binary addition gives 
the following: 


+)/1];1);o0;1;/0}]0);14]0) 46 


1]/0;0);1/;0;/0}]0);,0] 0] 32 


The result has a carry bit of 1 in the ninth, or th, position, but if you discard it, you ob- 
tain 00100000, which is the correct answer in 8-bit two’s complement form because, since 
32 = 2", 


321o = 00100000). 


In general, if you add numbers in 8-bit two’s complement form and get a carry bit of 
1 in the ninth, or 2°th position, you should discard it. Using this procedure is equivalent 
to reducing the sum of the numbers “modulo 28” and it gives results that are correct in 
ordinary decimal arithmetic as long as the sum of the two numbers is within the fixed-bit- 
length system of integer representations you are using, in this case those between —128 and 
127. The fact that this method produces correct results follows from general properties of 
modular arithmetic, which is discussed at length in Section 8.4. 


General Procedure for Using 8-Bit Two’s Complements to Add Two Integers 


To add two integers in the range —128 through 127 whose sum is also in the range 
—128 through 127: 


e Convert both integers to their 8-bit two’s complement representations. 


e Add the resulting integers using ordinary binary addition, discarding any carry bit 
of 1 that may occur in the 2°th position. 


e Convert the result back to decimal form. 


When integers are restricted to the range —128 through 127, you can easily 
imagine adding two integers and obtaining a sum outside the range. For instance, 
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(—87) + (—46) = — 133, which is less than — 128 and, therefore, requires more than eight 
bits for its representation. Because this result is outside the 8-bit fixed-length register 
system imposed by the architecture of the computer, it is often labeled “overflow error.” 
In the more realistic environment where integers are represented using 64 bits, they can 
range from less than — 10! to more than 10'°. So a vast number of integer calculations 
can be made without producing overflow error. And even if a 32-bit fixed integer length 
is used, nearly 4 billion integers are represented within the system. 

Detecting overflow error turns out to be quite simple. The 8-bit two’s complement sum 
of two integers will be outside the range from — 128 through 127 if, and only if, the integers 
are both positive and the sum computed using 8-bit two’s complements is negative, or if the 
integers are both negative and the sum computed using 8-bit two’s complement is positive. 
To see a concrete example for how this works, consider trying to add (—87) and (—46). 
Here is what you obtain: 


1;O0; rT) 1} ry 1),o0]; 141 


When you discard the | in the 2th position, you find that the leading digit of the result 
is 0, which would mean that the number with the two’s complement representation for 
the sum of two negative numbers would be positive. So the computer signals an over- 
flow error.* 


Hexadecimal Notation 


It should now be obvious that numbers written in binary notation take up much more space 
than numbers written in decimal notation. Yet many aspects of computer operation can 
best be analyzed using binary numbers. Hexadecimal notation is even more compact than 
decimal notation, and it is much easier to convert back and forth between hexadecimal and 
binary notation than it is between binary and decimal notation. The word hexadecimal 
comes from the Greek root hex-, meaning “six,” and the Latin root deci-, meaning “ten.” 
Hence hexadecimal refers to “sixteen,” and hexadecimal notation is also called base 16 
notation. Hexadecimal notation is based on the fact that any integer can be uniquely ex- 
pressed as a sum of numbers of the form 


d:16", 


where each n is a nonnegative integer and each d is one of the integers from 0 to 15. In 
order to avoid ambiguity, each hexadecimal digit must be represented by a single symbol. 
The integers 10 through 15 are represented by the symbols A, B, C, D, E, and F. The 16 
hexadecimal digits are shown in Table 2.5.3, together with their decimal equivalents and, 
for future reference, their 4-bit binary equivalents. 


*If the carry bit had not been discarded and if the resulting 9 bits could be processed using a “9-bit two’s 
complement conversion procedure,” the result of 101111011 would convert to —133, which is the correct 
answer. However, the computer signals an error because —133 is not representable within its 8-bit two’s 
complement system. 
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TABLE 2.5.3 
Decimal Hexadecimal 4-Bit Binary Equivalent 

0 0 0000 

1 0001 

2 2 0010 

3 3 0011 

4 4 0100 

5 5 0101 

6 6 0110 

T 7 0111 

8 8 1000 

9 9 1001 

10 A 1010 

11 B 1011 
12 C 1100 
13 D 1101 
14 E 1110 
15 F 1111 


Selves Converting from Hexadecimal to Decimal Notation 
Convert 3CF), to decimal notation. 


Solution A schema similar to the one introduced in Example 2.5.2 can be used here. 


| l r = 15 
>12- 16 = 192 
> 3-256 = 768 
97510 
So 3CFi¢ = 97510. | 


Now consider how to convert from hexadecimal to binary notation. In the example 
below the numbers are rewritten using powers of 2, and the laws of exponents are applied. 
The result suggests a general procedure. 


| L—+ 10-169 = (23+ 2)-1 =23+2 since 10 = 23 + 2 
> 0-16! = 0-24 = 0 since 16! = 24 
> 5-167 = (274+ 1)-28 = 2104 28 since 5 = 2? + 1, 16? = (24)? = 28 and 22 -28 = 210 
> 12-167 = (23 + 27)-2!27 = 2542!4 since 12 = 23+ 2, 162 = (24)3 = 2!2, 
23 .212 = oi). and 22.212 E> 214 
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But 
(2! +2") (20 + 2%) +04 (2? +2) 
= 11000000 00000000, + 0101 00000000, by the rules for writing 


+ 00000000, + 10105 binary numbers. 
So 


C50A;6 = 1100 0101 0000 1010, 
= = ae) 


Cis 5i Ois Ato by the rules for adding 
binary numbers. 


The procedure illustrated in this example can be generalized. In fact, the following 
sequence of steps will always give the correct answer. 


To convert an integer from hexadecimal to binary notation: 


e Write each hexadecimal digit of the integer in 4-bit binary notation. 


e Juxtapose the results. 


ZEWG AAJ Converting from Hexadecimal to Binary Notation 
Convert BO9F;, to binary notation. 


Solution Bio = 11jo = 1011), O16 = Oio = 0000», 916 = 940 = 1001), and Fis = 1549 = 
1111,. Consequently, 
B 0 9 F 


t t t t 
1011 0000 1001 1111 


and the answer is 1011000010011111. m 


To convert integers written in binary notation into hexadecimal notation, reverse the 
steps of the previous procedure. Note that the commonly used computer representation for 
integers uses 32 bits. When these numbers are written in hexadecimal notation only eight 
characters are needed. 


To convert an integer from binary to hexadecimal notation: 


e Group the digits of the binary number into sets of four, starting from the right and 
adding leading zeros as needed. 


e Convert the binary numbers in each set of four into hexadecimal digits. Juxtapose 
those hexadecimal digits. 


Seve Converting from Binary to Hexadecimal Notation 
Convert 100110110101001, to hexadecimal notation. 


Solution First group the binary digits in sets of four, working from right to left and add- 
ing leading 0’s if necessary. 


0100 1101 1010 1001. 
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Convert each group of four binary digits into a hexadecimal digit. 


0100 1101 1010 1001 


t t t t 
4 D A 9 


Then juxtapose the hexadecimal digits. 


4ADA9 E 


Reading a Memory Dump 


The smallest addressable memory unit on most computers is one byte, or eight bits. In some 
debugging operations a dump is made of memory contents; that is, the contents of each 
memory location are displayed or printed out in order. To save space and make the output 
easier on the eye, the hexadecimal versions of the memory contents are given, rather than 
the binary versions. Suppose, for example, that a segment of the memory dump looks like 


A3 BB 59 2E. 
What is the actual content of the four memory locations? 
Solution 
A3; = 10100011, 


BB), = 10111011, 
59,6 = 01011001, 
2E), = 00101110, | 
TEST YOURSELF 
1. To represent a nonnegative integer in binary nota- 6. To find the 8-bit two’s complement of a negative 
tion means to write it as a sum of products of the integer a that is at least —128, you ——, 5 
form , where and 


2. To add integers in binary notation, you use the 7. To add two integers in the range —128 through 
facts that l + 1, = and 1,+1,+1,= 127 whose sum is also in the range —128 through 
ooy 127, you g ; , and 

3. To subtract integers in binary notation, you use the 8. To represent a nonnegative integer in hexadecimal 
facts that 10, — l1, = and 11,—-1, = ___. notation means to write it as a sum of products of 

4. A half-adder is a digital logic circuit that ; oe pune 
and a full-adder is a digital logic circuit that 9. To convert a nonnegative integer from hexadeci- 

5. Ifa is an integer with —128 = a = 127, the 8-bit nial Onin Rowen you ane 


EXERCISE SET 2.5 


two’s complement of a is if a = 0 and is 


ifa<0. 


Represent the decimal integers in 1-6 in binary notation. 


1. 
4. 


19 2. 55 3. 287 
458 5. 1609 6. 1424 
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Represent the integers in 7-12 in decimal notation. 
7. 11105 8. 10111, 9. 110110, 
10. 1100101, 11. 1000111, 12. 1011011, 
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Perform the arithmetic in 13-20 using binary notation. 


3. 1011, 14. 1001, 
+ 10l, + 10115 
15. 101101, 16. 110111011, 
+ 11101, + 1001011010, 
17. 10100, 18. 11010, 
— 1101, — 1101, 
a 101101, 20. 1010100, 
— 10011, — 10111, 


21. Give the output signals S and T for the circuit 
shown below if the input signals P, Q, and R are 
as specified. Note that this is not the circuit for a 
full-adder. 


=1 

=0 

=1 
P 
Q AND S 

C2 
half-adder #2 

R F 


22. Add 11111111, + 1, and convert the result to deci- 
mal notation, to verify that 11111111, = (28 — 1) jo. 


Find the 8-bit two’s complements for the integers in 23-26. 
23. —23 24. —67 25. —4 26. —115 


Find the decimal representations for the integers with the 
8-bit two’s complements given in 27-30. 


27. 11010011 28. 10011001 

29. 11110010 30. 10111010 

Use 8-bit two’s complements to compute the sums in 31-36. 
31. 57+ (—118) 32. 62 +(—18) 

33. (—6) + (—73) 34. 89+(—55) 

35. (—15) +(—46) 36. 123+ (—94) 


ANSWERS FOR TEST YOURSELF 


37. a. Show that when you apply the 8-bit two’s com- 
plement procedure to the 8-bit two’s complement 
for — 128, you get the 8-bit two’s complement 
for — 128. 

*b. Show that if a, b, and a + b are integers in the 
range 1 through 128, then 


(2-a) + (2 -— b) = (28 -lath +t = 29 +27. 


Explain why it follows that if integers a, b, and 
a+b are all in the range 1 through 128, then 
the 8-bit two’s complement of (—a) + (—b) is a 
negative number. 


Convert the integers in 38-40 from hexadecimal to deci- 
mal notation. 


38. A2BCi, 39. EOD, 40. 39EB,, 


Convert the integers in 41-43 from hexadecimal to binary 
notation. 


41. ICOABE,, 42. BS3DF8,, 43. 4ADF83j, 


Convert the integers in 44-46 from binary to hexadeci- 
mal notation. 


44. 00101110, 
46. 11001001011100- 


45. 1011011111000101, 


47. Octal Notation: In addition to binary and 
hexadecimal, computer scientists also use 

octal notation (base 8) to represent numbers. 

Octal notation is based on the fact that any 

integer can be uniquely represented as a sum 

of numbers of the form d:8”, where each n 

is a nonnegative integer and each d is one of 

the integers from 0 to 7. Thus, for example, 

5073; = 5:8? +0-8° +7°8' + 3-8° = 261919. 

a. Convert 61502; to decimal notation. 

b. Convert 20763, to decimal notation. 

c. Describe methods for converting integers from 
octal to binary notation and the reverse that are 
similar to the methods used in Examples 2.5.9 
and 2.5.10 for converting back and forth from 
hexadecimal to binary notation. Give examples 
showing that these methods result in correct 
answers. 


1. d-2"; d = 0 or d = 1, and n is a nonnegative integer 
2.105; 11, 3.15; 10, 4. outputs the sum of any two 
binary digits; outputs the sum of any three binary 
digits 5. the 8-bit binary representation of a; the 8-bit 
binary representation of 28—a 6. write the 8-bit binary 
representation of a; flip the bits; add 1 in binary notation 


7. convert both integers to their 8-bit two’s complements; 
add the results using binary notation; truncate any leading 
1; convert back to decimal form 8. d-16";d = 0, 1,2,... 
9, A, B, C, D, E, F, and n is a nonnegative integer 9. write 
each hexadecimal digit in 4-bit binary notation; juxtapose 
the results 
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THE LOGIC OF QUANTIFIED 
STATEMENTS 


In Chapter 2 we discussed the logical analysis of compound statements—those made of 
simple statements joined by the connectives ~, A, V, —, and <>. Such analysis casts light 
on many aspects of human reasoning, but it cannot be used to determine validity in the 
majority of everyday and mathematical situations. For example, the argument 


All men are mortal. 
Socrates is a man. 


.. Socrates is mortal. 


is intuitively perceived as correct. Yet its validity cannot be derived using the methods out- 
lined in Section 2.3. To determine validity in examples like this, it is necessary to separate 
the statements into parts in much the same way that you separate declarative sentences 
into subjects and predicates. And you must analyze and understand the special role played 
by words that denote quantities such as “all” or “some.” The symbolic analysis of predi- 
cates and quantified statements is called the predicate calculus. The symbolic analysis 
of ordinary compound statements (as outlined in Sections 2.1—2.3) is called the statement 
calculus (or the propositional calculus). 


EAN Predicates and Quantified Statements | 


. it was not till within the last few years that it has been realized how fundamental 
any and some are to the very nature of mathematics. —A. N. Whitehead (1861-1947) 


As noted in Section 2.1, the sentence “+2 = 11” is not a statement because it may be 
either true or false depending on the value of x. Similarly, the sentence “x + y > 0” is not a 
statement because its truth value depends on the values of the variables x and y. 

In grammar, the word predicate refers to the part of a sentence that gives information 
about the subject. In the sentence “James is a student at Bedford College,” the word James 
is the subject and the phrase is a student at Bedford College is the predicate. The predicate 
is the part of the sentence from which the subject has been removed. 

In logic, predicates can be obtained by removing some or all of the nouns from a state- 
ment. For instance, let P stand for “is a student at Bedford College” and let Q stand for “is a 
student at.” Then both P and Q are predicate symbols. The sentences “‘x is a student at Bed- 
ford College” and “x is a student at y” are symbolized as P(x) and as Q(x, y), respectively, 
where x and y are predicate variables that take values in appropriate sets. When concrete 
values are substituted in place of predicate variables, a statement results. For simplicity, 
we define a predicate to be a predicate symbol together with suitable predicate variables. 
In some other treatments of logic, such objects are referred to as propositional functions 
or open sentences. 


108 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


Note Recall that we read 
these symbols as “the 

set of all x in D such that 
PO’ 


3.1 PREDICATES AND QUANTIFIED STATEMENTS | 109 


Definition 


A predicate is a sentence that contains a finite number of variables and becomes 
a statement when specific values are substituted for the variables. The domain of a 
predicate variable is the set of all values that may be substituted in place of the variable. 


Finding Truth Values of a Predicate 


Let P(x) be the predicate “x” > x” with domain the set R of all real numbers. Write 
P(2), PÍ), and PÍ -3), and indicate which of these statements are true and which are false. 


Solution 


P(2): 2?>2, or 4>2. True. 


When an element in the domain of the variable of a one-variable predicate is substituted 
for the variable, the resulting statement is either true or false. The set of all such elements 
that make the predicate true is called the truth set of the predicate. 


Definition 


If P(x) is a predicate and x has domain D, the truth set of P(x) is the set of all ele- 
ments of D that make P(x) true when they are substituted for x. The truth set of 
P(x) is denoted 


{x E D| P(@x)}. 


Finding the Truth Set of a Predicate 

Let Q(n) be the predicate “n is a factor of 8.” Find the truth set of Q(n) if 
a. the domain of n is Z*, the set of all positive integers 

b. the domain of n is Z, the set of all integers. 


Solution 

a. The truth set is {1, 2, 4, 8} because these are exactly the positive integers that 
divide 8 evenly. 

b. The truth set is {1, 2, 4, 8, — 1, —2, —4, —8} because the negative integers — 1, —2, —4, 
and —8 also divide into 8 without leaving a remainder. a 


The Universal Quantifier: V 


One sure way to change predicates into statements is to assign specific values to all their 
variables. For example, if x represents the number 35, the sentence “x is (evenly) divisible 
by 5” is a true statement since 35 = 5-7. Another way to obtain statements from predicates 
is to add quantifiers. Quantifiers are words that refer to quantities such as “some” or “all” 
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CHAPTER3 THE LOGIC OF QUANTIFIED STATEMENTS 


and tell for how many elements a given predicate is true. The formal concept of quantifier 
was introduced into symbolic logic in the late nineteenth century by the American philoso- 
pher, logician, and engineer Charles Sanders Peirce and, independently, by the German 
logician Gottlob Frege. 

The symbol V is called the universal quantifier. Depending on the context, it is read as 
“for every,” “for each,” “for any,” “given any,” or “for all.” For example, another way to express 
the sentence “Every human being is mortal” or “All human beings are mortal” is to write 


V human beings x, x is mortal, 


which you would read as “For every human being x, x is mortal.” If you let H be the set of 
all human beings, then you can symbolize the statement more formally by writing 


Fine Art Images/Glow Images 


Vx E H, xis mortal. 


Charles Sanders Peirce : eae A ; . A 
(1839-1914) Think of the symbol x as an individual but generic object, with all the properties shared 


by every human being but with no other properties. Because x is individual, even if you 


Note Think “for every” read V as “for all,’ you should use the singular verb and say, “For all x in H, x is mortal” 
when you see the symbol Y. rather than “For all x in H, x are mortal.” 


In a universally quantified sentence the domain of the predicate variable is generally 
indicated either between the V symbol and the variable name (as in V human being x) or 
immediately following the variable name (as in Vx E H). In sentences containing a mix- 
ture of symbols and words, the V symbol can refer to two or more variables. For instance, 
you could symbolize “For all real numbers x and y, x+y = y +x.” as “V real numbers x 
and y, x+y = y+x.* 

Sentences that are quantified universally are defined as statements by giving them the 
truth values specified in the following definition: 


Definition 


Let Q(x) be a predicate and D the domain of x. A universal statement is a statement 
of the form “vx E D, Q(x)” It is defined to be true if, and only if, Q(x) is true for 
each individual x in D. It is defined to be false if, and only if, Q(x) is false for at least 
one x in D. A value for x for which Q(x) is false is called a counterexample to the 


Pictorial Press Ltd./Alamy Stock Photo 


Gottlob Frege universal statement. 


(1848-1925) 


Gescesfe Truth and Falsity of Universal Statements 
a. Let D = {1, 2, 3, 4, 5}, and consider the statement 
Vx € Dx =x. 
Write one way to read this statement out loud, and show that it is true. 


b. Consider the statement 
Vx ER, Ex. 


Find a counterexample to show that this statement is false. 


*More formal versions of symbolic logic would require a separate V for each variable: 
“yx E R(Vy E R& +y = y + x)).” 
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Solution 


a. “For every x in the set D, x is greater than or equal to x.” The inequalities below show 
that “x” = x” is true for each individual x in D. 


a) FS) Fes ad 525. 


2 . 
Hence “Vx E D, x“ = x” is true. 


b. Counterexample: The statement claims that xX = x for every real number x. But when 


x= L, for example, 
1\* 121 


2 . 
Hence “Vx € R, x° = x” is false. E 


The technique used to show the truth of the universal statement in Example 3.1.3(a) is 
called the method of exhaustion. It consists of showing the truth of the predicate sepa- 
rately for each individual element of the domain. (The idea is to exhaust the possibilities 
before you exhaust yourself!) This method can, in theory, be used whenever the domain 
of the predicate variable is finite. In recent years the prevalence of digital computers has 
greatly increased the convenience of using the method of exhaustion. Computer expert 
systems, or knowledge-based systems, use this method to arrive at answers to many of 
the questions posed to them. Because most mathematical sets are infinite, however, the 
method of exhaustion can rarely be used to derive general mathematical results. 


The Existential Quantifier: 3 


The symbol 43 denotes “there exists” and is called the existential quantifier. For example, 
the sentence “There is a student in Math 140” can be written as 


Note Think “there Ja person p such that p is a student in Math 140, 
exists” when you see the 


symbol 3. or, more formally, 


dp E P such that p is a student in Math 140, 


where P is the set of all people. The domain of the predicate variable is generally indicated 
either between the 3 symbol and the variable name or immediately following the variable 
name, and the words such that are inserted just before the predicate. Some other expres- 
sions that can be used in place of there exists are there is a, we can find a, there is at least 
one, for some, and for at least one. In a sentence such as “J integers m and n such that 
m+n = m:n, the 4 symbol is understood to refer to both m and n.* 

Sentences that are quantified existentially are defined as statements by giving them the 
truth values specified in the following definition. 


Definition 


Let Q(x) be a predicate and D the domain of x. An existential statement is a state- 
ment of the form “4.x E D such that Q(x).” It is defined to be true if, and only if, Q(x) 
is true for at least one x in D. It is false if, and only if, Q(x) is false for all x in D. 


*In more formal versions of symbolic logic, the words such that are not written out (although they are under- 
stood) and a separate 4 symbol is used for each variable: “3m E Z Gn E Zim +n = m-n)).” 
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Truth and Falsity of Existential Statements 
a. Consider the statement 
4m € Z* such that m = m. 
Write one way to read this statement out loud, and show that it is true. 
b. Let E = {5, 6, 7, 8} and consider the statement 
Jm € E such that m = m. 
Show that this statement is false. 


Solution 


a. “There is at least one positive integer m such that m? = m?” Observe that 1° = 1. Thus 
“m? = m” is true for a positive integer m, and so “Sm E Z” such that m = m” is true. 


b. Note that m? = m is not true for any integers m from 5 through 8: 
5 =25 +5, 6 =3646, T =49+7, 8 =64#8. 
Thus “Im € E such that m? = m” is false. E 


Formal vs. Informal Language 


It is important to be able to translate from formal to informal language when trying to 
make sense of mathematical concepts that are new to you. It is equally important to be able 
to translate from informal to formal language when thinking out a complicated problem. 


CAA Translating from Formal to Informal Language 


Rewrite the following formal statements in a variety of equivalent but more informal ways. 
Do not use the symbol Y or J. 


a. Vx ER, x =0. 
b. Vx ER, #-1. 
c. Im E Z* such that m? = m. 


Solution 


a. Every real number has a nonnegative square. 
Or: All real numbers have nonnegative squares. 
Or: Any real number has a nonnegative square. 
Or: The square of each real number is nonnegative. 


b. All real numbers have squares that do not equal —1. 
Or: No real numbers have squares equal to — 1. 
(The words none are or no ... are are equivalent to the words all are not.) 


Note In ordinary English, c. There is a positive integer whose square is equal to itself. 
the fourth statement in Or: We can find at least one positive integer equal to its own square. 
part (c) may be taken to Or: Some positive integer equals its own square. 


mean that there are at least Or: Some positive integers equal their own squares. | 
two positive integers equal 


to their own squares. In 


mathematics we under Another way to restate universal and existential statements informally is to place the 


stand the last two state- quantification at the end of the sentence. For instance, instead of saying “For any real num- 
ments in part (c) to mean ber x, x” is nonnegative,” you could say “x? is nonnegative for any real number x.” In such 
the same thing. a case the quantifier is said to “trail” the rest of the sentence. 
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Example 3.1.6 


Example 3.1.7 


Note The following two 
sentences mean the same 
thing: “All triangles have 
three sides” and “Every 

triangle has three sides.” 


Example 3.1.8 


Example 3.1.9 
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Trailing Quantifiers 

Rewrite the following statements so that the quantifier trails the rest of the sentence. 
a. For any integer n, 2n is even. 

b. There exists at least one real number x such that x” < 0. 

Solution 

a. 2n is even for any integer n. 


b. x? <0 for some real number x. 
Or: x < 0 for at least one real number x. E 


Translating from Informal to Formal Language 

Rewrite each of the following statements formally. Use quantifiers and variables. 
a. All triangles have three sides. 

b. No dogs have wings. 

c. Some programs are structured. 


Solution 


a. V triangle ż, t has three sides. 
Or: Vt E T, t has three sides (where T is the set of all triangles). 


b. V dog d, d does not have wings. 
Or: Vd E D, d does not have wings (where D is the set of all dogs). 


c. Ja program p such that p is structured. 
Or: Sp E P such that p is structured (where P is the set of all programs). E 


Universal Conditional Statements 


A reasonable argument can be made that the most important form of statement in math- 
ematics is the universal conditional statement: 


Yx, if P(x) then Q(x). 


Familiarity with statements of this form is essential if you are to learn to speak mathematics. 


Writing Universal Conditional Statements Informally 
Rewrite the following statement informally, without quantifiers or variables. 


Yx E R, if x > 2 then x > 4. 


Solution Ifa real number is greater than 2, then its square is greater than 4. 
Or: Whenever a real number is greater than 2, its square is greater than 4. 
Or: The square of any real number greater than 2 is greater than 4. 


Or: The squares of all real numbers greater than 2 are greater than 4. E 


Writing Universal Conditional Statements Formally 
Rewrite each of the following statements in the form 


V „if then 
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a. If areal number is an integer, then it is a rational number. 
b. All bytes have eight bits. 
c. No fire trucks are green. 


Solution 


a. V real number x, if x is an integer, then x is a rational number. 
Or: Yx E R, if x E Z then x E Q. 


b. Vx, if x is a byte, then x has eight bits. 


c. Vx, if x is a fire truck, then x is not green. 


It is common, as in (b) and (c) above, to omit explicit identification of the domain of 
predicate variables in universal conditional statements. E 


Careful thought about the meaning of universal conditional statements leads to another 
level of understanding for why the truth table for an if-then statement must be defined as it 
is. Consider again the statement 


Y real number x, if x > 2 then L >4. 
Your experience and intuition tell you that this statement is true. But that means that 
If x > 2 then x > 4 


must be true for every single real number x. Consequently, it must be true even for values 
of x that make its hypothesis “x > 2” false. In particular, both statements 


If1>2then1?>4 and If —3 > 2 then (—3% > 4 


must be true. In both cases the hypothesis is false, but in the first case the conclusion 
“17> 4” is false, and in the second case the conclusion “(—3y > 4” is true. Hence, if an 
if-then statement has a false hypothesis, we have to interpret it as true regardless of whether 
its conclusion is true or false. 

Note also that the definition of valid argument is a universal conditional statement: 


For every combination of truth values for the component statements, 
if the premises are all true then the conclusion is also true. 


Equivalent Forms of Universal and Existential Statements 


Observe that the two statements “V real number x, if x is an integer then x is rational” and 
“Y integer x, x is rational” mean the same thing because the set of integers is a subset of 
the set of real numbers. Both have informal translations “All integers are rational.” In fact, 
a statement of the form 


vx E U, if P(x) then Q(x) 
can always be rewritten in the form 
Vx E D, Q(x) 


by narrowing U to be the subset D consisting of all values of the variable x that make P(x) 
true. Conversely, a statement of the form 


Vx E D, Q(x) 
can be rewritten as 


Vx, if x is in D then Q(x). 
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SEWAGE AALI Equivalent Forms for Universal Statements 


Rewrite the following statement in the two forms “Vx, if then ” and 
“y x, _____”: All squares are rectangles. 
Solution Vx, if x is a square then x is a rectangle. 

Y square x, x is a rectangle. E 


Similarly, a statement of the form “Sx such that P(x) and Q(x)” can be rewritten as 
“Jx E D such that Q(x,” where D is the set of all x for which P(x) is true. 


Equivalent Forms for Existential Statements 


A prime number is an integer greater than 1 whose only positive integer factors are it- 
self and 1. Consider the statement “There is an integer that is both prime and even.” Let 
Prime(n) be “n is prime” and Even(7) be “n is even.” Use the notation Prime(n) and Even(n) 
to rewrite this statement in the following two forms: 


a. Jn such that A 
b. 4 n such that 
Solution 


a. dn such that Prime(n) ^A Even(n). 


b. Two answers: J a prime number n such that Even(n). 
Jan even number n such that Prime(). E 


Bound Variables and Scope 


Consider the statement “For every integer x, x° = 0.” First note that you don’t have to call 
the variable x. You can use any name for it as long as you do so consistently. For instance, 
all the following statements have the same meaning: 


For every integer x, x =0. For every integer n, n=0. For every integer s, ’=0. 


In each case the variable simply holds a place for any element in the set of all integers. Each 
way of writing the statement says that whatever integer you might choose, when you square 
it the result will be nonnegative. It is important to note, however, that once you finish writ- 
ing the statement, whatever symbol you chose to use in it can be given an entirely different 
meaning when used in a different context. 

For example, consider the following statements: 


(1) For every integer x, wr =0. 
(2) There exists a real number x such that x =8. 


Statements (1) and (2) both call the variable x, but the x in Statement (1) serves a different 
function from the x in Statement (2). We say that the variable x is bound by the quantifier 
that controls it and that its scope begins when the quantifier introduces it and ends at the 
end of the quantified statement. 

The way variables are used in mathematics is similar to the way they are used in com- 
puter programming. A variable in a computer program also serves as a placeholder in the 
sense that it creates a location in computer memory (either actual or virtual) into which 
its values can be placed. In addition the way it can be bound in a program is similar to the 
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way that a mathematical variable can be bound in a statement. For example, consider the 
following two examples in Python: 


Program 1 Program 2 

def £(): def f(): 
X= "Hi" g="Hi" 
print X print S 

def g(): def g(): 
X = "Bye” S = "Bye" 
print X print sS 

£() £() 

g() g() 


The output for both programs is 


Hi 
Bye 


In each case the variable—whether X or S—is local to the function where it is defined. It 
is created each time the function is called and destroyed as soon as the call is complete. 
The local variable is bound by the function that defines it, and its scope is restricted to 
that function. Outside of the function definition the variable name can be used for any 
other purpose. That is why the functions f and g are allowed to use the same name for the 
variable in their definitions and why f and g define the same functions in both programs. 


Implicit Quantification 
Consider the statement 
If a number is an integer, then it is a rational number. 


As shown earlier, this statement is equivalent to a universal statement. However, it does 
not contain the telltale word all or every or any or each. The only clue to indicate its univer- 
sal quantification comes from the presence of the indefinite article a. This is an example 
of implicit universal quantification. 

Existential quantification can also be implicit. For instance, the statement “The number 
24 can be written as a sum of two even integers” can be expressed formally as “J even 
integers m and n such that 24 = m+n.” 

Mathematical writing contains many examples of implicitly quantified statements. 
Some occur, as in the first example above, through the presence of the word a or an. Oth- 
ers occur in cases where the general context of a sentence supplies part of its meaning. For 
example, in an algebra course in which the letter x is always used to indicate a real number, 
the predicate 


If x > 2 then xX? > 4 
is interpreted to mean the same as the statement 
For every real number x, if x > 2 then > 4. 


Mathematicians often use a double arrow to indicate implicit quantification symbolically. 
For instance, they might express the above statement as 


x>2 => ¥>4. 
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Let P(x) and Q(x) be predicates and suppose the common domain of x is D. 


e The notation P(x) = Q(x) means that every element in the truth set of P(x) is in the 
truth set of Q(x), or, equivalently, Vx, P@&) — Qw). 


e The notation P(x) <= Q(x) means that P(x) and Q(x) have identical truth sets, or, 
equivalently, Vx, P(x) <> QQ). 


seu enave Using = and 


Let 
O(n) be “n is a factor of 8,” 
R(n) be “n is a factor of 4,” 
S(n) be “n < 5 and n # 3,” 
and suppose the domain of n is Z*, the set of positive integers. Use the = and + symbols 
to indicate true relationships among Q(n), R(n), and S(n). 
Solution 
1. As noted in Example 3.1.2, the truth set of Q(n) is {1, 2, 4, 8} when the domain of n 
is Z*. By similar reasoning the truth set of R(n) is {1, 2, 4}. Thus it is true that every 
element in the truth set of R(n) is in the truth set of Q(n), or, equivalently, Vn in ZE, 
R(n) > Q(n). So R(n) = Q(n), or, equivalently 
nisa factor of4 = nisa factor of 8. 
2. The truth set of S(n) is {1, 2, 4}, which is identical to the truth set of R(n), or, equiva- 
lently, Vn in Zt, R(n) + Sn). So R(n) = S(n), or, equivalently, 
nisafactorof4 < n< 5andn #3. 
Moreover, since every element in the truth set of S(n) is in the truth set of Q(n), or, 
equivalently, Vn in Z*, S(n) > Q(n), then S(n) = Q(n), or, equivalently, 
n<Sandn#3 =  nisa factor of 8. | 


Some questions of quantification can be quite subtle. For instance, a mathematics text 
might contain the following: 


a. (xt1 =x? +2x+1. b. Solve 3x- 4 = 5. 


Although neither (a) nor (b) contains explicit quantification, the reader is supposed to un- 
derstand that the x in (a) is universally quantified, whereas the x in (b) is existentially 
quantified. When the quantification is made explicit, (a) and (b) become 


a. V real number x, (x + iby =x? +2x+1. 
b. Show (by finding a value) that 3 a real number x such that 3x — 4 = 5. 


The quantification of a statement—whether universal or existential—crucially deter- 
mines both how the statement can be applied and what method must be used to establish its 
truth. Thus it is important to be alert to the presence of hidden quantifiers when you read 
mathematics so that you will interpret statements in a logically correct way. 


Tarski’s World 


Tarski’s World is a computer program developed by information scientists Jon Barwise 
and John Etchemendy to help teach the principles of logic. It is described in their book 
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Alfred Tarski 
(1902-1983) 


TEST YOURSELF 
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The Language of First-Order Logic, which is accompanied by a CD containing the pro- 
gram Tarski’s World, named after the great logician Alfred Tarski. 


ZEWJGEARA Investigating Tarski’s World 


The program for Tarski’s World provides pictures of blocks of various sizes, shapes, and 
colors, which are located on a grid. Shown in Figure 3.1.1 is a picture of an arrangement of 
objects in a two-dimensional Tarski world. The configuration can be described using logical 
operators and—for the two-dimensional version—notation such as Triangle(x), meaning “x 
is a triangle,” Blue(y), meaning “y is blue,” and RightOf(x, y), meaning “x is to the right of 
y (but possibly in a different row).” Individual objects can be given names such as a, b, or c. 


Briscoe Center for American History 


FIGURE 3.1.1 


Determine the truth or falsity of each of the following statements. The domain for all vari- 
ables is the set of objects in the Tarski world shown in Figure 3.1.1. 


a. Vt, Triangle — Blue(?). 

b. Vx, Blue(x) > Triangle(). 

c. dy such that Square(y) ^A RightOf(d, y). 

d. 3z such that Square(z) ^A Gray(z). 

Solution 

a. This statement is true: Every triangle is blue. 

b. This statement is false. As a counterexample, note that e is blue and it is not a triangle. 
c. This statement is true because e and h are both square and d is to their right. 


d. This statement is false: All the squares are either blue or black. E 


Answers to Test Yourself questions are located at the end of each section. 


1. If P(x) is a predicate with domain D, the truth set 4. A statement of the form Vx € D, Q(x) is true if, 
of P(x) is denoted ___. We read these symbols and only if, Q(x) is for 
oul Loukas 5. A statement of the form 4x E D such that Q(x) is 
2. Some ways to express the symbol V in words are true if, and only if, Q(x) is for 
3. Some ways to express the symbol Jin words are 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


EXERCISE SET 3.1* 


3.1 PREDICATES AND QUANTIFIED STATEMENTS | 119 


1. A menagerie consists of seven brown dogs, two 5. Find the truth set of each predicate. 
black dogs, six gray cats, ten black cats, five blue a. Predicate: 6/d is an integer, domain: Z 
birds, six yellow birds, and one black bird. Deter- b. Predicate: 6/d is an integer, domain: Z* 
mine which of the following statements are true c. Predicate: 1 = x? < 4, domain: R 
and which are false. d. Predicate: 1 < x? < 4, domain: Z 
Ay There sai atiimnal m tie menagerie hat 1s tgu, 6. Let B(x) be “—10 < x < 10,” Find the truth set of 
b. Every animal in the menagerie is a bird or a À : 
aakannall B(x) for each of ane following domains. 
; i ao a. Z b. Z c. The set of all even integers 
c. Every animal in the menagerie is brown or 
gray or black. 7. Let S be the set of all strings of length 3 consist- 
d. There is an animal in the menagerie that is ing of a’s, b’s, and c’s. List all the strings in S that 
neither a cat nor a dog. satisfy the following conditions: 
e. No animal in the menagerie is blue. 1. Every string in S begins with b. 
f. There are in the menagerie a dog, a cat, and a 2. No string in S has more than one c. 
a Me pice ole 8. Let T be the set of all strings of length 3 consisting 
2. Indicate which of the following statements are true of 0’s and 1’s. List all the strings in T that satisfy 
and which are false. Justify your answers as best the following conditions: 
as you can. 1. For every string s in T, the second character 
a. Every integer is a real number. of s is 1 or the first two characters of s are 
b. 0 is a positive real number. the same. 
c. For every real number r, —r is a negative real 2. No string in T has all three characters the 
number. same. 
d. Every real number is an integer. ' : 
Find counterexamples to show that the statements in 
3. Let R(m, n) be the predicate “If m is a factor of n? 9-12 are false. 
then m is a factor of n, with domain for both m 9. Vx ER x= Ux. 
and n being Z the set of integers. 
a. Explain why R(m, n) is false if m = 25 and 10. Va E Z, (a — 1)/a is not an integer. 
n= 10. pice ch 
b. Give values different from those in part (a) for hy V Poste tegur tegna n yet Tae 
which R(m, n) is false. 12. V real numbers x and y, Vx +y = Vx+ Vy. 
i ee Wily RU PTS nee =a and 13. Consider the following statement: 
d. Give values different from those in part (c) for V basketball player x, x is tall. 
Winch Gime) te TEU, Which of the following are equivalent ways of 
4. Let Q(x, y) be the predicate “If x < y then L< ya expressing this statement? 


with domain for both x and y being R the set of 
real numbers. 


a. Every basketball player is tall. 
b. Among all the basketball players, some 


a. Explain why Q(x, y) is false if x = —2 and are tall. 
y=1. c. Some of all the tall people are basketball 
b. Give values different from those in part (a) for players. 


which Q(x, y) is false. 
c. Explain why Q(x, y) is true if x = 3 and y = 8. 
d. Give values different from those in part (c) for 
which Q(x, y) is true. 


d. Anyone who is tall is a basketball player. 

All people who are basketball players are tall. 

f. Anyone who is a basketball player is a tall 
person. 


p 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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14. 


H 15. 


16. 


17. 


18. 


CHAPTER3 THE LOGIC OF QUANTIFIED STATEMENTS 


Consider the following statement: 
Jx E R such that x° = 2. 


Which of the following are equivalent ways of 

expressing this statement? 

a. The square of each real number is 2. 

b. Some real numbers have square 2. 

c. The number x has square 2, for some real 
number x. 

d. If xis areal number, then x =2. 

e. Some real number has square 2. 

f. There is at least one real number whose square 
is 2. 


Rewrite the following statements informally in at 
least two different ways without using variables or 
quantifiers. 

a. V rectangle x, x is a quadrilateral. 

b. Ja set A such that A has 16 subsets. 


Rewrite each of the following statements in the 

form “VY X; = 

a. All dinosaurs are extinct. 

b. Every real number is positive, negative, or 
Zero. 

c. No irrational numbers are integers. 

d. No logicians are lazy. 

e. The number 2,147,581,953 is not equal to the 
square of any integer. 

f. The number —1 is not equal to the square of 
any real number. 


Rewrite each of the following in the form “4 —___ 
x such that $ 

a. Some exercises have answers. 

b. Some real numbers are rational. 


Let D be the set of all students at your school, and 

let M(s) be “s is a math major,” let C(s) be “s is a 

computer science student,” and let E(s) be “s is an 

engineering student.” Express each of the follow- 

ing statements using quantifiers, variables, and the 

predicates M(s), C(s), and E(s). 

a. There is an engineering student who is a math 
major. 

b. Every computer science student is an engineer- 
ing student. 

c. No computer science students are engineering 
students. 

d. Some computer science students are also math 
majors. 

e. Some computer science students are engineer- 
ing students and some are not. 
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H 20. 


21. 


22. 


23. 


24. 


25. 


Consider the following statement: 
V integer n, if n? is even then n is even. 


Which of the following are equivalent ways of 

expressing this statement? 

a. All integers have even squares and are even. 

b. Given any integer whose square is even, that 
integer is itself even. 

c. For all integers, there are some whose square 
is even. 

d. Any integer with an even square is even. 

e. Ifthe square of an integer is even, then that 
integer is even. 

f. All even integers have even squares. 


Rewrite the following statement informally in at 
least two different ways without using variables or 
the symbol Y or the words “for all.” 


V real numbers x, if x is positive 
then the square root of x is positive. 


Rewrite the following statements so that the quan- 

tifier trails the rest of the sentence. 

a. For any graph G, the total degree of G is even. 

b. For any isosceles triangle T, the base angles of 
T are equal. 

c. There exists a prime number p such that p is 
even. 

d. There exists a continuous function f such that f 
is not differentiable. 


Rewrite each of the following statements in the 

form “V x, if then = 

a. All Java programs have at least 5 lines. 

b. Any valid argument with true premises has a 
true conclusion. 


Rewrite each of the following statements in 

the two forms “Vx, if then ” and 

“Vy, ” (without an if-then). 

a. All equilateral triangles are isosceles. 

b. Every computer science student needs to take 
data structures. 


Rewrite the following statements in the two forms 
“A x such that ” and “3x such that 
and i 

a. Some hatters are mad. 

b. Some questions are easy. 


The statement “The square of any rational number 
is rational” can be rewritten formally as “For all 
rational numbers x, x? is rational” or as “For all Xx, 


if x is rational then x’ is rational.” Rewrite each 
of the following statements in the two forms 


“y xX, ” and “Vx, if , then —— ” 

or in the two forms “V x and y, ” and 

“Vx and y, if , then 2 

a. The reciprocal of any nonzero fraction is a 
fraction. 


b. The derivative of any polynomial function is a 
polynomial function. 

c. The sum of the angles of any triangle is 180°. 

d. The negative of any irrational number is 
irrational. 

e. The sum of any two even integers is even. 

f. The product of any two fractions is a fraction. 


26. Consider the statement “All integers are ratio- 
nal numbers but some rational numbers are not 


integers.” 
a. Write this statement in the form “Vx, if 
then , but d x such that ___..” 


b. Let Ratl(x) be “x is a rational number” and 
Int(x) be “x is an integer.” Write the given 
statement formally using only the symbols 
Ratl(@), Int), Y, 3, A, V, ~, and >. 


27. Refer to the picture of Tarski’s world given in Ex- 
ample 3.1.13. Let Above(x, y) mean that x is above 
y (but possibly in a different column). Determine 
the truth or falsity of each of the following state- 
ments. Give reasons for your answers. 
a. Vu, Circle(u) — Gray(u). 
b. Vu, Gray(u) — Circle(u). 
c. Jy such that Square(y) ^A Above(y, d). 

d. 3z such that Triangle(z) ^ Above(f, z). 

In 28-30, rewrite each statement without using quanti- 


fiers or variables. Indicate which are true and which are 
false, and justify your answers as best as you can. 


28. Let the domain of x be the set D of objects dis- 
cussed in mathematics courses, and let Real(x) be 
“x is a real number,” Pos(x) be “x is a positive real 
number,” Neg(x) be “x is a negative real number,” 
and Int(x) be “x is an integer.” 
a. Pos(0) 


b. Vx, Real(x) ^A Neg(x) — Pos(—x) 


ANSWERS FOR TEST YOURSELF 
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c. Vx, Int(x) — Real(x) 
d. 4x such that Real) ^A ~Int(x) 


29. Let the domain of x be the set of geometric figures 
in the plane, and let Square(x) be “x is a square” 
and Rect(x) be “x is a rectangle.” 


a. 4x such that Rect(x) ^ Square(x) 


b. 4x such that Rect(x) \ ~Square(x) 
c. Vx, Square(x) — Rect(x) 


30. Let the domain of x be Z, the set of integers, and 
let Odd(x) be “x is odd,” Prime(x) be “x is prime,” 
and Square(x) be “x is a perfect square.” (An inte- 
ger n is said to be a perfect square if, and only if, 
it equals the square of some integer. For example, 
25 is a perfect square because 25 = 57.) 


a. 4x such that Prime(x) A ~ Odd(x) 
b. Vx, Prime(x) — ~Square(x) 
c. dx such that Odd) ^A Square(x) 


H 31. In any mathematics or computer science text other 
than this book, find an example of a statement that 
is universal but is implicitly quantified. Copy the 
statement as it appears and rewrite it making the 
quantification explicit. Give a complete citation 
for your example, including title, author, publisher, 


year, and page number. 


32. Let R be the domain of the predicate variable x. 
Which of the following are true and which are 
false? Give counter examples for the statements 
that are false. 

a x>2>x>1 

b. x>2>x >4 
co 2 >42 
dvr >45 |x| =2 


33. Let R be the domain of the predicate variables a, 
b, c, and d. Which of the following are true and 
which are false? Give counterexamples for the 
statements that are false. 

a. a>Oandb>0=>ab>0 
b. a< 0andb<0=ab<0 
c. ab=0>a=0orb=0 

d. a < bandc < d= ac < bd 


1. {x E D|P(x)}; the set of all x in D such that P(x) 
2. Possible answers: for every, for any, for each, for 
arbitrary, given any, forall 3. Possible answers: there 


exists, there exist, there exists at least one, for some, for 


at least one, we can finda 4. true; every x in D (Some 
alternative answers: all x in D; each individual x in D) 
5. true; at least one x in D (Alternative answer: some 


xin D) 
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EFJ Predicates and Quantified Statements II 


TOUCHSTONE: Stand you both forth now: stroke your chins, and swear by your 
beards that I am a knave. 

CELIA: By our beards—if we had them—thou art. 

TOUCHSTONE: By my knavery—if I had it—then I were; but if you swear by that 
that is not, you are not forsworn. —William Shakespeare, As You Like It 


This section continues the discussion of predicates and quantified statements begun in 
Section 3.1. It contains the rules for negating quantified statements; an exploration of the 
relation among V, 3, ^, and V; an introduction to the concept of vacuous truth of universal 
statements; examples of variants of universal conditional statements; and an extension of 
the meaning of necessary, sufficient, and only if to quantified statements. 


Negations of Quantified Statements 


Consider the statement “All mathematicians wear glasses.” Many people would say 
that its negation is “No mathematicians wear glasses,” but if even one mathemati- 
cian does not wear glasses, then the sweeping statement that a// mathematicians wear 
glasses is false. So a correct negation is “There is at least one mathematician who does 
not wear glasses.” 

The general form of the negation of a universal statement follows immediately from 
the definitions of negation and of the truth values for universal and existential statements. 


Theorem 3.2.1 Negation of a Universal Statement 


The negation of a statement of the form 
Vx in D, Ox) 
is logically equivalent to a statement of the form 
ax in D such that ~ Q(x). 
Symbolically, 


~(vx € DOO = ax € D such that ~ QQ). 


Thus 


The negation of a universal statement (“all are”) is logically equivalent to an 
existential statement (“‘some are not” or “‘there is at least one that is not”). 


Note that when we speak of logical equivalence for quantified statements, we mean 
that the statements always have identical truth values no matter what predicates are sub- 
stituted for the predicate symbols and no matter what sets are used for the domains of 
the predicate variables. 

Now consider the statement “Some snowflakes are the same.” What is its negation? For 
this statement to be false means that not a single snowflake is the same as any other. In 
other words, “No snowflakes are the same,” or “All snowflakes are different.” 
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The general form for the negation of an existential statement follows immediately from 
the definitions of negation and of the truth values for existential and universal statements. 


Theorem 3.2.2 Negation of an Existential Statement 


The negation of a statement of the form 
Jx in D such that Q(x) 
is logically equivalent to a statement of the form 
Vx in D, ~O(). 
Symbolically, 


~(dx € D such that O(@)) = Vx € D, ~ QQ). 


Thus 


The negation of an existential statement (“‘some are”) is logically 
equivalent to a universal statement (“none are” or “all are not”). 


CAESA Negating Quantified Statements 


Write formal negations for the following statements: 

a. V primes p, p is odd. 

b. Ja triangle T such that the sum of the angles of T equals 200°. 

Solution 

a. By applying the rule for the negation of a V statement, you can see that the answer is 
Ja prime p such that p is not odd. 


b. By applying the rule for the negation of a 3 statement, you can see that the answer is 


V triangles T, the sum of the angles of T does not equal 200°. a 


You need to exercise special care to avoid mistakes when writing negations of state- 
ments that are given informally. One way to avoid error is to rewrite the statement formally 
and take the negation using the formal rule. 


CAFE More Negations 


Rewrite the following statements formally. Then write formal and informal negations. 
a. No politicians are honest. 


b. The number 1,357 is not divisible by any integer between 1 and 37. 


Solution 

a. Formal version: Y politicians x, x is not honest. 
Formal negation: Ja politician x such that x is honest. 
Informal negation: Some politicians are honest. 


b. This statement has a trailing quantifier. Written formally it becomes: 


V integer n between 1 and 37, 1,357 is not divisible by n. 
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Note Which is true: Its negation is therefore 
the statement in part (b) 
or its negation? Is 1,357 
divisible by some integer An informal version of the negation is 
between 1 and 37? Or is 

1,357 not divisible by any 
integer between 1 and 37? 


Jan integer n between 1 and 37 such that 1,357 is divisible by n. 


The number 1,357 is divisible by some integer between 1 and 37. a 


Another important way to avoid error when taking negations of statements, whether 
stated formally or informally, is to ask yourself, “What exactly would it mean for the given 
statement to be false? What statement, if true, would be equivalent to saying that the given 
statement is false?” 


CAMEE Still More Negations 


Write informal negations for the following statements: 

a. All computer programs are finite. 

b. Some computer hackers are over 40. 

Solution 

a. What exactly would it mean for this statement to be false? The statement asserts that 
all computer programs satisfy a certain property. So for it to be false, there would 


have to be at least one computer program that does not satisfy the property. Thus the 
answer is 


There is a computer program that is not finite. 
Or: Some computer programs are infinite. 


b. This statement is equivalent to saying that there is at least one computer hacker with 
a certain property. So for it to be false, not a single computer hacker can have that 
property. Thus the negation is 


No computer hackers are over 40. 


Or: All computer hackers are 40 or under. E 

A Informal negations of many universal statements can be constructed simply by in- 
serting the word not or the words do not at an appropriate place. However, the resulting 

Caution! Just inserting statements may be ambiguous. For example, a possible negation of “All mathematicians 


the word not to negate a 
quantified statement can 
result in a statement that 
is ambiguous. 


wear glasses” is “All mathematicians do not wear glasses.” The problem is that this 
sentence has two meanings. With the proper verbal stress on the word not, it could be in- 
terpreted as the logical negation. (What! You say that all mathematicians wear glasses? 
Nonsense! All mathematicians do not wear glasses.) On the other hand, stated in a flat 
tone of voice (try it!), it would mean that all mathematicians are nonwearers of glasses; 
that is, not a single mathematician wears glasses. This is a much stronger statement than 
the logical negation: It implies the negation but is not equivalent to it. 


Negations of Universal Conditional Statements 


Negations of universal conditional statements are of special importance in mathe- 
matics. The form of such negations can be derived from facts that have already been 
established. 

By definition of the negation of a for all statement, 


~ (Vx, P(x) > Qx) = dx such that ~(P(x) = Qœ). 3.2.1 
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But the negation of an if-then statement is logically equivalent to an and statement. More 
precisely, 
~(P@ —> QQ) = Pœ A ~ QQ). Bene 
Substituting (3.2.2) into (3.2.1) gives 
~ (Vx, P(x) > Q) = 3x such that (P(x) A ~Q). 


Written somewhat less symbolically, this becomes 


Negation of a Universal Conditional Statement 


~ (Vx, if P(x) then Q(x)) = 3x such that P(x) and ~ Q(x). 


Negating Universal Conditional Statements 


Write a formal negation for statement (a) and an informal negation for statement (b). 
a. V person p, if p is blond then p has blue eyes. 


b. If a computer program has more than 100,000 lines, then it contains a bug. 


Solution 
a. 4a person p such that p is blond and p does not have blue eyes. 


b. There is at least one computer program that has more than 100,000 lines and does not 
contain a bug. E 


The Relation among V, A, ^, and V 


The negation of a for all statement is a there exists statement, and the negation of a there 
exists statement is a for all statement. These facts are analogous to De Morgan’s laws, 
which state that the negation of an and statement is an or statement and that the negation of 
an or statement is an and statement. This similarity is not accidental. In a sense, universal 
statements are generalizations of and statements, and existential ments are generalizations 
of or statements. 

If Q(x) is a predicate and the domain D of x is the set {x1, X2, ..., Xn}, then the statements 


vx E D, Q(x) and Q(x) \ O02) A+++ A Qn) 
are logically equivalent. For example, let Q(x) be “x-x = x” and suppose D = {0, 1}. Then 
Vx E D, Q(x) 

can be rewritten as 

V binary digits x, x-x = x. 
This is equivalent to 

0-0=0 and 1:1=1, 
which can be rewritten in symbols as 


Q(0) ^ Qad). 
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Note In ordinary lan- 
guage the words in gen- 
eral mean that something 
is usually, but not always 
the case. (In general, I 
take the bus, but today I 
walked.) In mathemat- 
ics the words in general 
mean that something is 
always true. 


Similarly, if Q(x) is a predicate and D = {x,, %,...,x,}, then the statements 


dx E D such that Q(x) and Q(x) v Om) v = v O(x,) 


are logically equivalent. For example, let Q(x) be “x +x = x” and suppose D = {0, 1}. 
Then 


dx E D such that Q(x) 


can be rewritten as 


da binary digit x such that x +x = x. 
This is equivalent to 
0+0=0 or 14+1=1, 


which can be rewritten in symbols as 


Q(0) v QC). 


Vacuous Truth of Universal Statements 


Suppose a bowl sits on a table and next to the bowl is a pile of five blue and five gray balls, 
any of which may be placed in the bowl. If three blue balls and one gray ball are placed 
in the bowl, as shown in Figure 3.2.1(a), the statement “All the balls in the bowl are blue” 
would be false (since one of the balls in the bowl is gray). 

Now suppose that no balls at all are placed in the bowl, as shown in Figure 3.2.1(b). 
Consider the statement 


All the balls in the bowl are blue. 


Is this statement true or false? The statement is false if, and only if, its negation is true. 
And its negation is 


There exists a ball in the bowl that is not blue. 


But the only way this negation can be true is for there actually to be a nonblue ball in the 
bowl. And there is not! Hence the negation is false, and so the statement is true “by default.” 


BF & SS. 


(a) (b) 
FIGURE 3.2.1 


In general, a statement of the form 
vx in D, if P(x) then Q(x) 


is called vacuously true or true by default if, and only if, P(x) is false for every x in D. 
In mathematics, the words in general signal that what is to follow is a generalization of 
some aspect of the example that always holds true. 


Variants of Universal Conditional Statements 


Recall from Section 2.2 that a conditional statement has a contrapositive, a converse, and an 
inverse. The definitions of these terms can be extended to universal conditional statements. 
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Definition 


Consider a statement of the form Vx E D, if P(x) then Q(x). 


1. Its contrapositive is the statement Yx E D, if ~ Q(x) then ~P(x). 
2. Its converse is the statement Vx E D, if Q(x) then P(x). 
3. Its inverse is the statement Vx E D, if ~P(x) then ~ Q(x). 


CEMS Contrapositive, Converse, and Inverse of a Universal Conditional Statement 


Write a formal and an informal contrapositive, converse, and inverse for the following 
statement: 


If a real number is greater than 2, then its square is greater than 4. 


Solution The formal version of this statement is Vx € R, if x > 2 then >. 
Contrapositive: Yx E R, if xX < 4thenx = 2. 
Or: If the square of a real number is less than or equal to 4, 
then the number is less than or equal to 2. 


Converse: Yx E R, if X? > 4 then x > 2. 
Or: If the square of a real number is greater than 4, then the 
number is greater than 2. 


Inverse: Vx E R, if x S 2 then <4. 
Or: If a real number is less than or equal to 2, then the square 
of the number is less than or equal to 4. 


Note that in solving this example, we have used the equivalence of “x + a” and “x = a” 
for all real numbers x and a. (See page 47.) a 


In Section 2.2 we showed that a conditional statement is logically equivalent to its con- 
trapositive and that it is not logically equivalent to either its converse or its inverse. The 
following discussion shows that these facts generalize to the case of universal conditional 
statements and their contrapositives, converses, and inverses. 

Let P(x) and Q(x) be any predicates, let D be the domain of x, and consider the statement 


vx E D, if P(x) then Qx) 
and its contrapositive 


Vx E D, if ~Q(x) then ~P(x). 


Any particular x in D that makes “if P(x) then Q(x)” true also makes “if ~ Q(x) then ~ P(x)” 
true (by the logical equivalence between p — q and ~q — ~p). It follows that the sen- 
tence “If P(x) then Q(@)” is true for all x in D if, and only if, the sentence “If ~Q(x) then 
~ P(x)” is true for all x in D. 

Thus we write the following and say that a universal conditional statement is logically 
equivalent to its contrapositive: 


Yx E D, if P(x) then O(x) = Vx E D, if ~ Q(x) then ~P(x) 


In Example 3.2.5 we noted that the statement 
vx ER, if x >2thenx >4 


has the converse Vx E R, if xX > Athen x > 2. 
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Observe that the statement is true whereas its converse is false (since, for instance, 
(-3y° = 9 > 4 but —3 + 2). This shows that a universal conditional statement may have a 
different truth value from its converse. Hence a universal conditional statement is not logi- 
cally equivalent to its converse. This is written in symbols as follows: 


Yx E D, if P(x) then O(x) # Vx E D, if QŒ) then P(x). 


In exercise 35 at the end of this section, you are asked to provide an example to show that 
a universal conditional statement is not logically equivalent to its inverse. 


Vx E D, if P(x) then Q(x) £ Vx E D, if ~P(x) then ~Q(x). 


Necessary and Sufficient Conditions, Only If 


The definitions of necessary, sufficient, and only if can also be extended to apply to uni- 
versal conditional statements. 


Definition 


e “Vx, r(x) is a sufficient condition for s(x)” means “Vx, if r(x) then s(x).” 

e “Vx, r(x) is a necessary condition for s(x)’ means “Vx, if ~r(x) then ~s(x)” or, 
equivalently, “Vx, if s(x) then r.” 

e “Vx, r(x) only if s(x)” means “Vx, if ~s(x) then ~r” or, equivalently, “Vx, if r(x) 
then s(x).” 


Necessary and Sufficient Conditions 


Rewrite each of the following as a universal conditional statement, quantified either explic- 
itly or implicitly. Do not use the word necessary or sufficient. 


a. Squareness is a sufficient condition for rectangularity. 


b. Being at least 35 years old is a necessary condition for being president of the United 
States. 


Solution 
a. A formal version of the statement is 


Vx, if x is a square, then x is a rectangle. 
Or, with implicit universal quantification: 


If a figure is a square, then it is a rectangle. 


b. Using formal language, you could write the answer as 


V person x, if x is younger than 35, then x 
cannot be president of the United States. 


Or, by the equivalence between a statement and its contrapositive: 


V person x, if x is president of the United States, 
then x is at least 35 years old. E 
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1. 


EXERCISE SET 3.2 


only if 
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Rewrite the following as a universal conditional statement: 


A product of two numbers is 0 only if one of the numbers is 0. 


Solution Using informal language, you could write the answer as 


If it is not the case that one of two numbers is 0, 
then the product of the numbers is not 0. 


In other words, 


If neither of two numbers is 0, then the product of the numbers is not 0. 


Or, by the equivalence between a statement and its contrapositive: 


If a product of two numbers is 0, then one of the numbers is 0. E 


A negation for “All R have property S” is “There 
is R that T 


. A negation for “Some R have property S” is 


a3 ” 


. A negation for “For every x, if x has property P 


” 


then x has property Q” is “ 


. The converse of “For every x, if x has property P 


” 


then x has property Q” is “ 


. The contrapositive of “For every x, if x has prop- 


2”? 


erty P then x has property Q” is “ 


. The inverse of “For every x, if x has property P 


” 


then x has property Q” is “ 


1. Which of the following is a negation for “All 


2. 


discrete mathematics students are athletic”? More 

than one answer may be correct. 

a. There is a discrete mathematics student who is 
nonathletic. 

b. All discrete mathematics students are 
nonathletic. 

c. There is an athletic person who is not a dis- 
crete mathematics student. 

d. No discrete mathematics students are athletic. 

e. Some discrete mathematics students are 
nonathletic. 

f. No athletic people are discrete mathematics 
students. 


Which of the following is a negation for “All dogs 
are loyal’? More than one answer may be correct. 
a. All dogs are disloyal. 

No dogs are loyal. 

Some dogs are disloyal. 

Some dogs are loyal. 

There is a disloyal animal that is not a dog. 


panos 


f. There is a dog that is disloyal. 
g- No animals that are not dogs are loyal. 
h. Some animals that are not dogs are loyal. 


. Write a formal negation for each of the following 


statements. 

a. V string s, s has at least one character. 

b. Y computer c, c has a CPU. 

c. Ja movie m such that m is over 6 hours long. 

d. 3a band b such that b has won at least 10 
Grammy awards. 


. Write an informal negation for each of the follow- 


ing statements. Be careful to avoid negations that 
are ambiguous. 

a. All dogs are friendly. 

b. All graphs are connected. 

c. Some suspicions were substantiated. 

d. Some estimates are accurate. 


5. Write a negation for each of the following statements. 


a. Every valid argument has a true conclusion. 
b. All real numbers are positive, negative, or zero. 
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Write a negation for each statement in 6 and 7. 


6. a. Sets A and B do not have any points in common. 
b. Towns P and Q are not connected by any road 
on the map. 


7. a. This vertex is not connected to any other vertex 
in the graph. 
b. This number is not related to any even number. 


8. Consider the statement “There are no simple solu- 
tions to life’s problems.” Write an informal nega- 
tion for the statement, and then write the statement 
formally using quantifiers and variables. 


Write a negation for each statement in 9 and 10. 


9. VY real number x, if x > 3 then > 9. 


10. V computer program P, if P compiles without error 
messages, then P is correct. 


In each of 11-14 determine whether the proposed nega- 
tion is correct. If it is not, write a correct negation. 


11. Statement: The sum of any two irrational 
numbers is irrational. 
Proposed negation: The sum of any two irrational 
numbers is rational. 


12. Statement: The product of any irrational 
number and any rational num- 
ber is irrational. 

Proposed negation: The product of any irrational 
number and any rational num- 
ber is rational. 


13. Statement: For every integer n, if n is 
even then n is even. 
Proposed negation: For every integer n, if n’ is 
even then n is not even. 


14. Statement: For all real numbers x, and x), 
if x = x5 then x; = x2. 
Proposed negation: For all real numbers x, and x3, 
if x = x5 then xı # x2. 
15. Let D = {—48,—14,—8, 0, 1,3, 16, 23, 26, 32, 36}. 
Determine which of the following statements are 


true and which are false. Provide counterexamples 
for the statements that are false. 


a. Vx E D, if x is odd then x > 0. 
b. Vx E D, if x is less than 0 then x is even. 
c. Vx € D, if xis even then x =£ 0. 


d. Vx E D, if the ones digit of x is 2, then the tens 
digit is 3 or 4. 

e. Vx E D, if the ones digit of x is 6, then the tens 
digit is 1 or 2. 
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In 16-23, write a negation for each statement. 

16. Y real number x, if x? = 1 then x > 0. 

17. V integer d, if 6/d is an integer then d = 3. 

18. Vx E R, if x(x + 1) > 0 then x >Oorx<—l. 


19. Vn E Z, if n is prime then n is odd or n = 2. 


20. V integers a, b, and c, if a — b is even and b — c is 
even, then a — c is even. 


21. V integer n, if n is divisible by 6, then n is divisible 
by 2 and n is divisible by 3. 


22. Ifthe square of an integer is odd, then the integer 
is odd. 


23. If a function is differentiable then it is continuous. 


24. Rewrite the statements in each pair in if-then form 
and indicate the logical relationship between them. 
a. All the children in Tom’s family are female. 
All the females in Tom’s family are children. 
b. All the integers that are greater than 5 and end 
in 1, 3, 7, or 9 are prime. 
All the integers that are greater than 5 and are 
prime end in 1, 3, 7, or 9. 


25. Each of the following statements is true. In each 
case write the converse of the statement, and give a 
counterexample showing that the converse is false. 
a. Ifnis any prime number that is greater than 2, 
then n + 1 is even. 

b. If m is any odd integer, then 2m is even. 

c. If two circles intersect in exactly two points, 
then they do not have a common center. 

In 26-33, for each statement in the referenced exercise 

write the contrapositive, converse, and inverse. Indicate 

as best as you can which of these statements are true and 

which are false. Give a counterexample for each that is false. 


26. Exercise 16 27. Exercise 17 


28. Exercise 18 29. Exercise 19 
30. Exercise 20 31. Exercise 21 
32. Exercise 22 33. Exercise 23 


34. Write the contrapositive for each of the following 
statements. 

a. Ifnis prime, then n is not divisible by any 
prime number from 2 through Vn. (Assume 
that n is a fixed integer.) 

b. If A and B do not have any elements in com- 
mon, then they are disjoint. (Assume that 
A and B are fixed sets.) 


35. 


* 36. 


37. 


38. 


Give an example to show that a universal condition- 
al statement is not logically equivalent to its inverse. 


If P(x) is a predicate and the domain of x is the set 

of all real numbers, let R be “Vx E Z, Pœ), let S 

be “Vx E Q, P(x), and let T be “Vx E R, P(@).” 

a. Find a definition for P(x) (but do not use “x € Z”) 
so that R is true and both S and T are false. 


b. Find a definition for P(x) (but do not use “x E Q”) 
so that both R and S are true and T is false. 


Consider the following sequence of digits: 0204. 
A person claims that all the 1’s in the sequence 
are to the left of all the 0’s in the sequence. Is this 
true? Justify your answer. (Hint: Write the claim 
formally and write a formal negation for it. Is the 
negation true or false?) 


True or false? All occurrences of the letter u in Dis- 
crete Mathematics are lowercase. Justify your answer. 


Rewrite each statement of 39-44 in if-then form. 


39. 


40. 


41. 


42. 


43. 


ANSWERS FOR TEST YOURSELF 


Earning a grade of C— in this course is a suffi- 
cient condition for it to count toward graduation. 


Being divisible by 8 is a sufficient condition for 
being divisible by 4. 


Being on time each day is a necessary condition 
for keeping this job. 


Passing a comprehensive exam is a necessary 
condition for obtaining a master’s degree. 


A number is prime only if it is greater than 1. 
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44. A polygon is square only if it has four sides. 


Use the facts that the negation of a V statement is 

a4 statement and that the negation of an if-then 
statement is an and statement to rewrite each of the 
statements 45-48 without using the word necessary or 
sufficient. 


45. Being divisible by 8 is not a necessary condition 
for being divisible by 4. 


46. Having a large income is not a necessary condi- 
tion for a person to be happy. 


47. Having a large income is not a sufficient condition 
for a person to be happy. 


48. Being a polynomial is not a sufficient condition 
for a function to have a real root. 


49. The computer scientists Richard Conway and 
David Gries once wrote: 


The absence of error messages during 
translation of a computer program is only a 
necessary and not a sufficient condition for 
reasonable [program] correctness. 


Rewrite this statement without using the words 
necessary or sufficient. 


50. A frequent-flyer club brochure states, “You may 
select among carriers only if they offer the same 
lowest fare.” Assuming that “only if” has its for- 
mal, logical meaning, does this statement guaran- 
tee that if two carriers offer the same lowest fare, 
the customer will be free to choose between them? 
Explain. 


1. some (Alternative answers: at least one; an); does not 


x has property P. 5. For every x, if x does not have 


have property S. 2. No R have property S. 3. There property Q then x does not have property P. 6. For every 
is an x such that x has property P and x does not have x, if x does not have property P then x does not have 
property Q. 4. For every x, if x has property Q then property Q. 


EX] Statements with Multiple Quantifiers 


It is not enough to have a good mind. The main thing is to use it well. —René Descartes 


Imagine you are visiting a factory that manufactures computer microchips. The factory 
guide tells you, 


“There is a person supervising every detail of the production process.” 
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Note that this statement contains informal versions of both the existential quantifier there 
is and the universal quantifier every. Which of the following best describes its meaning? 


e There is one single person who supervises all the details of the production process. 


e For any particular production detail, there is a person who supervises that detail, but 
there might be different supervisors for different details. 


As it happens, either interpretation could be what the guide meant. (Reread the sentence to 
be sure you agree!) Taken by itself, his statement is genuinely ambiguous, although other 
things he may have said (the context for his statement) might have clarified it. In our ordi- 
nary lives, we deal with this kind of ambiguity all the time. Usually context helps resolve 
it, but sometimes we simply misunderstand each other. 

In mathematics, formal logic, and computer science, by contrast, it is essential that we 
all interpret statements in exactly the same way. For instance, the initial stage of software 
development typically involves careful discussion between a programmer analyst and a 
client to turn vague descriptions of what the client wants into unambiguous program speci- 
fications that client and programmer can mutually agree on. 

Because many important technical statements contain both J and VY, a convention has de- 
veloped for interpreting them uniformly. When a statement contains more than one kind 
of quantifier, we imagine the actions suggested by the quantifiers as being performed 
in the order in which the quantifiers occur. For instance, consider a statement of the form 


Vx in set D, Jy in set E such that x and y satisfy property P(x, y). 
To show that such a statement is true, you must be able to meet the following challenge: 


Imagine that someone is allowed to choose any element whatsoever from the set D, and 


Note The scope of Vx : , ; 
r imagine that the person gives you that element. Call it x. 


extends throughout the 
statement, whereas the e The challenge for you is to find an element y in E so that the person’s x and your y, taken 


scope of Sy starts in the together, satisfy property P(x, y). 
middle. That is why the 


value of y depends onthe Because you do not have to specify the y until after the other person has specified the x, 
value of x. you are allowed to find a different value of y for each different x you are given. 


CAESA Truth of a V5 Statement in a Tarski World 


Consider the Tarski world shown in Figure 3.3.1. 


D0 |A 
m a E 


i 


Show that the following statement is true in this world: 


FIGURE 3.3.1 


For every triangle x, there is a square y such that x and y have the same color. 
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Note The value of x 
cannot be changed once 
it is specified because 
the scope of dx extends 
throughout the entire 
statement. 
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Solution The statement says that no matter which triangle someone gives you, you will 
be able to find a square of the same color. There are only three triangles, d, f, and i. The fol- 
lowing table shows that for each of these triangles a square of the same color can be found. 


Given x = choose y = and check that y is the same color as x. 
d e yes ¥ 
j h 
fori org yes V . 


Now consider a statement containing both V and J, where the 3 comes before the V: 


dx in set D such that Vy in set E, x and y satisfy property P(x, y). 


To show that a statement of this form is true: 
You must find one single element (call it x) in D with the following property: 


e After you have found your x, someone is allowed to choose any element whatsoever 
from E. The person challenges you by giving you that element. Call it y. 


e Your job is to show that your x together with the person’s y satisfy property P(x, y). 


Your x has to work for any y the person might give you; you are not allowed to change 
your x once you have specified it initially. 


Truth of a SV Statement in a Tarski World 


Consider again the Tarski world in Figure 3.3.1. Show that the following statement is true: 
There is a triangle x such that for every circle y, x is to the right of y. 


Solution The statement says that you can find a triangle that is to the right of all the 
circles. Actually, either d or i would work for all of the three circles, a, b, and c, as you can 
see in the following table. 


Choose x = Then: given y = check that x is to the right of y. 
dori a yes ¥ 
b yes ¥ 
c eso 
sh E 


Here is a summary of the convention for interpreting statements with two different 
quantifiers: 


Interpreting Statements with Two Different Quantifiers 
If you want to establish the truth of a statement of the form 
Vx in D, Sy in E such that P(x, y) 


your challenge is to allow someone else to pick whatever element x in D they wish 
and then you must find an element y in E that “works” for that particular x. 


If you want to establish the truth of a statement of the form 
dx in D such that Vy in E, P(x, y) 


your job is to find one particular x in D that will “work” no matter what y in E any- 
one might choose to challenge you with. 
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CAMEE Interpreting Statements with More Than One Quantifier 


A college cafeteria line has four stations: salads, main courses, desserts, and beverages. 
The salad station offers a choice of green salad or fruit salad; the main course station of- 
fers spaghetti or fish; the dessert station offers pie or cake; and the beverage station offers 
milk, soda, or coffee. Three students, Uta, Tim, and Yuen, go through the line and make 
the following choices: 


Uta: green salad, spaghetti, pie, milk 
Tim: fruit salad, fish, pie, cake, milk, coffee 
Yuen: spaghetti, fish, pie, soda 


These choices are illustrated in Figure 3.3.2. 


Salads 


green salad 
fruit salad 


Uta z 
Main courses 


spaghetti 
fish 
Desserts 
pie 

cake 


Tim 


Yuen 
Beverages 


milk 
soda 
coffee 


FIGURE 3.3.2 


Write each of following statements informally and find its truth value. 

a. Jan item J such that V student S, S chose I. 

b. 3a student S such that V item J, S chose J. 

c. Ja student S such that V station Z, 3 an item Jin Z such that S chose 7. 

d. V student S and V station Z, 3 an item Jin Z such that § chose I. 

Solution 

a. There is an item that was chosen by every student. This is true; every student chose 


pie. 


b. There is a student who chose every available item. This is false; no student chose all 
nine items. 


c. There is a student who chose at least one item from every station. This is true; both 
Uta and Tim chose at least one item from every station. 


d. Every student chose at least one item from every station. This is false, Yuen did not 
choose a salad. E 


Translating from Informal to Formal Language 


Most problems are stated in informal language, but solving them often requires translating 
them into more formal terms. 
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Translating Statements with Multiple Quantifiers from Informal 
to Formal Language 


The reciprocal of a real number a is a real number b such that ab = 1. The following two 
statements are true. Rewrite them formally using quantifiers and variables. 
a. Every nonzero real number has a reciprocal. 


Note The number 0 has b. There is a real number with no reciprocal. 


no reciprocal. , 
Solution 


a. V nonzero real number u, 3 a real number v such that uv = 1. 


b. 3a real number c such that V real number d, cd # 1. E 


CAMEE There Is a Smallest Positive Integer 


Recall that every integer is a real number and that real numbers are of three types: positive, 
negative, and zero (zero being neither positive nor negative). Consider the statement “There 
is a smallest positive integer.” Write this statement formally using both symbols 4 and VY. 


Solution To say that there is a smallest positive integer means that there is a positive 
integer m with the property that no matter what positive integer n a person might pick, m 
will be less than or equal to n: 


Ja positive integer m such that V positive integer n, m <n. 


Note that this statement is true because 1 is a positive integer that is less than or equal to 
every positive integer. 


positive integers 


-5 -4 -3 -2 -l 0 1 2 3 4 5 
e e e e e e e e e e E 


SEU EER There Is No Smallest Positive Real Number 


Imagine the positive real numbers on the real number line. These numbers correspond to 
all the points to the right of 0. Observe that no matter how small a real number x is, the 
number x/2 will be both positive and less than x.* 


-2 -1 0 x 1 2 
< ł H H H H > 


Thus the following statement is true: “There is no smallest positive real number.” Write 
this statement formally using both symbols V and J. 


Solution V positive real number x, 3 a positive real number y such that y < x. a 


The Definition of Limit of a Sequence 


The definition of limit of a sequence, studied in calculus, uses both quantifiers V and 3 and 
also if-then. We say that the limit of the sequence a, as n goes to infinity equals L and write 


lima, = L 


no 


if, and only if, the values of a, become arbitrarily close to L as n gets larger and larger 
without bound. More precisely, this means that given any positive number £, we can find 


*This can be deduced from the properties of the real numbers given in Appendix A. Because x is positive, 
0 < x. Add x to both sides to obtain x < 2x. Then 0 < x < 2x. Now multiply all parts of the inequality by the 
positive number 1/2. This does not change the direction of the inequality, so 0 < x/2 < x. 
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an integer N such that whenever n is larger than N, the number a, sits between L — e and 

L+ e on the number line. 

L-e L L+e 
ji 


© 3 


da, must lie in here when n > N 


Symbolically: 


Ve > 0, Jan integer N such that V integer n, 
ifn >N then L-e <a, <L+e. 


Considering the logical complexity of this definition, it is no wonder that many students 
find it hard to understand. E 


Ambiguous Language 


The drawing in Figure 3.3.3 is a famous example of visual ambiguity. When you look at it 
for a while, you will probably see either a silhouette of a young woman wearing a large hat 
or an elderly woman with a large nose. Whichever image first pops into your mind, try to 
see how the drawing can be interpreted in the other way. (Hint: The mouth of the elderly 
woman is the necklace on the young woman.) 


Chronicle/Alamy Stock Photo 


FIGURE 3.3.3 


Once most people see one of the images, it is difficult for them to perceive the other. 
So it is with ambiguous language. Once you interpreted the sentence at the beginning of 
this section in one way, it may have been hard for you to see that it could be understood 
in the other way. Perhaps you had difficulty even though the two possible meanings were 
explained, just as many people have difficulty seeing the second interpretation for the 
drawing even when they are told what to look for. 
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Although statements written informally may be open to multiple interpretations, we 
cannot determine their truth or falsity without interpreting them one way or another. 
Therefore, we have to use context to try to ascertain their meaning as best we can. 


Negations of Statements with More Than One Quantifier 


You can use the same rules to negate statements with several quantifiers that you used to 
negate simpler quantified statements. Recall that 


~(vx in D, P(x)) = 3x in D such that ~ P(x). 


and 


~(Ax in D such that P(x)) = Vx in D, ~P(). 


Thus 


~(vx in D, Jy in E such that P(x, y)) = 4x in D such that ~ Gy in E such that P(x, y)) 
= Jx in D such that Vy in E, ~ P(x, y) 


Similarly, 
~(dx in D such that Vy in E, P(x, y)) = Vx in D, ~(Vy in E, P(x, y)) 
= Vx in D, dy in E such that ~P(x, y) 


These facts are summarized as follows: 


Negations of Statements with Two Different Quantifiers 


~(Vx in D, 3y in E such that P(x, y) = 3x in D such that Vy in E, ~P(x, y) 
~(dx in D such that Vy in E, P(x, y)) = Vx in D, Jy in E such that ~P(x, y) 


Negating Statements in a Tarski World 


Refer to the Tarski world of Figure 3.3.1, which is reprinted here for reference. 


JORT 
m |2| E 


: 
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Write a negation for each of the following statements, and determine which is true, the 
given statement or its negation. 


a. For every square x, there is a circle y such that x and y have the same color. 


b. There is a triangle x such that for every square y, x is to the right of y. 


Solution 


a. First version of negation: 4a square x such that ~(a circle y such 
that x and y have the same color). 
Final version of negation: 3 a square x such that V circle y, x and y do not have 
the same color. 


The negation is true. Square e is black and no circle in this Tarski world is black, so there 
is a square that does not have the same color as any circle. 


b. First version of negation: V triangle x, ~(V square y, x is to the right of y). 
Final version of negation: V triangle x, 3 a square y such that x is not to the right 
of y. 


The negation is true because no matter what triangle is chosen, it is not to the right of 
square g or square j, which are the only squares in this Tarski world. E 


Order of Quantifiers 


Consider the following two statements: 


V person x, J a person y such that x loves y. 


Ja person y such that V person x, x loves y. 


Note that except for the order of the quantifiers, these statements are identical. However, 
the first means that given any person, it is possible to find someone whom that person 
loves, whereas the second means that there is one amazing individual who is loved by 
all people. (Reread the statements carefully to verify these interpretations!) The two 
sentences illustrate an extremely important property about statements with two different 
quantifiers. 


In a statement containing both V and J, changing the order of the quantifiers can 
significantly change the meaning of the statement. 


Interestingly, however, if one quantifier immediately follows another quantifier of the 
same type, then the order of the quantifiers does not affect the meaning. Consider the com- 
mutative property of addition of real numbers, for example: 


V real number x and V real number y, x+y = y +x. 
This means the same as 

V real number y and V real number x, x+ y = y+x. 
Thus the property can be expressed a little less formally as 


Y real numbers x and y, x+ y = y+ x. 
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Quantifier Order in a Tarski World 


Look again at the Tarski world of Figure 3.3.1. Do the following two statements have the 
same truth value? 


a. For every square x there is a triangle y such that x and y have different colors. 


b. There exists a triangle y such that for every square x, x and y have different colors. 


Solution Statement (a) says that if someone gives you one of the squares from the Tarski 
A world, you can find a triangle that has a different color. This is true. If someone gives you 
Caitionl i acitement square g or h (which are gray), you can use triangle d (which is black); if someone gives you 
containstwodii ereit square e (which is black), you can use either triangle for i (which are gray); and if someone 
quantifiers, reversing gives you square j (which is blue), you can use triangle d (which is black) or triangle f or i 
their order may change (which are gray). 
the truth value of the Statement (b) says that there is one particular triangle in the Tarski world that has a dif- 
statement to its opposite. ferent color from every one of the squares in the world. This is false. Two of the triangles 


are gray, but they cannot be used to show the truth of the statement because the Tarski 
world contains gray squares. The only other triangle is black, but it cannot be used either 
because there is a black square in the Tarski world. 

Thus one of the statements is true and the other is false, and so they have opposite 
truth values. E 


Formal Logical Notation 


In some areas of computer science, logical statements are expressed in purely symbolic 
notation. The notation involves using predicates to describe all properties of variables 
and omitting the words such that in existential statements. (When you try to figure 
out the meaning of a formal statement, however, it is helpful to think the words such 
that to yourself each time they are appropriate.) The formalism also depends on the 
following facts: 


“Vy in D, P(x)” can be written as “Vx (x in D —> P(x)),” and 
“Jx in D such that P(x)” can be written as “Sx (x in D A P(x)).” 


We illustrate the use of these facts in Example 3.3.10. 


ZEWA Formalizing Statements in a Tarski World 


Consider once more the Tarski world of Figure 3.3.1: 
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29 66 


Let Triangle(x), Circle(x), and Square(x) mean “x is a triangle,” “x is a circle,” and “x is a 


29 66 


square”; let Blue(x), Gray(x), and Black(x) mean “‘x is blue,” “x is gray,” and “x is black”; let 
RightOf(x, y), Above(x, y), and SameColorAs(x, y) mean “x is to the right of y,” “x is above 
y; and “x has the same color as y”; and use the notation x = y to denote the predicate “x is 
equal to y.” Let the common domain D of all variables be the set of all the objects in the 
Tarski world. Use formal logical notation to write each of the following statements, and 


write a formal negation for each statement. 
. For every circle x, x is above f. 


a 
b. There is a square x such that x is black. 


O 


. For every circle x, there is a square y such that x and y have the same color. 


d. There is a square x such that for every triangle y, x is to the right of y. 


Solution 


a. Statement: Yx(Circle(x) — Above(x, f)) 
Negation: ~ (Vx (Circle(x) — Above(x, f))) 
= dx ~(Circle(x) — Above(x, f)) 
by the law for negating a V statement 
= 4x(Circle(x) A ~Above(x, f)) 
by the law of negating an if-then statement 
b. Statement: 4x(Square(x) ^ Black(x)) 
Negation: ~ (Ax(Square(x) A Black(x))) 
= Vx ~(Square(x) A Black(x)) 
by the law for negating a 3 statement 
= Vx(~Square(x) v ~Black()) 
by De Morgan’s law 
c. Statement: Vx(Circle(x) — 4y(Square(y) A SameColor(x, y))) 
Negation: ~ (Vx(Circle(x) — Sy(Square(y)  SameColor(x, y)))) 
= dx ~(Circle(x) — Jy(Square(y) A SameColor(x, y))) 
by the law for negating a V statement 
= dx(Circle(x) A ~ (A y(Square(y) ^A SameColor(, y)))) 
by the law for negating an if-then statement 
= dx(Circle(x) A V y(~(Square(y) ^A SameColor(x, y)))) 
by the law for negating a 3 statement 
= dx(Circle(x) A Vy(~Square(y) V ~SameColor(x, y))) 
by De Morgan’s law 


d. Statement: 4x(Square(x) ^A Vy(Triangle(y) — RightOf(, y))) 
Negation: ~ (Ax(Square(x) A Vy(Triangle(y) — RightOf(x, y)))) 

= Vx~ (Square(x) ^A Vy(Triangle(x) — RightOf(, y))) 
by the law for negating a 3 statement 

= Vx(~ Square(x) v ~(Vy(Triangle(y) — RightOf(x, y)))) 
by De Morgan’s law 

= Vax(~ Square(x) v 4y(~(Triangle(y) — RightOf(, y)))) 
by the law for negating a V statement 

= Vx(~ Square(x) v 4y(Triangle(y) A ~RightOf(, y))) 
by the law for negating an if-then statement 
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Example 3.3.11 


Note Different Prolog 
implementations follow 
different conventions as to 
how to represent constant, 
variable, and predicate 
names and forms of ques- 
tions and answers. The 
conventions used here 

are similar to those of 
Edinburgh Prolog. 
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The disadvantage of the fully formal notation is that because it is complex and some- 
what remote from intuitive understanding, when we use it, we may make errors that go 
unrecognized. The advantage, however, is that operations, such as taking negations, can 
be made completely mechanical and programmed on a computer. Also, when we become 
comfortable with formal manipulations, we can use them to check our intuition, and then 
we can use our intuition to check our formal manipulations. Formal logical notation is used 
in branches of computer science such as artificial intelligence, program verification, and 
automata theory and formal languages. 

Taken together, the symbols for quantifiers, variables, predicates, and logical connec- 
tives make up what is known as the language of first-order logic. Even though this lan- 
guage is simpler in many respects than the language we use every day, learning it requires 
the same kind of practice needed to acquire any foreign language. 


Prolog 


The programming language Prolog (short for programming in logic) was developed in 
France in the 1970s by A. Colmerauer and P. Roussel to help programmers working in the 
field of artificial intelligence. A simple Prolog program consists of a set of statements de- 
scribing some situation together with questions about the situation. Built into the language 
are search and inference techniques needed to answer the questions by deriving the an- 
swers from the given statements. This frees the programmer from the necessity of having 
to write separate programs to answer each type of question. Example 3.3.11 gives a very 
simple example of a Prolog program. 


A Prolog Program 


Consider the following picture, which shows colored blocks stacked on a table. 


g Wa g |= gray block b, |= blue block 3 


b, |= blue block 1 W, |= white block 1 


b, | = blue block 2 w, |= white block 2 


[= l- 


The following are statements in Prolog that describe this picture and ask two questions 
about it. 


isabove(g, bı) color(g, gray) color(b3, blue) 
isabove(b,, w1) color(b,, blue) color(w,, white) 
isabove(w, b>) color(b, blue) color(w, white) 
isabove(bp, b3) isabove(X, Z) if isabove(X, Y) and isabove(Y, Z) 
1. ?color(b,, blue) 2. Disabove(X, w 1) 


The statements “isabove(g, b)’ and “color(g, gray)” are to be interpreted as “g is above b,” 
and “g is colored gray.” The statement “isabove(X, Z) if isabove(X, Y) and isabove(Y, ZY’ is 
to be interpreted as “For all X, Y, and Z, if X is above Y and Y is above Z, then X is above Z.” 
Statement 1 


?color(b;, blue) 
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asks whether block b; is colored blue. Prolog answers this by writing 
Yes. 


Statement 2 
?isabove(X, w 1) 


asks for which blocks X the predicate “X is above w,” is true. Prolog answers by giving a 
list of all such blocks. In this case, the answer is 


X=b,X=g. 


Note that Prolog can find the solution X = b, by merely searching the original set of given 
facts. However, Prolog must infer the solution X = g from the following statements: 
isabove(g, bı), 
isabove(b;, w1), 
isabove(X, Z) if isabove(X, Y) and isabove(Y, Z). 
Write the answers Prolog would give if the following questions were added to the program 
above. 
a. lisabove(bz, w1) b. Xcolor(w;, X) c. ?color(X, blue) 
Solution 
a. The question means “Is b above w,?”; so the answer is “No.” 


b. The question means “For what colors X is the predicate ‘w is colored X ’ true?”; so 
the answer is “X = white.” 


c. The question means “For what blocks is the predicate ‘X is colored blue’ true?”; so the 


answer is “X = by,” “X = bo,” and “X = b3.” E 
TEST YOURSELF 
1. To establish the truth of a statement of the form 3. Consider the statement “Vx, Jy such that P(x, y), a 
“Vx in D, dy in E such that P(x, y),” you imagine property involving x and y, is true.” A negation for 
that someone has given you an element x from this statement is “ 4 


D but that you have no control over what that el- 
ement is. Then you need to find with the 
property that the x the person gave you together 
with the you subsequently found satisfy 
— 5. Suppose P(x, y) is some property involving x 

and y, and suppose the statement “Vx in D, dy in E 
such that P(x, yy’ is true. Then the statement “3x in 
D such that Vy in E, Pœ, yy” 

a. is true. 

b. is false. 

c. may be true or may be false. 


4. Consider the statement “3x such that Vy, P(x, y), a 
property involving x and y, is true.” A negation for 
this statement is “ 4 


2. To establish the truth of a statement of the form 
“Jx in D such that Vy in E, P(x, y),” you need 
to find so that no matter what a 
person might subsequently give you, will 
be true. 
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EXERCISE SET 3.3 


1. Let C be the set of cities in the world, let N be the e. Jan item / such that V student S, S did not 
set of nations in the world, and let P(c, n) be “c is choose J. 
the capital city of n.” Determine the truth values f. 3a station Z such that V student S, J an item J 
of the following statements. such that S' chose J from Z. 
i P(Tokyo, Japan) i P(Athens, Egyp t) Tl. Let S be the set of students at your school, let M 
c. P(Paris, France) d. P(Miami, Brazil) be the set of movies that have ever been released, 
2. Let G(x, y) be 2s y.” Indicate which of the fol- and let V(s, m) be “student s has seen movie m.” 
lowing statements are true and which are false. Rewrite each of the following statements without 
a. G(2, 3) b. Gd, 1) using the symbol V, the symbol J, or variables. 
c GG, 5) d. G(—2, 2) a. ds € S such that Vs, Casablanca). 
3. The following statement is true: “V nonzero b. Vs E S, V(s, Star Wars). 
number x, 3 a real number y such that xy = 1.” For c. Vs E S, 3m E M such that V(s, m). 
each x given below, find a y to make the predicate d. dm E M such that Vs € S, V(s, m). 
“xy = I” true. e. ds E S, Jt E S, and 3m E M such that s # t 
a x=2 b. x= -1 c. x = 3/4 and Vs, m) N Vit, m). 
4. The following statement is true: “Y real number x, f. Js E€ Sand 3t E S such that s # t and Ym E M, 
Jan integer n such that n > x.”* For each x given Vis, m) —> V(t, m). 
below, fird anın to make the pgedicate n > X igoe 12. Let D = E = {—2, —1, 0, 1, 2}. Write negations 


for each of the following statements and determine 
which is true, the given statement or its negation. 
a. Vxin D, dyin E such that x+y = 1. 
5. For every circle x there is a square y such that b. 3x in D such that Vy in E, x+ y = —y. 

x and y have the same color. c. Vxin D, Sy in E such that xy = y. 
d. 3x in D such that Vy in E, x = y. 


The statements in exercises 5-8 refer to the Tarski world 
given in Figure 3.3.1. Explain why each is true. 


6. For every square x there is a circle y such that 
x and y have different colors and y is above x. In each of 13-19, (a) rewrite the statement in English 
without using the symbol Y or 3or variables and express- 

ing your answer as simply as possible, and (b) write a 


7. There is a triangle x such that for every square 


y, x is above y. negation for the statement. 
8. There is a triangle x such that for every circle y, 13. V color C, J an animal A such that A is colored C. 
y is above x. 


14. Ja book b such that V person p, p has read b. 
9. Let D = E = {—2, —1, 0, 1, 2}. Explain why the 
following statements are true. 


a. Vx in D, Jy in E such that x + y = 0. 16. 3a real number u such that V real number v, uv = v. 
b. Sx in D such that Vy in E, x+y = y. 


15. Y odd integer n, 3an integer k such that n = 2k + 1. 


f i : 17. Vr E Q, J integers a and b such that r = a/b. 
10. This exercise refers to Example 3.3.3. Determine 


whether each of the following statements is true or 18. Vx E R, da real number y such that x + y = 0. 
false. 

a. V student S, 3 a dessert D such that S chose D. 19. Jx E e such that for every real number y, 

b. V student S, 3 a salad T such that S chose T. xty=0. 

c. Ja dessert D such that V student S, S chose D. 20. Recall that reversing the order of the quantifiers 
d. Ja beverage B such that V student D, D chose B. in a statement with two different quantifiers may 


*This is called the Archimedean principle because it was first formulated (in geometric terms) by the great Greek mathematician Archimedes 
of Syracuse, who lived from about 287 to 212 B.C.E. 
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change the truth value of the statement—but it 
does not necessarily do so. All the statements in 
the pairs below refer to the Tarski world of Figure 
3.3.1. In each pair, the order of the quantifiers is 
reversed but everything else is the same. For each 
pair, determine whether the statements have 
the same or opposite truth values. Justify your 
answers. 
a. (1) For every square y there is a triangle x such 
that x and y have different colors. 
(2) There is a triangle x such that for every 
square y, x and y have different colors. 
b. (1) For every circle y there is a square x such 
that x and y have the same color. 
(2) There is a square x such that for every 
circle y, x and y have the same color. 


21. For each of the following equations, determine 
which of the following statements are true: 
(1) For every real number x, there exists a real 
number y such that the equation is true. 
(2) There exists a real number x, such that for 
every real number y, the equation is true. 


Note that it is possible for both statements to be 

true or for both to be false. 

a. 2xt+y=7 

b. y+x=x+y 

c. x°—2xy+y’? =0 

d. ~—-5)y-1) =0 

e. x + y = -1 
In 22 and 23, rewrite each statement without using vari- 
ables or the symbol VY or 3. Indicate whether the state- 
ment is true or false. 


22. a. V real number x, 3 a real number y such that 


x+y=0. 
b. 3a real number y such that V real number x, 
x+y=0. 


23. a. V nonzero real number r, 3 a real number s 
such that rs = 1. 

b. 3a real number r such that V nonzero real 
number s, rs = 1. 


24. Use the laws for negating universal and existential 
statements to derive the following rules: 


a. ~(Vx E D(Vy E E(PG, YY) 
= 4x E Dy E E(~P(, y))) 

b. ~(dx E Dy E E(P, yY) 
= Vx E D(Vy E E(~Px, y))) 
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Each statement in 25-28 refers to the Tarski world of 
Figure 3.3.1. For each, (a) determine whether the state- 
ment is true or false and justify your answer, and (b) write 
a negation for the statement (referring, if you wish, to the 
result in exercise 24). 


25. V circle x and V square y, x is above y. 
26. V circle x and V triangle y, x is above y. 


27. 3a circle x and da square y such that x is above y 
and x and y have different colors. 


28. Ja triangle x and Ja square y such that x is above 
y and x and y have the same color. 

For each of the statements in 29 and 30, (a) write a new 

statement by interchanging the symbols V and J, and 

(b) state which is true: the given statement, the version 

with interchanged quantifiers, neither, or both. 


29. Yx E R, Jy E R such that x < y. 


30. 4x E R such that Vy E R (the set of negative real 
numbers), x > y. 


31. Consider the statement “Everybody is older than 
somebody.” Rewrite this statement in the form 
“y people x, J__.” 


32. Consider the statement “Somebody is older than 
everybody.” Rewrite this statement in the form 
“Ja person x such tht V__.” 

In 33-39, (a) rewrite the statement formally using 

quantifiers and variables, and (b) write a negation for the 

statement. 


33. Everybody loves somebody. 

34. Somebody loves everybody. 

35. Everybody trusts somebody. 

36. Somebody trusts everybody. 

37. Any even integer equals twice some integer. 

38. Every action has an equal and opposite reaction. 


39. There is a program that gives the correct answer to 
every question that is posed to it. 


40. In informal speech most sentences of the form 
“There is every ” are intended to be 
understood as meaning “V J , even 
though the existential quantifier there is comes 
before the universal quantifier every. Note that 
this interpretation applies to the following 


well-known sentences. Rewrite them using quan- 

tifiers and variables. 

a. There is a sucker born every minute. 

b. There is a time for every purpose under 
heaven. 


41. Indicate which of the following statements are true 
and which are false. Justify your answers as best 
you can. 


a. Vx © Z*, dy © Z such that x = y +1. 
b. Vx E Z, dy E Z such that x = y +1. 
dx E R such that Yy E R, x = y +1. 
d. Vx € R*, dy € R” such that xy = 1. 
e. Vx E R, dy E R such that xy =1. 

f. dx E R such that Yy E R, x+y = y. 
g. Vx © R*, dy € R” such that y < x. 

h. 3x E R” such that Vy E RË, x < y. 


o 


42. Write the negation of the definition of limit of a 
sequence given in Example 3.3.7. 


43. The following is the definition for lim, _, , f(x) = L: 


For every real number € > 0, there exists a real 
number 6 > 0 such that for every real number x, 
ifa-—d<x<atéandx # a then 


L-e<fix)<Lte. 


Write what it means for lim, _, , f(x) # L. In other 
words, write the negation of the definition. 


44. The notation J! stands for the words “there exists 
a unique.” Thus, for instance, “3! x such that x 
is prime and x is even” means that there is one 
and only one even prime number. Which of the 
following statements are true and which are false? 


Explain. 
a. J! real number x such that V real number 
yY, Y = y. 


b. d! integer x such that 1/x is an integer. 
c. V real number x, J! real number y such that 
xt+ty=0. 


*45. Suppose that P(x) is a predicate and D is the 


domain of x. Rewrite the statement “g! x E D 
such that P(x)” without using the symbol 2!. (See 
exercise 44 for the meaning of 3!.) 


In 46-54, refer to the Tarski world given in Figure 3.1.1, 
which is shown again here for reference. The domains of 
all variables consist of all the objects in the Tarski world. 
For each statement, (a) indicate whether the statement 
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is true or false and justify your answer, (b) write the given 
statement using the formal logical notation illustrated in 
Example 3.3.10, and (c) write a negation for the given state- 
ment using the formal logical notation of Example 3.3.10. 


A @ 


IE 
CIE 
E 


46. There is a triangle x such that for every square y, x 
is above y. 


47. There is a triangle x such that for every circle y, x 
is above y. 


48. For every circle x, there is a square y such that y is 
to the right of x. 


49. For every object x, if x is a circle then there is a 
square y such that y has the same color as x. 


50. For every object x, if x is a triangle then there is a 
square y such that y is below x. 


51. There is a square x such that for every triangle y, if 
y is above x then y has the same color as x. 


52. For every circle x and for every triangle y, x is to 
the right of y. 


53. There is a circle x and there is a square y such that 
x and y have the same color. 


54. There is a circle x and there is a triangle y such 
that x has the same color as y. 


Let P(x) and Q(x) be predicates and suppose D is the 
domain of x. In 55-58, for the statement forms in each 
pair, determine whether (a) they have the same truth 
value for every choice of P(x), Q(x), and D, or (b) there is 
a choice of P(x), Q(x), and D for which they have opposite 
truth values. 


55. Vx E D, (P(X) A QŒ), and 
(Vx E D, PŒ) A (Vx E D, Q) 
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56. 3x E D, (PQ) A Q(x), and In 59-61, find the answers Prolog would give if the 


(Ax E D, PŒ) A Ax E D, 0w) following questions were added to the program given 
in Example 3.3.11. 


57. Vx E D, (Pœ) v Qœ), and 59. a. ?isabove(b,, w1) 60. a. ?isabove(w), g) 
(vx E D, PŒ) v (Yx E D, Q) b. ?color(X, white) b. ?color(w>, blue) 
c. ?isabove(X, b3) c. ?isabove(X, bı) 


58. dx E D, (P(X) v Qœ), and 


(Ax E D, P(x)) V (dx = D, ow) 61. a. ?isabove(wo, b3) 


b. ?color(X, gray) 
c. ?isabove(g, X) 


ANSWERS FOR TEST YOURSELF 


1. an element y in E; y; P(x, y) 2. an element x in D; y 5. The answer is (c): the truth or falsity of a statement in 
in E; P(x, y) 3.4x such that Vy, the property P(x, y) is which the quantifiers are reversed depends on the nature of 
false. 4. Vx, dy such that the property P(x, y) is false. the property involving x and y. 


EZ3 Arguments with Quantified Statements 


The only complete safeguard against reasoning ill, is the habit of reasoning well; 
familiarity with the principles of correct reasoning; and practice in applying those 
principles. —John Stuart Mill 


The rule of universal instantiation (in-stan-she-AY-shun) says the following: 


Universal Instantiation 


If a property is true of everything in a set, then it is true of any particular thing in 
the set. 


Use of the words universal instantiation indicates that the truth of a property in a particu- 
lar case follows as a special instance of its more general or universal truth. The validity of 
this argument form follows immediately from the definition of truth values for a universal 
statement. One of the most famous examples of universal instantiation is the following: 


All men are mortal. 
Socrates is a man. 


<. Socrates is mortal. 


Universal instantiation is the fundamental tool of deductive reasoning. Mathematical 
formulas, definitions, and theorems are like general templates that are used over and over 
in a wide variety of particular situations. A given theorem says that such and such is true 
for all things of a certain type. If, in a given situation, you have a particular object of 
that type, then by universal instantiation, you conclude that such and such is true for that 
particular object. You may repeat this process 10, 20, or more times in a single proof or 
problem solution. 

As an example of universal instantiation, suppose you are doing a problem that requires 
you to simplify 


k+1 
r “T; 
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where r is a particular real number and k is a particular integer. You know from your study 
of algebra that the following universal statements are true: 

1. For every real number x and for all integers m and n, x"+x" = x""". 

2. For every real number x, xl =x. 


So you proceed as follows: 


Ply = tly! Step 1 
— (k+1)+1 Step 2 
k+2 


by basic algebra. 


Here is the reasoning behind steps 1 and 2. 


1 ; 
Step 1: For every real number x, x = x. universal truth 
r is a particular real number. particular instance 
1 : 
on a conclusion 


Step 2: For every real number x and for all integers 


+ ý 
m andn, xy = universal truth 
r is a particular real number and k + 1 
and 1 are particular integers. particular instance 
. „k+1, 1 — „(k+1)+1 : 
Pe Ai ie G : conclusion 


Both arguments are examples of universal instantiation. 


Universal Modus Ponens 


The rule of universal instantiation can be combined with modus ponens to obtain the valid 
form of argument called universal modus ponens. 


Universal Modus Ponens 


Formal Version Informal Version 
Vx, if P(x) then O(). If x makes P(x) true, then x makes Q(x) true. 
P(a) for a particular a. a makes P(x) true. 
` Qa). <. a makes Q(x) true. 


Note that the first, or major, premise of universal modus ponens could be written “All 
things that make P(x) true make Q(x) true,” in which case the conclusion would follow by 
universal instantiation alone. However, the if-then form is more natural to use in the major- 
ity of mathematical situations. 


Scceeam Recognizing Universal Modus Ponens 


Rewrite the following argument using quantifiers, variables, and predicate symbols. Is this 
argument valid? Why? 


If an integer is even, then its square is even. 
k is a particular integer that is even. 
z. k? is even. 
Solution The major premise of this argument can be rewritten as 


Vx, if x is an even integer then x? is even. 
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Let E(x) be “x is an even integer,” let S(x) be “x? is even,’ and let k stand for a particular 
integer that is even. Then the argument has the following form: 


Vx, if E(x) then S(x). 
E(k), for a particular k. 
JSC): 


This argument has the form of universal modus ponens and is therefore valid. E 


ZCWJGEXYA Drawing Conclusions Using Universal Modus Ponens 


Write the conclusion that can be inferred using universal modus ponens. 


If T is any right triangle with hypotenuse Pythagorean Theorem 


c and legs a and b, then C=a +h’. 


The triangle shown at the right is a right triangle c 
with both legs equal to 1 and hypotenuse c. 


Solution c? = + =2 

Note that if you take the nonnegative square root of both sides of this equation, you obtain 
c = V2. This shows that there is a line segment whose length is V2. Section 4.7 contains 
a proof that \/2 is not a rational number. E 


Use of Universal Modus Ponens in a Proof 


In Chapter 4 we discuss methods of proving quantified statements. Here is a proof that 
the sum of any two even integers is even. It makes use of the definition of even integer, 
namely, that an integer is even if, and only if, it equals twice some integer. (Or, more for- 
mally: V integers x, x is even if, and only if, 3 an integer—say, k—such that x = 2k.) 

Suppose m and n are particular but arbitrarily chosen even integers. Then m = 2r for 
some integer r,” and n = 2s for some integer s.” Hence 


m+n = 2rt+2s by substitution 
= 2(r+ 5) by factoring out the 2. 


Now r+ sis an integer,” and so 2(r + s) is even.® Thus m + n is even. 


The following expansion of the proof shows how each of the numbered steps is justified by 
arguments that are valid by universal modus ponens. 


Note The logical (1) Ifan integer is even, then it equals twice some integer. 
principle of existential m is a particular even integer. 
instantiation says that if ~. m equals twice some integer, say, r. 


we know or have deduced 
that something exists, we 
may give it aname. This 


(2) Ifan integer is even, then it equals twice some integer. 
n is a particular even integer. 


is the principle that allows ..n equals twice some integer, say, s. 
us to call the integers (3) | Ifa quantity is an integer, then it is a real number. 
rand s. rand s are particular integers. 


*. r and s are real numbers. 


For all a, b, and c, if a, b, and c are real numbers, then ab + ac = a(b + c). 
2, r, and s are particular real numbers. 
*,2r+2s =2(rts). 
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(4) For all u and v, if u and v are integers, then u + v is an integer. 
rand s are two particular integers. 
"r+ sis an integer. 


(5) Ifa number equals twice some integer, then that number is even. 
2(r + s) equals twice the integer r+ s. 
*,2(r+s) is even. 


Of course, the actual proof that the sum of even integers is even does not explicitly 
contain the sequence of arguments given above. In fact, people who are good at analyti- 
cal thinking are normally not even conscious that they are reasoning in this way because 
they have absorbed the method so completely that it has become almost as automatic as 
breathing. 


Universal Modus Tollens 


Another crucially important rule of inference is universal modus tollens. Its validity re- 
sults from combining universal instantiation with modus tollens. Universal modus tollens 
is the heart of proof of contradiction, which is one of the most important methods of math- 
ematical argument. 


Universal Modus Tollens 


Formal Version Informal Version 
Vx, if P(x) then Q). If x makes P(x) true, then x makes Q(x) true. 
~ QQ), for a particular a. a does not make Q(x) true. 
"~ ~P(a). .. a does not make P(x) true. 


Seley Recognizing the Form of Universal Modus Tollens 


Rewrite the following argument using quantifiers, variables, and predicate symbols. Write 
the major premise in conditional form. Is this argument valid? Why? 


All human beings are mortal. 
Zeus is not mortal. 


.. Zeus is not human. 


Solution The major premise can be rewritten as 
Vx, if x is human then x is mortal. 


Let H(x) be “x is human,” let M(x) be “x is mortal,” and let Z stand for Zeus. The argument 


becomes 
Vx, if Hx) then M(x) 
~M(Z) 
*. ~H(Z). 
This argument has the form of universal modus tollens and is therefore valid. E 
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Seu eee Drawing Conclusions Using Universal Modus Tollens 
Write the conclusion that can be inferred using universal modus tollens. 


All professors are absent-minded. 


Tom Hutchins is not absent-minded. 


Solution Tom Hutchins is not a professor. a 


Proving Validity of Arguments with Quantified Statements 


The intuitive definition of validity for arguments with quantified statements is the same as 
for arguments with compound statements. An argument is valid if, and only if, the truth 
of its conclusion follows necessarily from the truth of its premises. The formal definition 
is as follows: 


Definition 


To say that an argument form is valid means the following: No matter what particu- 
lar predicates are substituted for the predicate symbols in its premises, if the result- 
ing premise statements are all true, then the conclusion is also true. An argument is 
called valid if, and only if, its form is valid. It is called sound if, and only if, its form 
is valid and its premises are true. 


As already noted, the validity of universal instantiation follows immediately from the 
definition of the truth value of a universal statement. General formal proofs of validity of 
arguments in the predicate calculus are beyond the scope of this book. We give the proof of 
the validity of universal modus ponens as an example to show that such proofs are possible 
and to give an idea of how they look. 

Universal modus ponens asserts that 


Vx, if P(x) then O(a). 
P(a) for a particular a. 
“. O(). 


To prove that this form of argument is valid, suppose the major and minor premises are 
both true. [We must show that the conclusion “Q(a)” is also true.] By the minor premise, 
P(q) is true for a particular value of a. By the major premise and universal instantiation, 
the statement “If P(a) then Q(a)” is true for that particular a. But by modus ponens, since 
the statements “If P(a) then O(a)” and “P(a)” are both true, it follows that Q(a) is true also. 
[This is what was to be shown.] 

The proof of validity given above is abstract and somewhat subtle. We include the proof 
not because we expect that you will be able to make up such proofs yourself at this stage of 
your study. Rather, it is intended as a glimpse of a more advanced treatment of the subject, 
which you can try your hand at in exercises 35 and 36 at the end of this section if you wish. 

One of the paradoxes of the formal study of logic is that the laws of logic are used to 
prove that the laws of logic are valid! 

In the next part of this section we show how you can use diagrams to analyze the valid- 
ity or invalidity of arguments that contain quantified statements. Diagrams do not provide 
totally rigorous proofs of validity and invalidity, and in some complex settings they may 
even be confusing, but in many situations they are helpful and convincing. 
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Using Diagrams to Test for Validity 


Consider the statement 

All integers are rational numbers. 
Or, formally, 

V integer n, n is a rational number. 


Picture the set of all integers and the set of all rational numbers as disks. The truth of the 
given statement is represented by placing the integers disk entirely inside the rationals disk, 
as shown in Figure 3.4.1. 


rational numbers 


integers 


FIGURE 3.4.1 


Because the two statements “Vx E D, Q(x)” and “Vx, if x is in D then Q(x)” are logically 
equivalent, both can be represented by diagrams like the foregoing. 

Perhaps the first person to use diagrams like these to analyze arguments was the German 
mathematician and philosopher Gottfried Wilhelm Leibniz. Leibniz (LIPE-nits) was far 
ahead of his time in anticipating modern symbolic logic. He also developed the main ideas 
of the differential and integral calculus at approximately the same time as (and indepen- 

TE dently of) Isaac Newton (1642-1727). 
G. W. Leibniz sas : : 
(1646-1716) To test the validity of an argument diagrammatically, represent the truth of both prem- 
ises with diagrams. Then analyze the diagrams to see whether they necessarily represent 
the truth of the conclusion as well. 


Using a Diagram to Show Validity 


Use diagrams to show the validity of the following syllogism: 


a 
D 
© 
S 

£ 

E 
© 

oO 

S 
= 
= 
& 

= 
D 
faa] 


All human beings are mortal. 
Zeus is not mortal. 
<. Zeus is not a human being. 
Solution The major premise is pictured on the left in Figure 3.4.2 by placing a disk la- 


beled “human beings” inside a disk labeled “mortals.” The minor premise is pictured on 
the right in Figure 3.4.2 by placing a dot labeled “Zeus” outside the disk labeled “mortals.” 


mortals 


human beings Zeus 


Major premise Minor premise 


FIGURE 3.4.2 
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The two diagrams fit together in only one way, as shown in Figure 3.4.3. 


mortals 


human beings Zeus 


FIGURE 3.4.3 


Since the Zeus dot is outside the mortals disk, it is necessarily outside the human beings 
disk. Thus the truth of the conclusion follows necessarily from the truth of the premises. It 
is impossible for the premises of this argument to be true and the conclusion false; hence 
the argument is valid. E 


Using Diagrams to Show Invalidity 


Use a diagram to show the invalidity of the following argument: 


All human beings are mortal. 
Felix is mortal. 


.. Felix is a human being. 


Solution The major and minor premises are represented diagrammatically in Figure 3.4.4. 


mortals 


mortals 
human beings 


© Felix 


Major premise Minor premise 


FIGURE 3.4.4 


All that is known is that the Felix dot is located somewhere inside the mortals disk. 
Where it is located with respect to the human beings disk cannot be determined. Either one 
of the situations shown in Figure 3.4.5 might be the case. 


mortals 
mortals 


human beings human beings 


@ Felix 


(a) (b) 
FIGURE 3.4.5 
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The conclusion “Felix is a human being” is true in the first case but not in the second 
(Felix might, for example, be a cat). Because the conclusion does not necessarily follow 
Caution! ‘Becareful from the premises, the argument is invalid. E 


when using diagrams to 


test for validity! For in- The argument of Example 3.4.6 would be valid if the major premise were replaced by 
stance, in this exampleif its converse. But since a universal conditional statement is not logically equivalent to its 
you put the diagrams for converse, such a replacement cannot, in general, be made. We say that this argument ex- 
the premises together to hibits the converse error. 
obtain only Figure 3.4.5(a) 
and not Figure 3.4.5(b), 
you would conclude erro- Converse Error (Quantified Form) 
ion Anes een Formal Version Informal Version 
was valid. 

Vx, if P(x) then O(). If x makes P(x) true, then x makes Q(x) true. 

Q(q) for a particular a. a makes Q(x) true. 

<. P@,. < invalid z. a makes P(x) true. < invalid 
conclusion conclusion 


The following form of argument would be valid if a conditional statement were logi- 
cally equivalent to its inverse. But it is not, and the argument form is invalid. We say that 
it exhibits the inverse error. You are asked to show the invalidity of this argument form in 
the exercises at the end of this section. 


Inverse Error (Quantified Form) 


Formal Version Informal Version 
Vx, if P(x) then O(). If x makes P(x) true, then x makes Q(x) true. 
~P(a), for a particular a. a does not make P(x) true. 
*, ~Q(@). < invalid .. a does not make Q(x) true. < invalid 
conclusion conclusion 


cee An Argument with “No” 


Use diagrams to test the following argument for validity: 


No polynomial functions have horizontal asymptotes. 
This function has a horizontal asymptote. 
.. This function is not a polynomial function. 
Solution A good way to represent the major premise diagrammatically is shown in Fig- 
ure 3.4.6, two disks—a disk for polynomial functions and a disk for functions with hori- 


zontal asymptotes—that do not overlap at all. The minor premise is represented by placing 
a dot labeled “this function” inside the disk for functions with horizontal asymptotes. 


functions with 
horizontal asymptotes 


polynomial functions 


@ this function 


FIGURE 3.4.6 
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The diagram shows that “this function” must lie outside the polynomial functions disk, and 
so the truth of the conclusion necessarily follows from the truth of the premises. Hence the 
argument is valid. E 


An alternative way to solve Example 3.4.7 is to transform “No polynomial functions 
have horizontal asymptotes” into the equivalent statement “Vx, if x is a polynomial func- 
tion, then x does not have a horizontal asymptote.” If this is done, the argument can be seen 
to have the form 


Vx, if P(x) then Q). 
~Q(Q), for a particular a. 
„=~ Pla). 


where P(x) is “x is a polynomial function” and Q(x) is “x does not have a horizontal asymp- 
tote.” This is valid by universal modus tollens. 


Creating Additional Forms of Argument 


Universal modus ponens and modus tollens were obtained by combining universal instan- 
tiation with modus ponens and modus tollens. In the same way, additional forms of argu- 
ments involving universally quantified statements can be obtained by combining universal 
instantiation with other of the valid argument forms given in Section 2.3. For instance, in 
Section 2.3 the argument form called transitivity was introduced: 


pq 
q>r 
“por 
This argument form can be combined with universal instantiation to obtain the following 
valid argument form. 


Universal Transitivity 


Formal Version Informal Version 
Vx P(x) > Qx). Any x that makes P(x) true makes Q(x) true. 
Vx O(x) = R(X). Any x that makes Q(x) true makes R(x) true. 
“Vx P(x) = R(x). <. Any x that makes P(x) true makes R(x) true. 


eee Evaluating an Argument for Tarski’s World 


The following argument refers to the kind of arrangement of objects of various types and 
colors described in Examples 3.1.13 and 3.3.1. Reorder and rewrite the premises to show 
that the conclusion follows as a valid consequence from the premises. 


1. All the triangles are blue. 

2. If an object is to the right of all the squares, then it is above all the circles. 

3. If an object is not to the right of all the squares, then it is not blue. 

.. All the triangles are above all the circles. 

Solution Itis helpful to begin by rewriting the premises and the conclusion in if-then form: 
1. Vx, if x is a triangle, then x is blue. 


2. Vx, if x is to the right of all the squares, then x is above all the circles. 
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3. Vx, if x is not to the right of all the squares, then x is not blue. 


z. Vx, if x is a triangle, then x is above all the circles. 


The goal is to reorder the premises so that the conclusion of each is the same as the 
hypothesis of the next. Also, the hypothesis of the argument’s conclusion should be the 
same as the hypothesis of the first premise, and the conclusion of the argument’s conclu- 
sion should be the same as the conclusion of the last premise. To achieve this goal, it may 
be necessary to rewrite some of the statements in contrapositive form. 

In this example you can see that the first premise should remain where it is, but the sec- 
ond and third premises should be interchanged. Then the hypothesis of the argument is the 
same as the hypothesis of the first premise, and the conclusion of the argument’s conclusion 
is the same as the conclusion of the third premise. But the hypotheses and conclusions of 
the premises do not quite line up. This is remedied by rewriting the third premise in con- 
trapositive form. 


1. Vx, if x is a triangle, then x is blue. 

3. Vx, if x is blue, then x is to the right of all the squares. 

2. Vx, if x is to the right of all the squares, then x is above all the circles. 

Putting 1 and 3 together and using universal transitivity gives that 

4. Vx, if x is a triangle, then x is to the right of all the squares. 

And putting 4 together with 2 and using universal transitivity gives that 
z. Vx, if x is a triangle, then x is above all the circles, 


which is the conclusion of the argument. E 


Remark on the Converse and Inverse Errors 


One reason why so many people make converse and inverse errors is that the forms of the 
resulting arguments would be valid if the major premise were a biconditional rather than a 
simple conditional. And, as we noted in Section 2.2, many people tend to conflate bicon- 
ditionals and conditionals. 

Consider, for example, the following argument: 


All the town criminals frequent the Den of Iniquity bar. 
John frequents the Den of Iniquity bar. 
.. John is one of the town criminals. 


The conclusion of this argument is invalid—it results from making the converse error. 
Therefore, it may be false even when the premises of the argument are true. This type of 
argument attempts unfairly to establish guilt by association. 

The closer, however, the major premise comes to being a biconditional, the more likely 
the conclusion is to be true. If hardly anyone but criminals frequent the bar and John also 
frequents the bar, then it is likely (though not certain) that John is a criminal. On the basis 
of the given premises, it might be sensible to be suspicious of John, but it would be wrong to 
convict him. 

A variation of the converse error is a very useful reasoning tool, provided that it is used 
with caution. It is the type of reasoning that is used by doctors to make medical diagnoses 
and by auto mechanics to repair cars. It is the type of reasoning used to generate explana- 
tions for phenomena. It goes like this: 
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If a statement of the form 

For every x, if P(x) then Q(x) 
is true, and if 

Q(@) is true, for a particular a, 


then check out the statement P(a); it just might be true. For instance, suppose a doctor 
knows that 


For every x, if x has pneumonia, then x has a fever and chills, 
coughs deeply, and feels exceptionally tired and miserable. 


And suppose the doctor also knows that 


John has a fever and chills, coughs deeply, 
and feels exceptionally tired and miserable. 


On the basis of these data, the doctor concludes that a diagnosis of pneumonia is a strong 
possibility, though not a certainty. The doctor will probably attempt to gain further support 
for this diagnosis through laboratory testing that is specifically designed to detect pneu- 
monia. Note that the closer a set of symptoms comes to being a necessary and sufficient 
condition for an illness, the more nearly certain the doctor can be of his or her diagnosis. 

This form of reasoning has been named abduction by researchers working in artificial 
intelligence. It is used in certain computer programs, called expert systems, that attempt to 
duplicate the functioning of an expert in some field of knowledge. 


TEST YOURSELF 


1. The rule of universal instantiation says that if a 
some property is true for in a domain, then as 
it is true for ____ 


7 then the conclusion can be written 


4. If the first two premises of universal transitivity are 


EXERCISE SET 3.4 


1. 


. If the first two premises of universal modus 


ponens are written as “If x makes P(x) true, then 
x makes Q(x) true” and “For a particular value 
of a ____,” then the conclusion can be written 


. If the first two premises of universal modus tol- 


lens are written as “If x makes P(x) true, then x 
makes Q(x) true” and “For a particular value of 


written as “Any x that makes P(x) true makes Q(x) 
true” and “Any x that makes Q(x) true makes R(x) 
true,” then the conclusion can be written as “ 


” 


Diagrams can be helpful in testing an argument 
for validity. However, if some possible configu- 
rations of the premises are not drawn, a person 
could conclude that an argument was when 
it was actually —___ 


Let the following law of algebra be the first state- 
ment of an argument: For all real numbers a and b, 


(a+b) = a’ +2ab+b’. 


Suppose each of the following statements is, in 
turn, the second statement of the argument. Use 
universal instantiation or universal modus ponens 
to write the conclusion that follows in each case. 
a. a =x and b = y are particular real numbers. 
b. a =f, and b = f; are particular real numbers. 
c. a= 3u and b = 5v are particular real numbers. 
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2. 


d. a= g(r) and b = g(s) are particular real 
numbers. 

e. a = log(t,) and b = log(t,) are particular real 
numbers. 


Use universal instantiation or universal modus ponens to 
fill in valid conclusions for the arguments in 2-4. 


If an integer n equals 2-k and k is an integer, 
then n is even. 
0 equals 2-0 and 0 is an integer. 


3. For all real numbers a, b, c, and d, if b # 0 and 
d + 0 then a/b + c/d = (ad + bc)lbd. 
a=2,b=3,c = 4, and d = 5 are particular 
real numbers such that b # 0 and d # 0. 


4. V real numbers r, a, and b, if r is positive, then 
AÈ = r 
r= 3,a = 1/2, and b = 6 are particular real 
numbers such that r is positive. 


Use universal modus tollens to fill in valid conclusions for 
the arguments in 5 and 6. 


5. AU irrational numbers are real numbers. 
g is not a real number. 


6. Ifa computer program is correct, then compilation 
of the program does not produce error messages. 
Compilation of this program produces error 
messages. 


Some of the arguments in 7-18 are valid by universal 
modus ponens or universal modus tollens; others are 
invalid and exhibit the converse or the inverse error. State 


which are valid and which are invalid. Justify your answers. 


7. All healthy people eat an apple a day. 
Keisha eats an apple a day. 
*, Keisha is a healthy person. 


8. All freshmen must take a writing course. 
Caroline is a freshman. 
*, Caroline must take a writing course. 


9. Ifa graph has no edges, then it has a vertex of 
degree zero. 
This graph has at least one edge. 
*, This graph does not have a vertex of degree 
Zero. 


10. Ifa product of two numbers is 0, then at least 
one of the numbers is 0. 
For a particular number x, neither (2x + 1) nor 
(x — 7) equals 0. 
*, The product (2x + 1)(x — 7) is not 0. 


11. All cheaters sit in the back row. 
Monty sits in the back row. 
*, Monty is a cheater. 


12. Ifan 8-bit two’s complement represents a posi- 
tive integer, then the 8-bit two’s complement 
starts with a 0. 
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The 8-bit two’s complement for this integer does 
not start with a 0. 


*, This integer is not positive. 


13. 


For every student x, if x studies discrete math- 
ematics, then x is good at logic. 
Tarik studies discrete mathematics. 


*. Tarik is good at logic. 


14. 


If compilation of a computer program produces 
error messages, then the program is not correct. 
Compilation of this program does not produce 
error messages. 


‘, This program is correct. 


15. 


Any sum of two rational numbers is rational. 
The sum r + s is rational. 


*. The numbers r and s are both rational. 


16. 


If a number is even, then twice that number is 
even. 
The number 2n is even, for a particular number n. 


*, The particular number n is even. 


17. 


If an infinite series converges, then the terms go 
to 0. 0 
The terms of the infinite series ` — go to 0. 
n 
n=1 


œ 


‘. The infinite series X, — converges. 
n 


18. 


19. 


-. The infinite series 


n=1 


If an infinite series converges, then its terms go 
to 0. 


œo 


The terms of the infinite series do not 


go to 0. 


n=1 n 


o0 


n 


does not converge. 
n 
n=1 


Rewrite the statement “No good cars are cheap” 


in the form “Vx, if P(x) then ~ Q(x).” Indicate 
whether each of the following arguments is valid 
or invalid, and justify your answers. 


No good car is cheap. 
A Rimbaud is a good car. 
` A Rimbaud is not cheap. 
No good car is cheap. 
A Simbaru is not cheap. 
*, A Simbaru is a good car. 
No good car is cheap. 
A VX Roadster is cheap. 
*. A VX Roadster is not good. 
No good car is cheap. 
An Omnex is not a good car. 
*, An Omnex is cheap. 
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20. a. Use a diagram to show that the following and 3.3.1. Exercises 31 and 32 are adapted from Symbolic 


argument can have true premises and a false 
conclusion. 


Logic by Lewis Carroll.” 


. 28. 1. Every object that is to the right of all the blue 
All dogs are carnivorous. biects isab l the scans 
A aan objects is above all the triangles. 
annua 8- 2. If an object is a circle, then it is to the right of 
.. Aaron is not carnivorous. . 
Wh lude ab h lidi all the blue objects. 

a ma ae e avout te va P ei 3. If an object is not a circle, then it is not gray. 
ENA 1Y OF hE TOLONE Argument ent: *, All the gray objects are above all the triangles. 
Explain how the result from part (a) leads to 
this conclusion. 29. 1. All the objects that are to the right of all the 

Yx if h triangles are above all the circles. 
x, if P(@) then QQ). 2. If an object is not above all the black objects, 
~P(@ for a particular a. then it is not a square. 
` ~Q(@). 3. All the objects that are above all the black 
Indicate whether the arguments in 21-27 are valid or eee oo 
se uae All the squares are above all the circles. 
invalid. Support your answers by drawing diagrams. 
21. All people are mice. 30. 1. Ifan object is above all the triangles, then it is 
; above all the blue objects. 
All mice are mortal. cae ; 
. 2. Ifan object is not above all the gray objects, 
. All people are mortal. ae 
then it is not a square. 
22. All discrete mathematics students can tell a 3. Every black object is a square. 
valid argument from an invalid one. 4. Every object that is above all the gray objects 
All thoughtful people can tell a valid argument is above all the triangles. 
from an invalid one. If an object is black, then it is above all the 
*, All discrete mathematics students are thoughtful. blue objects. 
23. All teachers occasionally make mistakes. 31. 1. I trust every animal that belongs to me. 
No gods ever make mistakes. 2. Dogs gnaw bones. 
‘, No teachers are gods. 3. I admit no animals into my study unless they 
24. No vegetarians eat meat. wah bee when told todo so; . 
; 4. All the animals in the yard are mine. 
All vegans are vegetarian. . : 3 
. 5. I admit every animal that I trust into my study. 
. No vegans eat meat. : ae 
6. The only animals that are really willing to beg 
25. No college cafeteria food is good. when told to do so are dogs. 
No good food is wasted. All the animals in the yard gnaw bones. 
*. No coll feteria food i ted. 
BPE eee errant eg eee ner 32. 1. When I work a logic example without grum- 
26. All polynomial functions are differentiable. bling, you may be sure it is one I understand. 
All differentiable functions are continuous. 2. The arguments in these examples are not ar- 
*, All polynomial functions are continuous. ranged in regular order like the ones I am used 
. to. 
27. [Adapted from L Carroll. 
l idea An aS l 3. No easy examples make my head ache. 
Nothing intelligible ever puzzles me. ; ; 
Loci 4. I can’t understand examples if the arguments 
ogic puzzles me. : . 
“Lanes : ies are not arranged in regular order like the ones 
z. Logic is unintelligible. 
j oy P Iam used to. 
In exercises 28-32, reorder the premises in each of the 5. Inever grumble at an example unless it gives 


arguments to show that the conclusion follows as a valid 
consequence from the premises. It may be helpful to 
rewrite the statements in if-then form and replace some 
of them by their contrapositives. Exercises 28-30 refer to 
the kinds of Tarski worlds discussed in Examples 3.1.13 
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me a headache. 
These examples are not easy. 


*Lewis Carroll, Symbolic Logic (New York: Dover, 1958), 
pp. 118, 120, 123. 


In 33 and 34 a single conclusion follows when all the given 
premises are taken into consideration, but it is difficult 

to see because the premises are jumbled up. Reorder the 
premises to make it clear that a conclusion follows logi- 
cally, and state the valid conclusion that can be drawn. 

(It may be helpful to rewrite some of the statements in 
if-then form and to replace some statements by their 
contrapositives.) 


33. 1. No birds except ostriches are at least 2.75 meters 
tall. 

2. There are no birds in this aviary that belong to 
anyone but me. 

No ostrich lives on mince pies. 


4. Ihave no birds less than 2.75 meters high. 


G9 


All writers who understand human nature are 
clever. 


34. 1. 


ANSWERS FOR TEST YOURSELF 
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2. No one is a true poet unless he can stir the hu- 

man heart. 

Shakespeare wrote Hamlet. 

4. No writer who does not understand human 
nature can stir the human heart. 

5. None but a true poet could have written 
Hamlet. 


bo 


*35. Derive the validity of universal modus tollens 
from the validity of universal instantiation and 
modus tollens. 


*36. Derive the validity of universal form of part (a) of 
the elimination rule from the validity of universal 
instantiation and the valid argument called elimi- 
nation in Section 2.3. 


1. all elements; any particular element in the domain (Or: each 
individual element of the domain) 2. P(a) is true; Q(a) is 


true 3. Q(@) is false; P(a) is false 4. Any x that makes P(x) 


true makes R(x) true. 5. valid; invalid (Or: invalid; valid) 
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ELEMENTARY NUMBER 
THEORY AND METHODS 
OF PROOF 


The underlying content of this chapter consists of properties of integers (whole numbers), 
rational numbers (integer fractions), and real numbers. The underlying theme of the chap- 
ter is how to determine the truth or falsity of a mathematical statement. 

Here is an example involving a concept used frequently in computer science. Given any 
real number x, the floor of x, or greatest integer in x, denoted |x], is the largest integer that 
is less than or equal to x. On the number line, |x] is the integer immediately to the left of x 
(or equal to x if x is, itself, an integer). Thus [2.3] = 2, |12.99999| = 12, and |-1.5| = —2. 
Consider the following two questions: 


1. For any real number x, is |x — 1] = [x]—1? 
2. For any real numbers x and y, is |x — y] = |x| —Ly]? 


Take a few minutes to try to answer these questions for yourself. 

It turns out that the answer to (1) is yes, whereas the answer to (2) is no. Are these the an- 
swers you got? If not, don’t worry. In Section 4.6 you will learn the techniques you need to 
answer these questions and more. If you did get the correct answers, congratulations! You 
have excellent mathematical intuition. Now ask yourself, “How sure am I of my answers? 
Were they plausible guesses or absolute certainties? Was there any difference in certainty 
between my answers to (1) and (2)? Would I have been willing to bet a large sum of money 
on the correctness of my answers?” 

One of the best ways to think of a mathematical proof is as a carefully reasoned argu- 
ment to convince a skeptical listener (often yourself) that a given statement is true. Imagine 
the listener challenging your reasoning every step of the way, constantly asking, “Why is 
that so?” If you can counter every possible challenge, then your proof as a whole will be 
correct. 

As an example, imagine proving to someone not very familiar with mathematical nota- 
tion that if x is a number with 5x + 3 = 33, then x = 6. You could argue as follows: 


If 5x+3 = 33, then 5x+3 minus 3 will equal 33 — 3 because subtracting the same 
number from two equal quantities gives equal results. But 5x +3 minus 3 equals 5x 
because adding 3 to 5x and then subtracting 3 just leaves 5x. Also, 33 — 3 = 30. Hence 
5x = 30. This means that x is a number which when multiplied by 5 equals 30. But the 
only number with this property is 6. Therefore, if 5x +3 = 33 then x = 6. 


Of course there are other ways to phrase this proof, depending on the level of mathemat- 
ical sophistication of the intended reader. In practice, mathematicians often omit reasons 
for certain steps of an argument when they are confident that the reader can easily supply 
them. When you are first learning to write proofs, however, it is better to err on the side 
of supplying too many reasons rather than too few. All too frequently, when even the best 
mathematicians carefully examine some “details” in their arguments, they discover that 
those details are actually false. One of the most important reasons for requiring proof in 
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Note Most quotients of 
integers are not integers. 
For example, 3 + 2, which 
equals 3/2, is not an inte- 
ger, and 3 + 0 is not even 
a number. 
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mathematics is that writing a proof forces us to become aware of weaknesses in our argu- 
ments and in the unconscious assumptions we have made. 

Sometimes correctness of a mathematical argument can be a matter of life or death. 
Suppose, for example, that a mathematician is part of a team charged with designing a new 
type of airplane engine, and suppose that the mathematician is given the job of determin- 
ing whether the thrust delivered by various engine types is adequate. If you knew that the 
mathematician was only fairly sure, but not positive, of the correctness of his analysis, 
would you want to ride in the resulting aircraft? 

At a certain point in Lewis Carroll’s Alice in Wonderland (see exercise 28 in Section 2.2), 
the March Hare tells Alice to “say what you mean.” In other words, if she means a thing, 
then that is exactly what she should say. In this chapter, perhaps more than in any other 
mathematics course you have ever taken, you will need to say what you mean. Precision of 
thought and language is essential to achieve the mathematical certainty that is necessary for 
you to have complete confidence in your solutions to mathematical problems. 


Direct Proof and Counterexample I: Introduction 


Mathematics, as a science, commenced when first someone, probably a Greek, 
proved propositions about “any” things or about “some” things without specification 
of definite particular things. —Alfred North Whitehead, 1861-1947 


Both discovery and proof are integral parts of problem solving. When you think you have 
discovered that a certain statement is true, try to figure out why it is true. If you succeed, you 
will know that your discovery is genuine. Even if you fail, the process of trying will give you 
insight into the nature of the problem and may lead you to discover that the statement is false. 
For complex problems, the interplay between discovery and proof is not reserved to the end 
of the problem-solving process but, rather, is an important part of each step. 


Assumptions 


e In this text we assume a familiarity with the laws of basic algebra, which are listed 
in Appendix A. 


e We also use the three properties of equality: For all objects A, B, and C, (1) A = A, 
(2) if A = B, then B = A, and (3) if A = B and B = C, then A = C. 

e And we use the principle of substitution: For all objects A and B, if A = B, then 
we may substitute B wherever we have A. 


e In addition, we assume that there is no integer between 0 and | and that the set of 
all integers is closed under addition, subtraction, and multiplication. This means 
that sums, differences, and products of integers are integers. 


The mathematical content of this section primarily concerns even and odd integers and 
prime and composite numbers. 


Even, Odd, Prime, and Composite Integers 


In order to evaluate the truth or falsity of a statement, you must understand what the state- 
ment is about. In other words, you must know the meanings of all terms that occur in 
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the statement. Mathematicians define terms very carefully and precisely and consider it 
important to learn definitions virtually word for word. 


Definitions 


An integer n is even if, and only if, n equals twice some integer. An integer n is odd 
if, and only if, n equals twice some integer plus 1. 
Symbolically, for any integer, n 
niseven <> n = 2k for some integer k 


nisodd < n= 2k+1 for some integer k 


It follows from the definition that if you are doing a problem in which you know that a 
particular integer is even, you can deduce that it has the form 2: (some integer). Conversely, 
if you know that an integer equals 2-(some integer), then you can deduce that the integer 


is even. 
Know a particular orn n has the form 
integer n is even. => 2-(some integer). 
Know n has the form deduce : 
n is even. 


2-(some integer). 


This illustrates why both the if and the only-if parts of definitions are important in math- 
ematical reasoning. In stating definitions, however, mathematics books often replace the 
words if-and-only-if by the single word if, perhaps to seem less formal. For instance, the 
definition of even might be given as “An integer is even if it equals twice some integer.” But 
when the definition is actually used in a proof, both the if and the only-if parts are usually 
needed. So, even when the only-if part of a definition is not stated explicitly, you are sup- 
posed to understand intuitively that it should be included. 

Also observe that the definitions of even and odd integers are quantified statements. 
In Section 3.1 we pointed out that variables used in quantified statements are local, 
which means that they are bound by the quantifier to which they are attached and that 
their scopes extend only to the end of the quantified statements that contain them. As 
a result, the particular names used for the variables have no meaning themselves and 
are freely replaceable by other names. For example, you can substitute any symbols you 
like in place of n and k in the definitions of even and odd without changing the meaning 
of the definitions. 


For every integer n, n is even if, and only if, n = 2r for some integer r. 
For every integer m, m is even if, and only if, m = 2a for some integer a. 
For every integer a, a is odd if, and only if, a = 2s + 1 for some integer s. 


For every integer k, k is odd if, and only if, k = 2n +1 for some integer n. 


Even and Odd Integers 


Use the definitions of even and odd to justify your answers to the following questions. 
a. Is 0 even? 
b. Is —301 odd? 


c. Ifa and b are integers, is 6a°b even? 
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d. If a and b are integers, is 10a + 8b +1 odd? 


e. Is every integer either even or odd? 


Solution 
a. Yes, 0 is even because 0 = 2-0. 


b. Yes, —301 is odd because —301 = 2(—151) +1 and —151 is an integer. 


c. Yes, 6a°b is even because 6a°b = 2(3a7b) and 3a°b is an integer since it is a product 
of integers. 


d. Yes, 10a + 8b +1 is odd because 10a + 8b + 1 = 2(5a + 4b) + 1 and 5a + 4b is an 
integer since it is a sum of products of integers. 


e. Yes, every integer is either even or odd. However, the reason for this fact is not im- 
mediately apparent. It can be deduced using the method of proof by contradiction, 
which is introduced in Section 4.7. It is also a consequence of the quotient-remainder 
theorem, which is stated in Section 4.5. a 


The integer 6, which equals 2-3, is a product of two smaller positive integers. On the 
other hand, 7 cannot be written as a product of two smaller positive integers; its only posi- 
tive factors are 1 and 7. A positive integer, such as 7, that cannot be written as a product of 
two smaller positive integers is called prime. 


Definition 


An integer n is prime if, and only if, n > 1 and for all positive integers r and s, if 
n = rs, then either r or s equals n. An integer n is composite if, and only if, n > 1 
and n = rs for some integers r and s with 1 <<r<nandl<s<n. 

In symbols: For each integer n with n > 1, 


nis prime <> V positive integers r and s, if n = rs 
then either r = 1 ands = norr = nand s =1. 


nis composite <> 4d positive integers r and s such that n = rs 
and l<r<nandl<s<n. 


Prime and Composite Numbers 
a. Is 1 prime? 
b. Is every integer greater than | either prime or composite? 
c. Write the first six prime numbers. 


d. Write the first six composite numbers. 


Solution 


Note The reason fornot a. No. A prime number is required to be greater than 1. 
allowing 1 to be prime is 


dealin Sek b. Yes. Let n be any integer that is greater than 1. Consider all pairs of positive integers 


rand s such that n = rs. There exist at least two such pairs, namely, r = n and s = 1 
andr = | and s = n. Moreover, since n = rs, all such pairs satisfy the inequalities 
1=r=nand1=s <n. Ifnis prime, then these two pairs are the only ways to write 
nas rs. Otherwise, there exists a pair of positive integers r and s such that n = rs and 
neither r nor s equals either 1 or n. Therefore, in this case 1 < r < n and 1 < s <n, and 
hence n is composite. 
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c. 2, 3,5, 7, 11, 13 
d. 4, 6, 8, 9, 10, 12 | 


Proving Existential Statements 
According to the definition given in Section 3.1, a statement in the form 
dx E D such that Q(x) 
is true if, and only if, 
Q(x) is true for at least one x in D. 


One way to prove this is to find an x in D that makes Q(x) true. Another way is to give a set 
of directions for finding such an x. Both of these methods are called constructive proofs 
of existence. The logical principle underlying such a proof is called existential general- 
ization. It says that if you know a certain property is true for a particular object, then you 
may conclude that “there exists an object for which the property is true.” 


Sear Constructive Proofs of Existence 


a. Prove: Jan even integer n that can be written in two ways as a sum of two 
prime numbers. 


b. Suppose that r and s are integers. Prove: J an integer k such that 22r + 18s = 2k. 


Solution 

a. Letn = 10. Then 10 = 5+5 =3+7 and 3, 5, and 7 are all prime numbers. Thus 
Jan even integer—namely, 10—that can be written in two ways as a sum of two 
prime numbers. 


b. Let k = 11r+9s. Then kis an integer because it is a sum of products of integers, and 
by substitution, and the distributive law of algebra, 


2k = 2(11r + 9s) = 22r+ 18s. 


Thus 4 an integer, namely k, such that 22r + 18s = 2k. E 


A nonconstructive proof of existence involves showing either (a) that the existence 
of a value of x that makes Q(x) true is guaranteed by an axiom or a previously proved 
theorem or (b) that the assumption that there is no such x leads to a contradiction. The 
disadvantage of a nonconstructive proof is that it may give virtually no clue about where 
or how x may be found. The widespread use of digital computers in recent years has 
led to some dissatisfaction with this aspect of nonconstructive proofs and to increased 
efforts to produce constructive proofs containing directions for computer calculation of 
the quantity in question. 


Disproving Universal Statements by Counterexample 


To disprove a statement means to show that it is false. Consider the question of disproving 
a statement of the form 


Yx in D, if P(x) then Q. 
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Example 4.1.4 


Note Example 4.1.4 
shows that it is not always 
true that if the squares of 
two numbers are equal, 
then the numbers are 
equal. However, it is true 
that if the squares of two 
positive numbers are 
equal, then the numbers 
are equal. 
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Showing that this statement is false is equivalent to showing that its negation is true. The 
negation of the statement is existential: 


Jx in D such that P(x) and not Q(x). 


But to show that an existential statement is true, we generally give an example, and because 
the example is used to show that the original statement is false, we call it a counterexample. 
Thus the method of disproof by counterexample can be written as follows: 


Disproof by Counterexample 


To disprove a statement of the form “Vx E D, if P(x) then Q(x); find a value of x in 
D for which the hypothesis P(x) is true and the conclusion Q(x) is false. Such an x is 
called a counterexample. 


Disproof by Counterexample 
Disprove the following statement by finding a counterexample: 


Y real numbers a and b, if a = b’ thena = b. 


Solution To disprove this statement, you need to find real numbers a and b such that the 
hypothesis a’ = b° is true and the conclusion a = b is false. The fact that both positive and 
negative integers have positive squares helps in the search. If you flip through some possi- 
bilities in your mind, you will quickly see that 1 and —1 will work (or 2 and —2, or 0.5 and 
—0.5, and so forth). You only need one such pair to give a counterexample. 


Statement: V real numbers a and b, if g= b’, then a = b. 


Counterexample: Let a = 1 and b = —1. Then a” = 1° = 1 and b’ = (-1)° = 1, 
and so a” = b°. Buta # b since 1 # —1. 


After observing that a property holds in a large number of cases, you may guess that it 
holds in all cases. You may, however, run into difficulty when you try to prove your guess. 
Perhaps you just have not figured out the key to the proof, or perhaps your guess is false. 
Consequently, when you are having serious difficulty proving a general statement, you 
should interrupt your efforts to look for a counterexample. Analyzing the kinds of prob- 
lems you are encountering in your proof efforts may help in the search. It may even happen 
that if you find a counterexample and therefore prove the statement false, your understand- 
ing may be sufficiently clarified so that you can formulate a more limited but true version 
of the statement by changing the hypothesis. 


Proving Universal Statements 


The vast majority of mathematical statements to be proved are universal. In discussing 
how to prove such statements, it is helpful to imagine them in a standard form: 


Yx E D, if P(x) then Q(x). 
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Sections 1.1 and 3.1 give examples showing how to write any universal statement in this 
form. When D is finite or when only a finite number of elements satisfy P(x), such a state- 
ment can be proved by the method of exhaustion. 


The Method of Exhaustion 


Use the method of exhaustion to prove the following statement: 


Vn E Z, if n is even and 4 =n S 26 then n can be written as a sum of 
two prime numbers. 


Solution =2+2 6=3+3 8=3+5 10 =5+5 
12=5+7 14=11+3 16=5+11 18=7+11 
20 =7+13 22=5+17 24=5+19 26 =7+19 a 


In most mathematical situations, however, the method of exhaustion cannot be used. For 
instance, to prove by exhaustion that every even integer greater than 2 can be written as a 
sum of two prime numbers you would have to check every even integer. But this is impos- 
sible because there are infinitely many such numbers. 

Even when the domain is finite, it may be infeasible to use the method of exhaustion. 
Imagine, for example, trying to check by exhaustion that the multiplication circuitry of a 
particular computer gives the correct result for every pair of numbers in the computer’s 
range. Since a typical computer would require thousands of years just to compute all pos- 
sible products of all numbers in its range (not to mention the time it would take to check 
the accuracy of the answers), checking correctness by the method of exhaustion is obvi- 
ously impractical. 

The most powerful technique for proving a universal statement is one that works re- 
gardless of the size of the domain over which the statement is quantified. It is based on a 
logical principle sometimes called universal generalization. A more descriptive name is 
generalizing from the generic particular. 


Generalizing from the Generic Particular 


To show that every element of a set satisfies a certain property, suppose x is a 
particular but arbitrarily chosen element of the set, and show that x satisfies 
the property. 


The principle of generalizing from the generic particular is not a typical part of everyday 
reasoning. Its main use is to determine that a general mathematical statement is correct. 
The example below introduces the idea. 


Seek Generalizing from the Generic Particular 


At some time you may have been shown a “mathematical trick” like the following. You 
ask a person to pick any number, add 5, multiply by 4, subtract 6, divide by 2, and sub- 
tract twice the original number. Then you astound the person by announcing that their 
final result was 7. How does this “trick” work? Imagine that an empty box LJ contains 
whatever number the person picked. The table shows that by the end of the calculations, 
whatever was in the empty box was subtracted out of the answer. 
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Step Visual Result Algebraic Result 

Pick a number. x 

Add 5. || | x+5 

Multiply by 4. || | 
it (x+5)-4 = 4x+20 
i 

Subtract 6. 
I (4x + 20) —6 = 4x+ 14 
[I | 

Divide by 2. ia ati oy 47 


Subtract twice the original number. (Qx+7)—2n =7 


The x in the table above is another way of holding a place for the number the person picked. 
It is particular (because it is a single object), but it is also arbitrarily chosen or generic 
(because any number whatsoever can be put in its place). So you can generalize from the 
generic particular to conclude that if you follow the steps you will always get 7, regardless 
of the initial value you put in place of x or inside the empty box. E 


The point of having x be arbitrarily chosen (or generic) is to make a proof that can be gen- 
eralized to all elements of the domain. By choosing x arbitrarily, you are making no special 
assumptions about x that are not also true of all other elements of the domain. The word generic 
means “sharing all the common characteristics of a group or class.” Thus everything you de- 
duce about a generic element x of the domain is equally true of any other element of the domain. 

When the method of generalizing from the generic particular is applied to a property of 
the form “If P(x) then Q(a),” the result is the method of direct proof. Recall that the only 
way an if-then statement can be false is for the hypothesis to be true and the conclusion to 
be false. Thus, given the statement “If P(x) then Q(x),” if you can show that the truth of P(x) 
compels the truth of Q(x), then you will have proved the statement. It follows by the method 
of generalizing from the generic particular that to show that “Vx, if P(x) then Q(),” is true 
for every element x in a set D, you suppose x is a particular but arbitrarily chosen element 
of D that makes P(x) true, and then you show that x makes Q(x) true. 


Method of Direct Proof 
1. Express the statement to be proved in the form “For every x € D, if P(x) then 
Q(x).” (This step is often done mentally.) 


2. Start the proof by supposing x is a particular but arbitrarily chosen element of D 
for which the hypothesis P(x) is true. (This step is often abbreviated “Suppose 
x E D and P(a).”) 


3. Show that the conclusion Q(x) is true by using definitions, previously established 
results, and the rules for logical inference. 
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A Direct Proof of a Theorem 


A Prove that the sum of any two even integers is even. 
Solution Whenever you are presented with a statement to be proved, it is a good idea to 
Caution! The word two ask yourself whether you believe it to be true. In this case you might imagine some pairs 


in this statement does of even integers—say 2+4,6+ 10, 12+12,28+54—and mentally check that their 
not necessarily refer T sums are even. However, since you cannot possibly check all pairs of even numbers, you 
a t Itt oEptS. Ifa cannot know for sure that the statement is true in general by checking its truth in these 
choice of integers is made ; i : : 
sas : particular instances. Many properties hold for a large number of examples and yet fail to 
arbitrarily, the integers : 
be true in general. 


are very likely to be 
distinct, but they might be To prove this statement in general, you need to show that no matter what even inte- 
the samé: gers are given, their sum is even. But given any two even integers, it is possible to repre- 
sent them as 2r and 2s for some integers r and s. And by the distributive law of algebra, 
2r+2s = 2(r + s), which is even because r+ s is an integer. Thus the statement is true 
in general. 
Suppose the statement to be proved is much more complicated than this. What method 


can you use to derive a proof? You can begin by expressing the statement formally. 
Formal Restatement: V integers m and n, if m and n are even then m + n is even. 


This statement is universally quantified over an infinite domain. Thus to prove it in gen- 
eral, you need to show that no matter what two integers you might be given, if both of them 
are even then their sum will also be even. 

Next ask yourself, “How should I start the proof?” or “What am I supposing?” The an- 
swer to such a question gives you the starting point, or first sentence, of the proof. 


Starting Point: Suppose m and n are any particular but arbitrarily chosen integers that are 
even. 


Or, in abbreviated form: 
Suppose m and n are any even integers. 


Then ask yourself, “What conclusion do I need to show in order to complete the proof?” 


To Show: m+n is even. 


At this point you need to ask yourself, “How do I get from the starting point to the con- 
clusion?” Since both involve the term even integer, you must use the definition of this 
term—and thus you must know what it means for an integer to be even. It follows from the 
definition that since m and n are even, each equals twice some integer. One of the basic 
laws of logic, called existential instantiation, says, in effect, that if you know something 
exists, you can give it aname. However, you cannot use the same name to refer to two dif- 
ferent things, both of which are currently under discussion. 


Existential Instantiation 
A If the existence of a certain kind of object is assumed or has been deduced, then 
Caution! Because mand it can be given a name, as long as that name is not currently being used to refer to 


nare arbitrarily chosen something else in the same discussion. 


they can be any pair of 
even integers whatsoever. 


But if you write m = 2r Thus since m equals twice some integer, you can give that integer a name, and since n 


andn = 2r, then m would equals twice some integer, you can also give that integer a name: 
equal n, which is not 


usually the case. m = 2r, for some integer r and n = 2s, for some integer s. 
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Note Introducing ż to 
equal r + s is another use 
of existential instantiation. 
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Now what you want to show is that m + n is even. In other words, you want to show that 
m+n equals 2-(some integer). Having just found alternative representations for m (as 27) 
and n (as 2s), it seems reasonable to substitute these representations in place of m and n: 


mt+n = 2r+2s. 


Your goal is to show that m + n is even. By definition of even, this means that m+n can 
be written in the form 


2-(some integer). 


This analysis narrows the gap between the starting point and what is to be shown to 
showing that 


2r+2s = 2-(some integer). 
Why is this true? First, because of the distributive law from algebra, which says that 
2r+2s = 2(rt+s), 


and, second, because the sum of any two integers is an integer, which implies that r+ s is 
an integer. 

This discussion is summarized by rewriting the statement as a theorem and giving a 
formal proof of it. (In mathematics, the word theorem refers to a statement that is known to 
be true because it has been proved.) The formal proof, as well as many others in this text, 
includes explanatory notes to make its logical flow apparent. Such comments are purely a 
convenience for the reader and could be omitted entirely. For this reason they are italicized 
and enclosed in italic square brackets: / J. 

Donald Knuth, one of the pioneers of the science of computing, has compared con- 
structing a computer program from a set of specifications to writing a mathematical proof 
based on a set of axioms.* In keeping with this analogy, the bracketed comments can be 
thought of as similar to the explanatory documentation provided by a good programmer. 
Documentation is not necessary for a program to run, but it helps a human reader under- 
stand what is going on. 


Theorem 4.1.1 


The sum of any two even integers is even. 


Proof: Suppose m and n are any [particular but arbitrarily chosen] even integers. [We 
must show that m+n is even.] By definition of even, m = 2r and n = 2s for some 
integers r and s. Then 


m+n = 2rt+2s by substitution 
= 2(rt+s) by factoring out a 2. 


Let t = r+ s. Note that t is an integer because it is a sum of integers. Hence 
m+n=2t where fis an integer. 


It follows by definition of even that m + n is even. [This is what we needed to show.]' 


*Donald E. Knuth, The Art of Computer Programming, 2nd ed., Vol. I (Reading, MA: Addison-Wesley, 
1973), p. ix. 


‘See page 148 for a discussion of the role of universal modus ponens in this proof. 
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Most theorems, like Theorem 4.1.1, can be analyzed to a point where you realize that 
as soon as a certain thing is shown, the theorem will be proved. When that thing has been 
shown, it is natural to end the proof with the words “this is what we needed to show” or “as 
was to be shown.” The Latin words for this are quod erat demonstrandum, or Q.E.D. for 
short. Proofs in older mathematics books end with these initials. 

Note that both the if and the only if parts of the definition of even were used in the 
proof of Theorem 4.1.1. Since m and n were known to be even, the only if (=) part of the 
definition was used to deduce that m and n had a certain general form. Then, after some 
algebraic substitution and manipulation, the if (=) part of the definition was used to de- 
duce that m + n was even. 


Getting Proofs Started 


Believe it or not, once you understand the idea of generalizing from the generic particular 
and the method of direct proof, you can write the beginnings of proofs even for theorems 
you do not understand. The reason is that the starting point and what is to be shown in a 
proof depend only on the linguistic form of the statement to be proved, not on the content 
of the statement. 


Identifying the “Starting Point” and the “Conclusion to Be Shown” 


Write the first sentence of a proof (the “starting point”) and the last sentence of a proof (the 
“conclusion to be shown”) for the following statement: 


Note You are not expected Every complete bipartite graph is connected. 

to know anything about 

complete bipartite graphs. Solution Itis helpful to rewrite the statement formally using a quantifier and a variable: 
domain hypothesis conclusion 


A a a 
Formal Restatement: For every graph G, if G is complete bipartite, then G is connected. 


The first sentence, or starting point, of a proof supposes the existence of an object (in this 
case G) in the domain (in this case the set of all graphs) that satisfies the hypothesis of the 
if-then part of the statement (in this case that G is complete bipartite). The conclusion to 
be shown is just the conclusion of the if-then part of the statement (in this case that G is 
connected). 


Starting Point: Suppose G is a [particular but arbitrarily chosen] graph such that G is 
complete bipartite. 


Conclusion to Be Shown: G is connected. 
Thus the proof has the following shape: 


Proof: 
Suppose G is a [particular but arbitrarily chosen] graph such that G is complete bipartite. 


Therefore, G is connected. E 


Fill in the Blanks for a Proof 


Fill in the blanks in the proof of the following theorem. 


Theorem: For all integers r and s, if r is even and s is odd then 3r + 2s is even. 
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Proof: 
Suppose r and s are any [particular but arbitrarily chosen] integers such that r is even 
and s is odd. 
[We must show that 3r + 2s is even.] 


By _@ | r = 2m and s = 2n + | for some integers m and n. 


Then 
3r+ 2s = 3(2m) + 2(2n + 1) by © _ 
= 6m+4n+2 by multiplying out 
= 2(3m+2n+ 1) by factoring out 2 


Lett = 3m+2n+1. 

Then ż is an integer because m, n, 3, 2, and | are integers and because ©, 

Hence 3r+2s = 2t, where t is an integer, and so by _@) 3r+2s is even 
[as was to be shown]. 


Solution 


(a) definition of even and odd, (b) substitution, (c) products and sums of integers are inte- 
gers, (d) definition of even. 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. An integer is even if, and only if, ___. 5. According to the method of generalizing from the 
generic particular, to show that every element of 


2. An integer is odd if, and only if, xg : . 
a Set satisfies a certain property, suppose x is a 
3. An integer n is prime if, and only if, —— ______, and show that 
4. The most common way to disprove a universal 6. To use the method of direct proof to prove a state- 
statement is to find___. ment of the form, “For every x in a set D, if P(x) 
then Q(x); one supposes that ____ and one 
shows that 


EXERCISE SET 4.1* 


In 1-4 justify your answers by using the definitions of 4. Assume that r and s are particular integers. 

even, odd, prime, and composite numbers. a. Is 4rs even? 

b. Is 6r+4s* +3 odd? 

c. Ifr and s are both positive, is r+2rsts? 
composite? 


1. Assume that k is a particular integer. 
a. Is —17 an odd integer? 
b. Is 0 neither even nor odd? 


c. Is 2k—1 odd? Prove the statements in 5-11. 


2. Assume that c is a particular integer. 
a. Is —6c an even integer? 


b. Is 8c +5 an odd integer? 
c. Is (c? + 1)—(c?— 1) —2 an even integer? 6. There are distinct integers m and n such that 


11. ; 
m +7 is an integer. 


5. There are integers m and n such that m > 1 and 
n> l and m + nis an integer. 


3. Assume that m and n are particular integers. 
a. Is 6m + 8n even? 7. There are real numbers a and b such that 


b. Is 10mn +7 odd? Woah = Va+ Vb. 
c. Ifm>n>0,ism-n’ composite? 


*For exercises with blue numbers, solutions are given in Appendix B. The symbol H indicates that only a hint or partial solution is given. The 
symbol * signals that an exercise is more challenging than usual. 
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8. There is an integer n > 5 such that 2” — 1 is prime. 


9. There is areal number x such that x > 1 and 
2 >x! 


Definition: An integer n is called a perfect square if, 
and only if, n = K? for some integer k. 


10. There is a perfect square that can be written as a 
sum of two other perfect squares. 


11. There is an integer n such that 2n? — 5n+2 is 
prime. 


In 12-13, (a) write a negation for the given statement, 
and (b) use a counterexample to disprove the given state- 
ment. Explain how the counterexample actually shows 
that the given statement is false. 


12. For all real numbers a and b, if a < b then 
a <b’. 


13. For every integer n, if n is odd then nt is odd. 


Disprove each of the statements in 14-16 by giving a 
counterexample. In each case explain how the counterex- 
ample actually disproves the statement. 


14. For all integers m and n, if 2m + n is odd then m 
and n are both odd. 


15. For every integer p, if p is prime then pP — lis 
even. 


16. For every integer n, if n is even then n+ Lis 
prime. 

In 17-20, determine whether the property is true for all 

integers, true for no integers, or true for some integers 

and false for other integers. Justify your answers. 


17. (a+b = @ +b? 
ac ate 
—+— = 

b d btd 
19. —a" = (—a)" 


18. 


20. The average of any two odd integers is odd. 


Prove the statement in 21 and 22 by the method of 
exhaustion. 


21. Every positive even integer less than 26 can be ex- 
pressed as a sum of three of fewer perfect squares. 
(For instance, 10 = 17+ 3° and 16 = 4°.) 


22. For each integer n with 1 =n = 10, n?°—n+1lis 
a prime number. 

Each of the statements in 23-26 is true. For each, (a) 

rewrite the statement with the quantification implicit as 

If , then , and (b) write the first sentence 
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of a proof (the “starting point”) and the last sentence of 

a proof (the “conclusion to be shown”). (Note that you do 
not need to understand the statements in order to be able 
to do these exercises.) 


23. For every integer m, if m > 1 then 0 < o <1. 
24. For every real number x, if x > 1 then > x. 


25. For all integers m and n, if mn = 1 thenm =n =1 
orm=n=-—l. 


26. For every real number x, if 0 < x < | then vr <x. 


H 27. Fill in the blanks in the following proof. 


Theorem: For every odd integer n, n? is odd. 


Proof: Suppose n is any _@ By definition of 
odd, n = 2k + 1 for some integer k. Then 


n? = ( _(b) p by substitution 
= 42 +4k+1 by multiplying out 
= UK +2k)+1 by factoring out a 2 


Now 2k? + 2k is an integer because it is a sum of 
products of integers. Therefore, n equals 2: (an in- 
teger) + 1, and so © is odd by definition of odd. 
Because we have not assumed anything about n 
except that it is an odd integer, it follows from the 
principle of that for every odd integer n, 


n? is odd. 
In each of 28-31: a. Rewrite the theorem in three differ- 
ent ways: as V. „if then „as 
V___,____ (without using the words if or then), 
and as If , then (without using an explicit 


universal quantifier). 
b. Fill in the blanks in the proof of the theorem. 


28. Theorem: The sum of any two odd integers is even. 


Proof: Suppose m and n are any [particular but 
arbitrarily chosen] odd integers. 


[We must show that m + n is even.] 


By © m= 2r+1andn = 2s + 1 for some 
integers r and s. 


Then 
mtn=(2rt1)+(Q2st1) by © 
= 2r+2s+2 
= 2(rt+st+1) by algebra. 


Let u = r+s+1. Then u is an integer because r, 
s, and | are integers and because ©. 


4.2 DIRECT PROOF AND COUNTEREXAMPLE II: WRITING ADVICE 173 


Hence m+n = 2u, where u is an integer, and so, 
by @ m +n is even [as was to be shown]. 


29. Theorem: The negative of any even integer is 
even. 


Proof: Suppose n is any [particular but arbitrarily 
chosen] even integer. 


[We must show that —n is even.] 


By _@_, n = 2k for some integer k. 


Then 
=n = —(2k) by ©) _ 
= 2(—-k) by algebra. 
Let r = —k. Then r is an integer because (— 1) and 


k are integers and _©)_. 


Hence —n = 2r, where r is an integer, and so —n is 
even by (d) [as was to be shown]. 


30. Theorem 4.1.2: The sum of any even integer and 
any odd integer is odd. 


Proof: Suppose m is any even integer and n is 
_f@) By definition of even, m = 2r for some 
_(b) and by definition of odd, n = 2s + 1 for 


ANSWERS FOR TEST YOURSELF 


some integer s. By substitution and algebra, 
m+n=_© = 2(r+s)+1. 
Since r and s are both integers, so is their sum 


r+ s. Hence m+n has the form twice some inte- 
ger plus one, and so @d) by definition of odd. 


31. Theorem: Whenever n is an odd integer, 5n° +7 
is even. 


Proof: Suppose n is any [particular but arbitrarily 
chosen] odd integer. 


[We must show that 5n? +7 is even. ] 


By definition of odd, n = (@)__ for some integer k. 
Then 
5n°-+7 = _(b)_ by substitution 


5(4 +4k+1)+7 
= 20K + 20k + 12 


= 2(10k + 10k +6) by algebra. 


Let t = _(©_. Then ris an integer because prod- 
ucts and sums of integers are integers. 


Hence 5n? +7 = 2t, where f is an integer, and thus 
_@)_ by definition of even fas was to be shown]. 


1. it equals twice some integer 2. it equals twice some 
integer plus 1 3. n is greater than 1 and if n equals the 
product of any two positive integers, then one of the integers 
equals 1 and the other equals n. 4. a counterexample 


5. particular but arbitrarily chosen element of the set; 
x satisfies the given property 6. x is a particular but 
arbitrarily chosen element of the set D that makes the 
hypothesis P(x) true; x makes the conclusion Q(x) true. 


EFJ Direct Proof and Counterexample II: Writing Advice 


“... it is demanded for proof that every doubt becomes impossible.” —Carl Friedrich 


Gauss (1777-1855) 


Think of a proof as a way to communicate a convincing argument for the truth of a math- 
ematical statement. When you write a proof, try to be clear and complete. Keep in mind 
that a classmate reading your proof will see only what you actually write down, not any 
unexpressed thoughts behind it. Ideally, your proof will lead your reader to understand why 


the given statement is true. 


Directions for Writing Proofs of Universal Statements 


Over the years, the following rules of style have become fairly standard for writing the 


final versions of proofs: 


1. Copy the statement of the theorem to be proved on your paper. 
This makes the theorem statement available for reference to anyone reading the proof. 
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2. Clearly mark the beginning of your proof with the word Proof. 
This word separates general discussion about the theorem from its actual proof. 


3. Make your proof self-contained. 

This means that you should explain the meaning of each variable used in your proof 
in the body of the proof. Thus you will begin proofs by introducing the initial variables 
and stating what kind of objects they are. The first sentence of your proof would be 
something like “Suppose m and n are any even integers” or “Let x be a real number such 
that x is greater than 2.” This is similar to declaring variables and their data types at the 
beginning of a computer program. 

At a later point in your proof, you may introduce a new variable to represent a quan- 
tity that is known at that point to exist. For example, if you have assumed that a particu- 
lar integer n is even, then you know that n equals 2 times some integer, and you can give 
this integer a name so that you can work with it concretely later in the proof. Thus if 
you decide to call the integer, say, s, you would write, “Since n is even, n = 2s for some 
integer s,” or “since n is even, there exists an integer, say s, such that n = 2s.” 


4. Write your proof in complete, grammatically correct sentences. 
This does not mean that you should avoid using symbols and shorthand abbrevia- 
tions, just that you should incorporate them into sentences. For example, the proof of 
Theorem 4.1.1 contains the sentence 


Then m+n = 2r+2s by substitution 
= 2(rt+s) by factoring out 2. 


To read such text as a sentence, read the first equals sign as “equals” and each subse- 
quent equals sign as “which equals.” 


5. Keep your reader informed about the status of each statement in your proof. 

Your reader should never be in doubt about whether something in your proof has 
been assumed or established or is still to be deduced. If something is assumed, preface it 
with a word like Suppose or Assume. If it is still to be shown, preface it with words like, 
We must show that or In other words, we must show that. This is especially important if 
you introduce a variable in rephrasing what you need to show. (See Common Mistakes 
on the next page.) 


6. Give a reason for each assertion in your proof. 

Each assertion in a proof should come directly from the hypothesis of the theorem, 
or follow from the definition of one of the terms in the theorem, or be a result obtained 
earlier in the proof, or be a mathematical result that has previously been established or 
is agreed to be assumed. Indicate the reason for each step of your proof using phrases 
such as by hypothesis, by definition of... by theorem... and so forth. 

It is best to refer to definitions and theorems by name or number. If you need to 
state one in the body of your proof, avoid using a variable when you write it because 
otherwise your proof could end up with a variable that has two conflicting meanings.* 

Proofs in more advanced mathematical contexts often omit reasons for some steps 
because it is assumed that students either understand them or can easily figure them out 
for themselves. However, in a course that introduces mathematical proof, you should 
make sure to provide the details of your arguments because you cannot guarantee that 
your readers have the necessary mathematical knowledge and sophistication to supply 
them on their own. 

*When a variable is used to state a definition, the scope of the variable extends only to the end of the defini- 


tion. After that, the symbol for the variable no longer has the same meaning. Confusion can result from think- 
ing that the meaning of the symbol continues into other parts of the proof. 
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7. Include the “little words and phrases” that make the logic of your arguments clear. 
When writing a mathematical argument, especially a proof, indicate how each sen- 
tence is related to the previous one. Does it follow from the previous sentence or from a 
combination of the previous sentence and earlier ones? If so, start the sentence with the 
word Because or Since and state the reason why it follows, or write Then, or Thus, or 
So, or Hence, or Therefore, or Consequently, or It follows that, and include the reason 
at the end of the sentence. For instance, in the proof of Theorem 4.1.1, once you know 
that m is even, you can write: “By definition of even, m = 2r for some integer r,’ or you 
can write, “Then m = 2r for some integer r by definition of even.” And when you write 
“Then m+n = 2r + 2s,” add the words by substitution to explain why you are allowed 
to write 2r in place of m and 2s in place of n. 

If a sentence expresses a new thought or fact that does not follow as an immediate 
consequence of the preceding statement but is needed for a later part of a proof, intro- 
duce it by writing Observe that, or Note that, or Recall that, or But, or Now. 

Sometimes in a proof it is desirable to define a new variable in terms of previous 
variables. In such a case, introduce the new variable with the word Let. For instance, 
in the proof of Theorem 4.1.1, once it is known that m +n = 2(r + s), where r and s are 
integers, a new variable ¢ is introduced to represent r + s. The convention in mathemat- 
ics and computer science is to put a new variable to the left of the equal sign and the 
expression that defines it to the right of the sign. Thus the proof goes on to say, “Let 
t = r+ s. Then fis an integer because it is a sum of two integers.” 


8. Display equations and inequalities. 

The convention is to display equations and inequalities on separate lines to increase 
readability, both for other people and for ourselves so that we can more easily check our 
work for accuracy. We follow the convention in the text of this book, but in order to save 
space, we violate it in a few of the exercises and in many of the solutions contained in 
Appendix B. So you may need to copy out some parts of solutions on scratch paper to 
understand them fully. Please follow the convention in your own work. Leave plenty of 
empty space, and don’t be stingy with paper! 


Variations among Proofs 


It is rare that two proofs of a given statement, written by two different people, are identical. 
Even when the basic mathematical steps are the same, the two people may use different 
notation or may give differing amounts of explanation for their steps, or may choose dif- 
ferent words to link the steps together into paragraph form. An important question is how 
detailed to make the explanations for the steps of a proof. This must ultimately be worked 
out between the writer of a proof and the intended reader, whether they be student and 
teacher, teacher and student, student and fellow student, or mathematician and colleague. 
Your teacher may provide explicit guidelines for you to use in your course. Or you may 
follow the example of the proofs in this book (which are generally explained rather fully 
in order to be understood by students at various stages of mathematical development). 
Remember that the phrases written inside brackets / / are intended to elucidate the logical 
flow or underlying assumptions of the proof and need not be written down at all. It is your 
decision whether to include such phrases in your own proofs. 


Common Mistakes 


The following are some of the most common mistakes people make when writing math- 
ematical proofs. 
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1. Arguing from examples. 

Looking at examples is one of the most helpful practices a problem solver can engage 
in and is encouraged by all good mathematics teachers. However, it is a mistake to think 
that a general statement can be proved by showing it to be true for some individual 
cases. A property referred to in a universal statement may be true in many instances 
without being true in general. 

Consider the following “proof” that the sum of any two even integers is even 
(Theorem 4.1.1). 


This is true because if m = 14 and n = 6, which are both even, 
then m +n = 20, which is also even. 


Some people find this kind of argument convincing because it does, after all, 
consist of evidence in support of a true conclusion. But remember that when we 
discussed valid arguments, we pointed out that an argument may be invalid and 
yet have a true conclusion. In the same way, an argument from examples may be 
mistakenly used to “prove” a true statement. In the previous example, it is not suf- 
ficient to show that the conclusion “m+n is even” is true for m = 14 and n = 6. 
You must give an argument to show that the conclusion is true for any arbitrarily 
chosen even integers m and n. 


2. Using the same letter to mean two different things. 
Some beginning theorem provers give a new variable quantity the same letter name 
as a previously introduced variable. Consider the following “proof” fragment: 


Suppose m and n are any odd integers. Then by definition of odd, 
m = 2k+ l andn = 2k + 1 where k is an integer. 


You might think of a variable in a mathematical proof as similar to a global variable 
in a computer program: once introduced, it has the same meaning throughout the pro- 
gram. In other words, its scope extends to the end of the program. In this example, 
using the symbol k in the expressions for both m and n makes k a global variable. As 
a result, both m and n equal 2k + 1, and thus are equal to each other. The proof then 
only shows that a sum of two identical odd integers is even, not that the sum of two 
arbitrarily chosen odd integers is even. 


3. Jumping to a conclusion. 
To jump to a conclusion means to allege the truth of something without giving an 
adequate reason. Consider the following “proof” that the sum of any two even integers 
is even. 


Suppose m and n are any even integers. By definition of even, 
m = 2randn = 2s for some integers r and s. Then m+n = 2r + 2s. 
So m+ nis even. 


The problem with this “proof” is that to show an integer is even one needs to show 
that it equals twice some integer. This proof jumps to the conclusion that m + n is even 
without having found an integer that, when doubled, equals m + n. 


4. Assuming what is to be proved. 
To assume what is to be proved is a variation of jumping to a conclusion. As an 
example, consider the following “proof” of the fact that the product of any two odd 
integers is odd: 


Suppose m and n are any odd integers. When any odd integers are 
multiplied, their product is odd. Hence mn is odd. 
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Note At this point in the 
proof, we do not know 
whether there is an inte- 
ger s making this equality 
true. 


A 


Caution! A reader of 
your proof should never 
be in doubt about what 
you have already shown 
and what still needs to 
be shown. 
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5. Confusion between what is known and what is still to be shown. 


A more subtle way to jump to a conclusion occurs when the conclusion is restated 
using a variable. Here is an example in a “proof” that the product of any two odd inte- 
gers is odd: 


Suppose m and n are any odd integers. We must show that mn is 


odd. This means that there exists an integer s such that 


mn = 2s+ 1, 


Also by definition of odd, there exist integers a and b such that 
m= 2a+ l andn = 2b+1. 
Then 


l 


mn = (2a+1)2b+1) = 2s+1. 


So, since s is an integer, mn is odd by definition of odd. 

In this example, when the author restated the conclusion to be shown (that mn is 
odd), the author wrote “there exists an integer s such that mn = 2s + 1.” But we only 
know that the integer s exists if we know that mn is odd, which is what the author is 
trying to show. Thus, in the sentence starting with the word “Then,” the author jumped 
to an unjustified conclusion. This mistake might have been avoided if the author had 
written 


“This means we must show that there exists an integer s such that mn = 2s + 1.” 


An even better way to avoid this kind of error is not to introduce a variable into a proof 
unless it is either part of the hypothesis or deducible from it. 


. Use of any when the correct word is some. 


There are a few situations in which the words any and some can be used interchange- 
ably. For instance, in starting a proof that the square of any odd integer is odd, one could 
correctly write, “Suppose m is any odd integer” or “Suppose m is some odd integer.” In 
most situations, however, the words any and some are not interchangeable. Here is the 
start of a “proof” that the square of any odd integer is odd, which uses any when the 
correct word is some: 


Suppose m is a particular but arbitrarily chosen odd integer. 
By definition of odd, m = 2a + 1 for any integer a. 


In the second sentence it is incorrect to say that “m = 2a + 1 for any integer a” because 
a cannot be just “any” integer; in fact, solving m = 2a + 1 for a shows that the only 
possible value for a is (m—1)/2. The correct way to finish the second sentence is, 
“m = 2a + 1 for some integer a” or “there exists an integer a such that m = 2a + 1.” 


. Misuse of the word if. 


Another common error is not serious in itself, but it reflects imprecise thinking that 
sometimes leads to problems later in a proof. This error involves using the word if when 
the word because is really meant. Consider the following proof fragment: 


Suppose p is a prime number. If p is prime, then p cannot be written 
as a product of two smaller positive integers. 


The use of the word if in the second sentence is inappropriate. It suggests that the 
primeness of p is in doubt. But p is known to be prime by the first sentence. It cannot 
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be written as a product of two smaller positive integers because it is prime. Here is a 
correct version of the fragment: 


Suppose p is a prime number. Because p is prime, p cannot be written 
as a product of two smaller positive integers. 


An Odd Integer Minus an Even Integer 


Prove that the difference of any odd integer and any even integer is odd. Use only the defi- 
nitions of odd and even and the Assumptions listed on page 161, not any other properties 
of odd and even integers. Follow the directions given in this section for writing proofs of 
universal statements. 


Solution 

You may already have a sense that the statement to be proved is true, but to make sure your 

intuition is correct and to develop a careful proof, rewrite the statement using names such as 

a and b for the odd and even integers so that you will have a convenient way to refer to them: 
For all integers a and b, if a is odd and b is even, then a — b is odd. 


Or: For every odd integer a and every even integer b, the difference a — b is odd. 
Or: If a is any odd integer and b is any even integer, then a — b is odd. 


Thus the starting point for your proof would be something like, “Suppose a is any odd in- 
teger and b is any even integer,’ and the conclusion to be shown would be “We must show that 
a— bis odd.” If, in addition, you know how to use the definitions of odd and even, you will have 
reduced the creative part of developing the proof to a small, but crucial, section in the middle. 


Theorem 4.2.1 


The difference of any odd integer and any even integer is odd. 


Proof: 


1. Suppose a is any odd integer and b is any even integer. [We must show that a—b 
is odd.] 


By definition of odd, a = 2r + 1 for some integer r, and b = 2s for some integer s. 
Then a-—b=(2r+1)—2s by substitution 

= 2r—2st+1 by combining like terms 

= 2(r—s)+1 by factoring out 2. 
Let t=r-s. 


Then ż is an integer because it is a difference of integers. 


So, by substitution, a — b = 2t + 1, where t is an integer. 


PO NAMWRWH 


Hence a — b is odd [as was to be shown]. 


Note that lines 1-3 follow immediately from the general structure of the proof, the defini- 
tions of odd and even, and substitution. In order to figure out your next steps, it can be helpful 
to refer to what must be shown—namely, that a — b is odd. According to the definition of 
odd, you can conclude that a — b is odd if you can show that it equals 2-(some integer) + 1. 
So showing that a — b is odd involves transforming (2r + 1) — 2s into 2-(some integer) + 1. 
Lines 4—8 show the steps for doing this, and line 9 concludes that what was to be shown has 
been achieved. a 
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Some of the exercises at the end of the section are based on actual student work and ask 
you to identify mistakes in “proofs” that have been proposed. Example 4.2.2 illustrates the 
kind of care that must be taken in evaluating a proof. 


Identifying a Mistake in a Proposed Proof 


Find the mistake in the following “proof.” 
Theorem: If n is any even integer, then (—1)" = 1. 


Proof: 


1. Suppose n is any even integer. [We must show that (—1)" is even.] 
2. By definition of even, n = 2a for some integer a. 

3. Then (-1)" = (-1)”4 by substitution 

4 = ((-1)° by a law of exponents 

5 


=1 because any nonzero real number squared is positive. 


Solution 


This “proof” incorrectly jumps to a conclusion in line 5. Although it is true that the square 
of any nonzero real number is positive, it does not follow that the square of (—1)* is 1. 
Exercise 10 at the end of this section asks you to give a correct proof of this theorem. E 


Showing That an Existential Statement Is False 


Recall that the negation of an existential statement is universal. It follows that to prove an 
existential statement is false, you must prove a universal statement (its negation) is true. 


Disproving an Existential Statement 
Show that the following statement is false: 
There is a positive integer n such that n’ +3n+2is prime. 
Solution Proving that the given statement is false is equivalent to proving its negation is 
true. The negation is 
For all positive integers n, n’ +3n+2 is not prime. 


Because the negation is universal, it is proved by generalizing from the generic particular. 
Claim: The statement “There is a positive integer n such that n’ +3n+2is prime” is false. 
Proof: 


Suppose n is any [particular but arbitrarily chosen] positive integer. [We will show that 
n? + 3n + 2 is not prime.] Factoring shows that 


n?+3n+2 =(nt+1\(n+2). 


In addition, n+ 1 and n+2 are integers (because they are sums of integers), and both 
n+1>1andn+2>1 (because n= 1). Thus n?+3n+2 is a product of two integers 
each greater than 1, and so n’ +3n+2 is not prime. E 


Conjecture, Proof, and Disproof 


More than 350 years ago, the French mathematician Pierre de Fermat claimed that it is 
impossible to find positive integers x, y, and z with x” +y” = z” if n is an integer that is 
at least 3. (For n = 2, the equation has many integer solutions, such as 3? +4 = 5° and 
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5? + 12? = 13°.) Fermat wrote his claim in the margin of a book, along with the comment 
“I have discovered a truly remarkable PROOF of this theorem which this margin is too 
small to contain.” No proof, however, was found among his papers, and over the years 
some of the greatest mathematical minds tried and failed to discover a proof or a counter- 
example for what came to be known as Fermat’s last theorem. 

In 1986 Kenneth Ribet of the University of California at Berkeley showed that if a cer- 
tain other statement, the Taniyama—Shimura conjecture, could be proved, then Fermat’s 
theorem would follow. Andrew Wiles, an English mathematician and faculty member at 
Princeton University, had become intrigued by Fermat’s claim while still a child and, as 
an adult, had come to work in the branch of mathematics to which the Taniyama—Shimura 
conjecture belonged. As soon as he heard of Ribet’s result, Wiles immediately set to work 
to prove the conjecture. In June of 1993, after 7 years of concentrated effort, he presented 
a proof to worldwide acclaim. 

During the summer of 1993, however, while the proof was being carefully checked 
to prepare for publication, Wiles found a step he had difficulty justifying and which he 
ultimately realized was an error. Having worked alone for so long, he decided to call on 
a former student, Richard Taylor, then at Cambridge University in England, who agreed 
to join him in Princeton, and, together, they worked ceaselessly for months to resolve the 
problem. After almost a year without a breakthrough, Taylor encouraged Wiles to revisit 
an approach that had been abandoned years earlier, and, as Wiles examined the details, 
he suddenly saw that the reason it had failed was the exact reason another approach he 
had previously abandoned would succeed. By the end of 1994, the revised proof had been 
thoroughly checked and pronounced correct by experts in the field. It was published in the 
Annals of Mathematics in 1995. Several books and an excellent documentary have been 
produced that convey the drama and excitement of the discovery.* 

One of the oldest problems in mathematics that remains unsolved is the Gold- 
bach conjecture. In Example 4.1.5 it was shown that every even integer from 4 to 
fe SoS Sten Sas 26 can be represented as a sum of two prime numbers. More than 250 years ago, 
Andrew Wiles Christian Goldbach (1690-1764) conjectured that every even integer greater than 2 
(born 1953) can be so represented. Explicit computer-aided calculations have shown the conjec- 
ture to be true up to at least 10'8. But there is a huge chasm between 10'8 and infin- 
ity. As pointed out by James Gleick of the New York Times, many other plausible 
conjectures in number theory have proved false. Leonhard Euler (1707—1783), for 
example, proposed in the eighteenth century that a +b +c = d had no nontrivial 
whole number solutions. In other words, no three perfect fourth powers add up to an- 
other perfect fourth power. For many numbers, Euler’s conjecture looked good. But in 
1987 a Harvard mathematician, Noam Elkies, proved it wrong. One counterexample, 
found by Roger Frye of Thinking Machines Corporation in a long computer search, is 
95,800* + 217,5194 + 414,560 = 422,4814 İ 

In May 2000, “to celebrate mathematics in the new millennium,” the Clay Mathematics 
Institute of Cambridge, Massachusetts, announced that it would award prizes of $1 mil- 
lion each for the solutions to seven longstanding, classical mathematical questions. One of 
them, “P vs. NP,” asks whether problems belonging to a certain class can be solved on a 
computer using more efficient methods than the very inefficient methods that are presently 
known to work for them. This question is discussed briefly at the end of Chapter 11. 


Bettmann/Getty Images 


Pierre de Fermat 
(1601-1665) 


AP Images/Charles Rex Arbogast 


““The Proof,” produced in 1997, for the series Nova on the Public Broadcasting System; Fermat’s Enigma: 
The Epic Quest to Solve the World’s Greatest Mathematical Problem, by Simon Singh and John Lynch (New 
York: Bantam Books, 1998); Fermat’s Last Theorem: Unlocking the Secret of an Ancient Mathematical Prob- 
lem by Amir D. Aczel (New York: Delacorte Press, 1997). 


‘James Gleick, “Fermat’s Last Theorem Still Has Zero Solutions,” New York Times, April 17, 1988. 
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TEST YOURSELF 


1. The meaning of every variable used in a proof 
should be explained within 


2. Proofs should be written in sentences that are 
and 


3. Every assertion in a proof should be supported by 
a 


4. The following are some useful “little words and 
phrases” that clarify the reasoning in a proof: 
, , , , and 


EXERCISE SET 4.2 


5. A new thought or fact that does not follow as an 
immediate consequence of the preceding state- 
ment can be introduced by writing ____, 

, , , Or 


6. To introduce a new variable that is defined in 
terms of previous variables, use the word 


7. Displaying equations and inequalities increases 
the—— of a proof. 


8. Some proof-writing mistakes are ____, _____, 
, , , , and 


Prove the statements in 1-11. In each case use only the 
definitions of the terms and the Assumptions listed on 
page 161, not any previously established properties of 
odd and even integers. Follow the directions given in this 
section for writing proofs of universal statements. 


1. For every integer n, if n is odd then 3n + 5 is even. 


2. For every integer m, if m is even then 3m + 5 is 
odd. 


3. For every integer n, 2n — 1 is odd. 


4. Theorem 4.2.2: The difference of any even inte- 
ger minus any odd integer is odd. 


5. Ifa and b are any odd integers, then a+b’ is 
even. 


6. If kis any odd integer and m is any even integer, 
then K? + m? is odd. 


7. The difference between the squares of any two 
consecutive integers is odd. 


8. For any integers m and n, if m is even and n is odd 
then 5m + 3n is odd. 


9. If an integer greater than 4 is a perfect square, then 
the immediately preceding integer is not prime. 


10. If nis any even integer, then (—1)”" = 1. 


11. If mis any odd integer, then (—1)” = —1. 
Prove that the statements in 12-14 are false. 


12. There exists an integer m = 3 such that m — Lis 
prime. 


13. There exists an integer n such that 6n” + 27 is 
prime. 


14. There exists an integer k = 4 such that 
2k? — 5k +2 is prime. 
Find the mistakes in the “proofs” shown in 15-19. 


15. Theorem: For every integer k, if k > 0 then 
K +2k+ Lis composite. 


“Proof: For k = 2,k > Oandk? +2k+1 = 
2°+2-2+1 =9. And since 9 = 3:3, 
then 9 is composite. Hence the theorem is true.” 


16. Theorem: The difference between any odd integer 
and any even integer is odd. 


“Proof: Suppose n is any odd integer, and m is 
any even integer. By definition of odd, n = 2k + 1 
where k is an integer, and by definition of even, 
m = 2k where k is an integer. Then 


n—m = (2k+1)—2k=1, 


and 1 is odd. Therefore, the difference between 
any odd integer and any even integer is odd.” 


17. Theorem: For every integer k, if k > 0 then 
K +2k+ Lis composite. 


“Proof: Suppose k is any integer such that k > 0. 
If K? + 2k+ 1 is composite, then K + 2k+ 1 = rs 
for some integers r and s such that 


1<r<k4+2k+1 
and l<s<R4+2k4+1. 
Since R+2k+1=rs 


and both r and s are strictly between 1 and 
K +2k+ 1, then K +2k+ 1 is not prime. Hence 
K +2k+1 is composite as was to be shown.” 
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18. Theorem: The product of any even integer and 
any odd integer is even. 


“Proof: Suppose m is any even integer and n is 
any odd integer. If m-n is even, then by defini- 
tion of even there exists an integer r such that 

m:n = 2r. Also since m is even, there exists an in- 
teger p such that m = 2p, and since n is odd there 
exists an integer q such that n = 2q + 1. Thus 


mn = (2p)(2q + 1) = 2r, 


where r is an integer. By definition of even, then, 
m'n is even, as was to be shown.” 


19. Theorem: The sum of any two even integers 
equals 4k for some integer k. 


“Proof: Suppose m and n are any two even integers. 


By definition of even, m = 2k for some integer k 
and n = 2k for some integer k. By substitution, 


m+n = 2k+2k = 4k. 


This is what was to be shown.” 


In 20-38 determine whether the statement is true or 
false. Justify your answer with a proof or a counterexam- 
ple, as appropriate. In each case use only the definitions 
of the terms and the Assumptions listed on page 161, not 
any previously established properties. 


20. The product of any two odd integers is odd. 
21. The negative of any odd integer is odd. 
22. For all integers a and b, 4a + 5b + 3 is even. 


23. The product of any even integer and any integer is 
even. 


24. If a sum of two integers is even, then one of the 
summands is even. (In the expression a + b, a and 
b are called summands.) 


25. The difference of any two even integers is even. 


26. For all integers a, b, and c, if a, b, and c are con- 
secutive, then a + b + c is even. 


ANSWERS FOR TEST YOURSELF 


27. The difference of any two odd integers is even. 


H 28. For all integers n and m, if n — m is even then 


303. 
n° — m” is even. 
29. For every integer n, if n is prime then (—1)" = —1. 


30. For every integer m, if m > 2 then m —A4is 
composite. 


31. For every integer n, n—n+llisa prime number. 


32. For every integer n, 4n? +n+1)—3n’ isa perfect 
square. 


33. Every positive integer can be expressed as a sum 
of three or fewer perfect squares. 


H* 34. (Two integers are consecutive if, and only if, one 


is one more than the other.) Any product of four 
consecutive integers is one less than a perfect 
square. 


35. If mand n are any positive integers and mn is a 
perfect square, then m and n are perfect squares. 


36. The difference of the squares of any two consecu- 
tive integers is odd. 


H 37. For all nonnegative real numbers a and b, 


Vab = Vavb. (Note that if x is a nonnegative 
real number, then there is a unique nonnegative 
real number y, denoted Vx, such that y = x.) 


38. For all nonnegative real numbers a and b, 
Va+b = Va+ Vb. 


39. Suppose that integers m and n are perfect squares. 
Then m + n + 2V mn is also a perfect square. 
Why? 


H* 40. If p is a prime number, must 2” — 1 also be prime? 


Prove or give a counterexample. 


* 41. Ifnis a nonnegative integer, must 2%” + 1 be 


prime? Prove or give a counterexample. 


1. the body of the proof (or: the proof itself) 2. complete; 
grammatically correct 3. reason 4. Because; Since; 
Then; Thus; So; Hence; Therefore, Consequently; It follows 
that; By substitution 5. Observe that; Note that; Recall 
that; But; Now 6.Let 7. readability 8. Arguing from 
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examples; Using the same letter to mean two different 
things; Jumping to a conclusion; Assuming what is to be 
proved; Confusion between what is known and what is still 
to be shown; Use of any when the correct word is some; 
Misuse of the word if 
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ZX Direct Proof and Counterexample III: 
Rational Numbers 


Such, then, is the whole art of convincing. It is contained in two principles: to define 
all notations used, and to prove everything by replacing mentally the defined terms by 
their definitions. —Blaise Pascal, 1623-1662 


Sums, differences, and products of integers are integers. But most quotients of integers 
are not integers. Quotients of integers are, however, important; they are known as rational 
numbers. 


Definition 


A real number r is rational if, and only if, it can be expressed as a quotient of two 
integers with a nonzero denominator. A real number that is not rational is irrational. 
More formally, if r is a real number, then 


r is rational + J integers a and b such that r = ; and b # 0. 


The word rational contains the word ratio, which is another word for quotient. A rational 
number can be written as a ratio of integers. 


Determining Whether Numbers Are Rational or Irrational 


a. Is 10/3 a rational number? 
b. Is— 5 a rational number? 

c. Is 0.281 a rational number? 
d. Is 7 a rational number? 

e. Is 0 a rational number? 

f. Is 2/0 a rational number? 

g. Is 2/0 an irrational number? 
h 


. Is 0.12121212... a rational number (where the digits 12 are assumed to repeat 
forever)? 


i. If mand n are integers and neither m nor n is zero, is (m + n)/mn a rational number? 
Solution 

a. Yes, 10/3 is a quotient of the integers 10 and 3 and hence is rational. 

b. Yes, -5 = za, which is a quotient of the integers —5 and 39 and hence is rational. 


c. Yes, 0.281 = 281/1000. Note that the numbers shown on a typical calculator display 
are all finite decimals. An explanation similar to the one in this example shows that 
any such number is rational. It follows that a calculator with such a display can accu- 
rately represent only rational numbers. 


d. Yes, 7 = 7/1. 
e. Yes, 0 = 0/1. 


f. No, 2/0 is not a real number (division by 0 is not allowed). 
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g. No, because every irrational number is a real number, and 2/0 is not a real number. 
We discuss additional techniques for determining whether numbers are irrational in 
Sections 4.7, 4.8, and 7.4. 


h. Yes. Let x = 0.12121212... Then 100x = 12.12121212... Thus 


100x — x = 12.12121212... —0.12121212... = 12. 
But also 100x — x = 99x by basic algebra. 
Hence 99x = 12, 
12 
and so x= —. 
99 
Therefore, 0.12121212... = 12/99, which is a ratio of two nonzero integers and thus 


is a rational number. 

Note that you can use an argument similar to this one to show that any repeating 
decimal is a rational number. In Section 9.4 we show that any rational number can be 
written as a repeating or terminating decimal. 


i. Yes, since m and n are integers, so are m+n and mn (because sums and products of 
integers are integers). Also mn # 0 by the zero product property. One version of this 
property says the following: 


Zero Product Property 


If neither of two real numbers is zero, then their product is also not zero. 


(See Theorem T11 in Appendix A and exercise 8 at the end of this section.) It follows 
that (m + n)/mn is a quotient of two integers with a nonzero denominator and hence is 
a rational number. a 


More on Generalizing from the Generic Particular 


If you claim a property holds for all elements in a domain, then someone can challenge 
your claim by picking any element in the domain and asking you to prove that that element 
satisfies the property. To prove your claim, you must be able to meet all such challenges. 
In other words, you must have a way to convince the challenger that the property is true for 
an arbitrarily chosen element in the domain. 

For example, suppose “A” claims that every integer is a rational number. “B” challenges 
this claim by asking “A” to prove it for n = 7. “A” observes that 


7 
T= I which is a quotient of integers and hence rational. 
“B” accepts this explanation but challenges again with n = — 12. “A” responds that 
-12 = EE which is a quotient of integers and hence rational. 


Next “B” tries to trip up “A” by challenging with n = 0, but “A” answers that 


0= I which is a quotient of integers and hence rational. 
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As you can see, “A” is able to respond effectively to all “B’’s challenges because “A” has a 
general procedure for putting integers into the form of rational numbers: “A” just divides 
whatever integer “B” gives by 1. That is, no matter what integer n “B” gives “A”, “A” writes 


n= I which is a quotient of integers and hence rational. 


This discussion is an informal proof for the following theorem. 


Theorem 4.3.1 


Every integer is a rational number. 


In exercise 11 at the end of this section you are asked to condense the above discussion into 
a formal proof. 


Proving Properties of Rational Numbers 
The next example shows how to use the method of generalizing from the generic particular 
to prove a property of rational numbers. 
SEWE EWA Any Sum of Rational Numbers Is Rational 
Prove that the sum of any two rational numbers is rational. 


Solution Begin by mentally or explicitly rewriting the statement to be proved in the form 
“7 „if then a 


Formal Restatement: V real numbers r and s, if r and s are rational then r+ s is rational. 


Next ask yourself, “Where am I starting from?” or “What am I supposing?” The answer 
gives you the starting point, or first sentence, of the proof. 


Starting Point: Suppose r and s are any particular but arbitrarily chosen real numbers such 
that r and s are rational; or, more simply, 
Suppose r and s are any rational numbers. 
Then ask yourself, “What must I show to complete the proof?” 
To Show: r+ is rational. 
Finally ask, “How do I get from the starting point to the conclusion?” or “Why must r+ s 
be rational if both r and s are rational?” The answer depends in an essential way on the 


definition of rational. 
Rational numbers are quotients of integers, so to say that r and s are rational means that 


and s= 2 for some integers a, b, c, and d 
where b # 0 and d # 0. 


r= 


= 
b 


It follows by substitution that 


tsa 545 (4.3.1) 
rvs b a os 
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You need to show that r + s is rational, which means that r+ s can be written as a single 
fraction or ratio of two integers with a nonzero denominator. But the right-hand side of 
equation (4.3.1) is 


_ ad be by rewriting the fraction with a common 


a C 
b + d bd bd denominator 


_ ab+be by adding fractions with a 
= bd common denominator. 


Is this fraction a ratio of integers? Yes. Because products and sums of integers are integers, 
ad + bc and bd are both integers. Is the denominator bd # 0? Yes, by the zero product 
property (since b + 0 and d # 0). Thus r+ s is a rational number. 

This discussion is summarized as follows: 


Theorem 4.3.2 


The sum of any two rational numbers is rational. 


Proof: Suppose r and s are any rational numbers. [We must show that r + s is rational.] 
Then, by definition of rational, r = a/b and s = c/d for some integers a, b, c, and d 
with b # 0 and d + 0. Thus 


ac 
rts= a +— by substitution 


d 
_ad+bc 


by basic algebra. 
bd y g 


Let p = ad + bc and q = bd. Then p and q are integers because products and sums 
of integers are integers and because a, b, c, and d are all integers. Also q # 0 by the 
zero product property. Thus 


rt+s= where p and q are integers and q # 0. 


Therefore, r + s is rational by definition of a rational number fas was to be shown]. 


Deriving New Mathematics from Old 


Section 4.1 focused on establishing truth and falsity of mathematical theorems using only the 
basic algebra normally taught in secondary school; the fact that the integers are closed under 
addition, subtraction, and multiplication; and the definitions of the terms in the theorems 
themselves. In the future, when we ask you to prove something directly from the defini- 
tions, we will mean that you should restrict yourself to this approach. However, once a col- 
lection of statements has been proved directly from the definitions, another method of proof 
becomes possible. The statements in the collection can be used to derive additional results. 


sole Deriving Additional Results about Even and Odd Integers 
Suppose that you have already proved the following properties of even and odd integers: 
1. The sum, product, and difference of any two even integers are even. 


2. The sum and difference of any two odd integers are even. 
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The product of any two odd integers is odd. 
. The product of any even integer and any odd integer is even. 
The sum of any odd integer and any even integer is odd. 


. The difference of any odd integer minus any even integer is odd. 


IY DW w 


The difference of any even integer minus any odd integer is odd. 


Use the properties listed above to prove that if a is any even integer and b is any odd inte- 
ger, then wie tt is an integer. 

Solution Suppose a is any even integer and b is any odd integer. By property 3, b’ is 
odd, and by property 1, a’ is even. Then by property 5, a’ +b’ is odd, and because 1 is 
also odd, the sum (a? + b°) + 1 = a’ +b’ +1 is even by property 2. Hence, by definition 
of even, there exists an integer k such that a+b? +1 =2k. Dividing both sides by 2 gives 


pi es 2 2 
is ote ae | f . i Pete De ae ds p 
= = k, which is an integer. Thus = is an integer /as was to be shown]. E 


A corollary is a statement whose truth can be immediately deduced from a theorem 
that has already been proved. 


The Double of a Rational Number 


Derive the following as a corollary of Theorem 4.3.2. 


Corollary 4.2.3 


The double of a rational number is rational. 


Solution The double of a number is just its sum with itself. But since the sum of any 
two rational numbers is rational (Theorem 4.3.2), the sum of a rational number with itself 
is rational. Hence the double of a rational number is rational. Here is a formal version of 
this argument: 


Proof: Suppose r is any rational number. Then 2r = r + r is a sum of two rational numbers. 
So, by Theorem 4.3.2, 2r is rational. E 


TEST YOURSELF 


1. To show that a real number is rational, we must 2. An irrational number is a that is 
show that we can write it as 


3. Zero is a rational number because 


EXERCISE SET 4.3 


The numbers in 1-7 are all rational. Write each number as 6. 320.5492492492... 
a ratio of two integers. 
35 4 2 7. 52.4672167216721... 
I == 2. 4.6037 Loto 
6 > 9 8. The zero product property, says that if a product 
4. 0.37373737... of two real numbers is 0, then one of the numbers 
5. 0.56565656... must be 0. 
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a. Write this property formally using quantifiers 
and variables. 


b. Write the contrapositive of your answer to part (a). 


c. Write an informal version (without quantifier 


symbols or variables) for your answer to part (b). 


9. Assume that a and b are both integers and that 
a # Oand b + 0. Explain why (b — a)/(ab’) must 
be a rational number. 


10. Assume that m and n are both integers and that 
n + 0. Explain why (5m — 12n)/(4n) must be a 
rational number. 


11. Prove that every integer is a rational number. 


12. Let S be the statement “The square of any rational 
number is rational.” A formal version of S is “For 
every rational number r, r? is rational.” Fill in the 
blanks in the proof for S. 


Proof: Suppose that r is _@) By definition of 
rational, r = a/b for some Œ) with b #0. By 
substitution, 


r=_© = a’/b?. 


Since a and b are both integers, so are the prod- 
ucts a° and @)_. Also b° # 0 by the ©. 
Hence 7° is a ratio of two integers with a non- 
zero denominator, and so by definition of 
rational. 


13. Consider the following statement: The negative of 
any rational number is rational. 
a. Write the statement formally using a quantifier 
and a variable. 
b. Determine whether the statement is true or 
false and justify your answer. 


14. Consider the statement: The cube of any rational 
number is a rational number. 
a. Write the statement formally using a quantifier 
and a variable. 
b. Determine whether the statement is true or 
false and justify your answer. 
Determine which of the statements in 15-19 are true 
and which are false. Prove each true statement directly 
from the definitions, and give a counterexample for each 
false statement. For a statement that is false, determine 
whether a small change would make it true. If so, make 
the change and prove the new statement. Follow the 
directions for writing proofs on page 173. 


15. The product of any two rational numbers is a 
rational number. 
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H 16. The quotient of any two rational numbers is a 


rational number. 


H 17. The difference of any two rational numbers is a 


rational number. 
FFS 


H 18. Ifr and s are any two rational numbers, then —— 


is rational. 


H 19. For all real numbers a and b, if a < b then 


a<% <ph. 


(You may use the properties of inequalities in 
T17-T27 of Appendix A.) 


20. Use the results of exercises 18 and 19 to prove 
that given any two rational numbers r and s with 
r < s, there is another rational number between r 
and s. An important consequence is that there are 
infinitely many rational numbers in between any 
two distinct rational numbers. See Section 7.4. 


Use the properties of even and odd integers that are 
listed in Example 4.3.3 to do exercises 21-23. Indicate 
which properties you use to justify your reasoning. 


21. True or false? If m is any even integer and n is any 
odd integer, then m + 3n is odd. Explain. 


22. True or false? If a is any odd integer, then a° + a is 
even. Explain. 


23. True or false? If k is any even integer and m is any 
odd integer, then (k + a) —(m— D? is even. Explain. 


Derive the statements in 24-26 as corollaries of 
Theorems 4.3.1, 4.3.2, and the results of exercises 12, 13, 
14, 15, and 17. 


24. For any rational numbers r and s, 2r + 3s is rational. 


25. If ris any rational number, then 3 —2r+4is 
rational. 


26. For any rational number s, 55° + 8s” — 7 is rational. 

27. Itis a fact that if n is any nonnegative integer, then 
1-(1/2*?) 

EEEH 


(A more general form of this statement is proved in 
Section 5.2.) Is the right-hand side of this equation 
rational? If so, express it as a ratio of two integers. 


at P ee 
2 22 3 on 


28. Suppose a, b, c, and d are integers and a # c. Sup- 
pose also that x is a real number that satisfies the 
equation 

axtb _ 


cxtd 
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Must x be rational? If so, express x as a ratio of 
two integers. 


* 29. Suppose a, b, and c are integers and x, y, and z are 
nonzero real numbers that satisfy the following 
equations: 

XZ yz 


=a and =b and ——=c. 
x+y XZ ytz 


Is x rational? If so, express it as ratio of two integers. 


30. Prove that if one solution for a quadratic equa- 
tion of the form x? + bx + c = 0 is rational 
(where b and c are rational), then the other 
solution is also rational. (Use the fact that if 
the solutions of the equation are r and s, then 
xX +bx+c = (x — r)(x —S).) 


31. Prove that if a real number c satisfies a polyno- 
mial equation of the form 
rx? + rx +ryxt+ro = 0, 


where ro, r1, r2, and r3 are rational numbers, then 
c satisfies an equation of the form 


m + m +nıx +n = 0, 


where nọ, n1, 12, and n, are integers. 


Definition: A number c is called a root of a polynomial 
p(x) if, and only if, p(c) = 0. 


* 32. Prove that for every real number c, if c is a root of 
a polynomial with rational coefficients, then c is a 
root of a polynomial with integer coefficients. 


Use the properties of even and odd integers that are 
listed in Example 4.3.3 to do exercises 33 and 34. 


33. When expressions of the form (x — r)(x — s) are 
multiplied out, a quadratic polynomial is obtained. 
For instance, (x — 2)(x — (—7)) = (x— 2x +7) = 
x +5x— 14. 

H a. What can be said about the coefficients of 
the polynomial obtained by multiplying out 
(x — r)(x — s) when both r and s are odd integers? 
When both r and s are even integers? When one 
of r and s is even and the other is odd? 

b. It follows from part (a) that xX — 1253x +255 

cannot be written as a product of two polynomials 
with integer coefficients. Explain why this is so. 


* 34. Observe that 
(x — r)(x — s)\(x — À 
= - (r+s+ px + (rs + rs + st)x — rst. 


a. Derive a result for cubic polynomials similar 
to the result in part (a) of exercise 33 for 
quadratic polynomials. 

b. Can x? + 7x" — 8x — 27 be written as a product 
of three polynomials with integer coefficients? 
Explain. 


In 35-39 find the mistakes in the “proofs” that the sum of 
any two rational numbers is a rational number. 


35. “Proof: Any two rational numbers produce a 
rational number when added together. So if r and 
s are particular but arbitrarily chosen rational 
numbers, then r+ s is rational.” 


36. “Proof: Let rational numbers r= 1 and s = + be 
given. Thenr+s = i+ = 7, which is a rational 
number. This is what was to be shown.” 


37. “Proof: Suppose r and s are rational numbers. By 
definition of rational, r = a/b for some integers 
aand b with b # 0, and s = a/b for some integers 
a and b with b # 0. Then 

a a 2a 

ress StS 

b b b 
Let p = 2a. Then p is an integer since it is a prod- 
uct of integers. Hence r + s = p/b, where p and b 
are integers and b # 0. Thus r + s is a rational 
number by definition of rational. This is what was 
to be shown.” 


38. “Proof: Suppose r and s are rational numbers. 
Then r = a/b and s = c/d for some integers a, 
b, c, and d with b + 0 and d # 0 (by definition of 
rational). Then 

r+s= = + = 
b d 

But this is a sum of two fractions, which is a frac- 
tion. So r— s is a rational number since a rational 
number is a fraction.” 


39. “Proof: Suppose r and s are rational numbers. 
If r+ s is rational, then by definition of ratio- 
nal r +s = a/b for some integers a and b with 
b + 0. Also since r and s are rational, r = i/j and 
s = m/n for some integers i, j, m, and n with j # 0 
and n # 0. It follows that 


im a 

r+s=-+—=-, 

jon b 
which is a quotient of two integers with a nonzero 
denominator. Hence it is a rational number. This is 


what was to be shown.” 
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ANSWERS FOR TEST YOURSELF 


1. a ratio of integers with a nonzero denominator 2. real number; not rational 3.0 = I 


EJ Direct Proof and Counterexample IV: Divisibility 
The essential quality of a proof is to compel belief. —Pierre de Fermat 
When you were first introduced to the concept of division in elementary school, you were 


probably taught that 12 divided by 3 is 4 because if you separate 12 objects into groups of 
3, you get 4 groups with nothing left over. 


XXX XXX XXX XXX 


You may also have been taught to describe this fact by saying that “12 is evenly divisible 
by 3” or “3 divides 12 evenly.” 

The notion of divisibility is the central concept of one of the most beautiful subjects in 
advanced mathematics: number theory, the study of properties of integers. 


Definition 


If n and d are integers then 


n is divisible by d if, and only if, n equals d times some integer and d # 0. 
Note According to the 
definition of divisibility Instead of “n is divisible by d,” we can say that 
if you know that n and d 
are any integers such that 
d divides n, then you may 
assume that d is not equal 
to zero. 


n is a multiple of d, or 
dis a factor of n, or 
dis a divisor of n, or 
d divides n. 


The notation d|n is read “d divides n.” Symbolically, if n and d are integers: 
d |n <> Jan integer, say k, such that n = dk and d # 0. 


The notation d/n is read “d does not divide n.” 


Divisibility 


a. Is 21 divisible by 3? b. Does 5 divide 40? c. Does 7|42? 
d. Is 32 a multiple of — 16? e. Is 6 a factor of 54? f. Is 7 a factor of —7? 


Solution 
a. Yes, 21 = 3-7. b. Yes, 40 = 5-8. c. Yes, 42 = 7-6. 
d. Yes, 32 = (—16)-(—2). e. Yes, 54 = 6-9. f. Yes, -7 = 7-(-1). E 


SCW YWA Divisors of Zero 


If k is any nonzero integer, does k divide 0? 


Solution Yes, because 0 = k-0. B 
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Two useful properties of divisibility are (1) that if one positive integer divides a second 
positive integer, then the first is less than or equal to the second, and (2) that the only divi- 
sors of | are 1 and —1. 


Theorem 4.4.1 A Positive Divisor of a Positive Integer 


For all integers a and b, if a and b are positive and a divides b then a = b. 


Proof: Suppose a and b are any positive integers such that a divides b. [We must show 
that a = b.] By definition of divisibility, there exists an integer k so that b = ak. By 
property T25 of Appendix A, k must be positive because both a and b are positive. 
It follows that 


lsk 


because every positive integer is greater than or equal to 1. Multiplying both sides 
by a gives 


a< ka= b 


because multiplying both sides of an inequality by a positive number preserves the 
inequality by property T20 of Appendix A. Thus a = b [as was to be shown]. 


Theorem 4.4.2 Divisors of 1 


The only divisors of 1 are 1 and —1. 


Proof: Since 1-1 = 1 and (—1)(—1) = 1, both 1 and —1 are divisors of 1. Now sup- 
pose m is any integer that divides 1. Then there exists an integer n such that 1 = mn. 
By Theorem T25 in Appendix A, either both m and n are positive or both m and n 
are negative. If both m and n are positive, then m is a positive integer divisor of 1. By 
Theorem 4.4.1, m <= 1, and, since the only positive integer that is less than or equal 
to 1 is 1 itself, it follows that m = 1. On the other hand, if both m and n are nega- 
tive, then, by Theorem T12 in Appendix A, (—m)(—n) = mn = 1. In this case —m 
is a positive integer divisor of 1, and so, by the same reasoning, —m = 1 and thus 
m = —1. Therefore there are only two possibilities: either m = 1 or m = —1. So the 
only divisors of 1 are 1 and — 1. 


Divisibility and Algebraic Expressions 


a. Ifa and b are integers, is 3a + 3b divisible by 3? 
b. If k and m are integers, is 10km divisible by 5? 


Solution 


a. Yes. By the distributive law of algebra, 3a + 3b = 3(a + b) and a + b is an integer 
because it is a sum of two integers. 


b. Yes. By the associative law of algebra, 10km = 5-(2km) and 2km is an integer because 
it is a product of three integers. E 
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When the definition of divides is rewritten formally using the existential quantifier, the 


result is 
2 d|n <> Jan integer k such that n = dk and d + 0. 
Note If d = 0, then 7 is 
not defined, and so it is Since the negation of an existential statement is universal, it follows that d does not divide 
certainly not an integer. n (denoted dfn) if, and only if, V integer k, n # dk or d = 0; in other words, the quotient 


n/d is not an integer. 


n 
For allintegersnandd, d Ii n= 7 is not an integer. 


Checking Nondivisibility 
Does 4| 15? 


: Solution No, B = 3.75, which is not an integer. a 


Be careful to distinguish between the notation a|b and the notation a/b. The notation 
a|b stands for the sentence “a divides b,” which means that there is an integer k such that 
b = ak. Dividing both sides by a gives b/a = k, an integer. Thus, when a 0, a|b if, and 
only if, b/a is an integer. On the other hand, the notation a/b stands for the number a/b 
which is the result of dividing a by b and which may or may not be an integer. In particular, 
since the symbol | stands for the word “divides,” be sure to avoid writing something like 


4{G+5)=4[8. 


If read out loud, this becomes “4 divides the quantity 3 plus 5 equals 4 divides 8,” which 
is nonsense. 


Caution! a|b denotes the 
sentence “a divides b,” 
whereas a/b denotes the 
number a divided by b. 


Prime Numbers and Divisibility 


An alternative way to define a prime number is to say that an integer n > 1 is prime if, and 
only if, its only positive integer divisors are | and itself. E 
Proving Properties of Divisibility 


One of the most useful properties of divisibility is that it is transitive. If one number divides 
a second and the second number divides a third, then the first number divides the third. 


Transitivity of Divisibility 


Prove that for all integers a, b, and c, if a | b and b 


c, then alc. 


Solution Since the statement to be proved is already written formally, you can immedi- 
ately pick out the starting point, or first sentence of the proof, and the conclusion that must 
be shown. 


Starting Point: Suppose a, b, and c are particular but arbitrarily chosen integers such that 
a|b and b|c. 


To Show: a|c. 


You need to show that a 


c, or, in other words, that 


c = a-(some integer). 
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But since a|b, 


> 
II 


ar for some integer r. 44.1 


And since b 


C, 
c= bs for some integer s. 4.4.2 


Equation 4.4.2 expresses c in terms of b, and equation 4.4.1 expresses b in terms of a. Thus 
if you substitute 4.4.1 into 4.4.2, you will have an equation that expresses c in terms of a. 


c = bs by equation 4.4.2 
= (ar)s by equation 4.4.1. 


But (ar)s = a(rs) by the associative law for multiplication. Hence 
c = a(rs). 


Now you are almost finished. You have expressed c as a: (something). It remains only to 
verify that that something is an integer. But of course it is, because it is a product of two 
integers. 

This discussion is summarized as follows: 


ibility 


For all integers a, b, and c, if a divides b and b divides c, then a divides c. 


Theorem 4.4.3 Transitivity of Divis 


Proof: Suppose a, b, and c are any [particular but arbitrarily chosen] integers such 
that a divides b and b divides c. [We must show that a divides c.] By definition of 
divisibility, 
b=ar and c=bs_ for some integers rand s. 
By substitution 
c=bs 


= (ar)s 


a(rs) by basic algebra. 
Let k = rs. Then kis an integer since it is a product of integers, and therefore 
c =ak where kis an integer. 


Thus a divides c by definition of divisibility. [This is what was to be shown.] 


It would appear from the definition of prime that to show that an integer is prime you 
would need to show that it is not divisible by any integer greater than 1 and less than itself. 
In fact, you need only check whether it is divisible by a prime number less than or equal 
to itself. This follows from Theorems 4.4.1, 4.4.3, and the following theorem, which says 
that any integer greater than 1 is divisible by a prime number. The idea of the proof is quite 
simple. You start with a positive integer. If it is prime, you are done; if not, it is a product of 
two smaller positive factors. If one of these is prime, you are done; if not, you can pick one 
of the factors and write it as a product of still smaller positive factors. You can continue in 
this way, factoring the factors of the number you started with, until one of them turns out to 
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be prime. This must happen eventually because all the factors can be chosen to be positive 
and each is smaller than the preceding one. 


Theorem 4.4.4 Divisibility by a Prime 


Any integer n > 1 is divisible by a prime number. 


Proof: Suppose n is a [particular but arbitrarily chosen] integer that is greater than 
1. [We must show that there is a prime number that divides n.] If n is prime, then n is 
divisible by a prime number (namely itself), and we are done. If n is not prime, then, 
as discussed in Example 4.1.2b, 


n = roso Where rp and sọ are integers and 
1 <ro< nand1 < so <n. 


It follows by definition of divisibility that rọ |n. 
If rọ is prime, then rọ is a prime number that divides n, and we are done. If rọ is 
not prime, then 


ro = 11S, Where r; and s, are integers and 
1 <r; < rgand 1 < sı <7. 


It follows by the definition of divisibility that r; |rọ. But we already know that ro|n. 
Consequently, by transitivity of divisibility, r; |n. 

If r; is prime, then r; is a prime number that divides n, and we are done. If r; is 
not prime, then 


ri = r352 where r, and s, are integers and 
l<n<randl<s<r. 


It follows by definition of divisibility that r,|r,. But we already know that r; |n. Con- 
sequently, by transitivity of divisibility, rə |n. 

If r, is prime, then r, is a prime number that divides n, and we are done. If r, is 
not prime, then we may repeat the previous process by factoring r3 as 1383. 

We may continue in this way, factoring successive factors of n until we find a 
prime factor. We must succeed in a finite number of steps because each new factor 
is both less than the previous one (which is less than n) and greater than 1, and there 
are fewer than n integers strictly between | and n.* Thus we obtain a sequence 


Fo, F], Fas... Fko 


where k= 0, 1< r< r1 < <r <r < ro<n, and r;|n for each i= 0, 
1,2,...,k. The condition for termination is that r, should be prime. Hence r, is a 
prime number that divides n. [This is what we were to show.] 


Counterexamples and Divisibility 


To show that a proposed divisibility property is not universally true, you need only find one 
pair of integers for which it is false. 


*Strictly speaking, this statement is justified by an axiom for the integers called the well-ordering principle, 
which is discussed in Section 5.4. Theorem 4.4.4 can also be proved using strong mathematical induction, as 
shown in Example 5.4.1. 
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Checking a Proposed Divisibility Property 
Is the following statement true or false? For all integers a and b, if a|b and b|a then a = b. 


Solution This statement is false. Can you think of a counterexample just by concentrat- 
ing for a minute or so? 

The following discussion describes a mental process that may take just a few seconds. It 
is helpful to be able to use it consciously, however, to solve more difficult problems. 

To discover the truth or falsity of a statement such as the one given above, start off much 
as you would if you were trying to prove it. 


Starting Point: Suppose a and b are integers such that a|b and bla. 


Ask yourself, “Must it follow that a = b, or could it happen that a # b for some a and b?” 
Focus on the supposition. What does it mean? By definition of divisibility, the conditions 
a|b and b|a mean that 


b=ha and a=kb_ for some integers h and k. 


Must it follow that a = b, or can you find integers a and b that satisfy these equations for 
which a # b? The equations imply that 


b = ha = h(kb) = (hk)b. 


Since b 


a, b # 0, and so you can cancel b from the extreme left and right sides to obtain 
1 = hk. 


In other words, h and k are divisors of 1. But, by Theorem 4.4.2, the only divisors of 1 
are 1 and —1. Thus / and k are both | or are both —1. If h = k = 1, then b = a. But if 
h = k = -1, then b = —a and so a + b. This analysis suggests that you can find a coun- 
terexample by taking b = —a. Here is a formal answer: 


Proposed Divisibility Property: For all integers a and b, if a|b and b|a then 
a=b. 


Counterexample: Leta = 2andb = —2. Then —2 = (—1)-2and2 = (—1)-(—2), 
and thus 


a|b and bla, but a # b because 2 + —2. 


Therefore, the statement is false. 


The search for a proof will frequently help you discover a counterexample (provided the 
statement you are trying to prove is, in fact, false). Conversely, in trying to find a counter- 
example for a statement, you may come to realize the reason why it is true (if it is, in fact, 
true). The important thing is to keep an open mind until you are convinced by the evidence 
of your own careful reasoning. 


The Unique Factorization of Integers Theorem 


The most comprehensive statement about divisibility of integers is contained in the unique 
factorization of integers theorem. Because of its importance, this theorem is also called the 
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fundamental theorem of arithmetic. Although Euclid, who lived about 300 B.C.E., seems to 
have been acquainted with the theorem, it was first stated precisely by the great German 
mathematician Carl Friedrich Gauss (rhymes with house) in 1801. 

The unique factorization of integers theorem says that any integer greater than 1 either 
is prime or can be written as a product of prime numbers in a way that is unique except, 
perhaps, for the order in which the primes are written. For example, 


72 = 2:-2-2:3-3 = 2-3-3-2°2 = 3:-2-2-3-2, 


and so forth. The three 2’s and two 3’s may be written in any order, but any factorization of 
72 as a product of primes must contain exactly three 2’s and two 3’s—no other collection 
of prime numbers besides three 2’s and two 3’s multiplies out to 72. 


Note This theorem is the 
reason the number 1 is Theorem 4.4.5 Unique Factorization of Integers Theorem 
not allowed to be prime. (Fundamental Theorem of Arithmetic) 

If 1 were prime, then 
factorizations would not 
be unique. For example, 
6 = 2-3 = 1-2-3, and so n = pi py p3..- pis 
forth. 


Given any integer n > 1, there exist a positive integer k, distinct prime numbers p}, 
P2,---» Pk, and positive integers e€}, é2,..., €g such that 


and any other expression for n as a product of prime numbers is identical to this 
except, perhaps, for the order in which the factors are written. 


The proof of the unique factorization of integers theorem is outlined in the exercises for 
Sections 5.4 and 8.4. 

Because of the unique factorization theorem, any integer n > | can be put into a stan- 
dard factored form in which the prime factors are written in ascending order from left 
to right. 


Definition 


Given any integer n > 1, the standard factored form of n is an expression of the 
form 


—: E e e: ey 
ih = JO [by J a 


where k is a positive integer, p4, P2, - . - , P are prime numbers, e€}, €z, . - - , €g are posi- 
tive integers, and P P =- P, 


Writing Integers in Standard Factored Form 
Write 3,300 in standard factored form. 
Solution First find all the factors of 3,300. Then write them in ascending order: 
3,300 = 100-33 = 4-25-3-11 
= 2-2-5-5-3-11 = 27-3'-5?- 11. 
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TEST YOURSELF 


1. 


EXERCISE SET 4.4 


. Ifa and b are positive integers and a 
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Example 4.4.9 


Suppose m is an integer such that 


197 


Using Unique Factorization to Solve a Problem 


8-7°6°5°4-3-2-m = 17-16-15-14-13-12-11-10. 


Does 17|m? 


Solution Since 17 is one of the prime factors of the right-hand side of the equation, it is 
also a prime factor of the left-hand side (by the unique factorization of integers theorem). 
But 17 does not equal any prime factor of 8, 7, 6, 5, 4, 3, or 2 (because it is too large). Hence 
17 must occur as one of the prime factors of m, and so 17 | m. E 


To show that a nonzero integer d divides an integer 
n, we must show that 


. To say that d divides n means the same as saying 


that is divisible by 


b, then 


is less than or equal to 


For all integers n and d, df n if, and only if, 


. Ifa and b are integers, the notation a | b denotes 


and the notation a/b denotes 


. The transitivity of divisibility theorem says that 


for all integers a, b, and c, if then 


The divisibility by a prime theorem says that every 
integer greater than | is 


. The unique factorization of integers theorem says 


that any integer greater than | is either ______ or 
can be written as in a way that is unique 
except possibly for the in which the num- 
bers are written. 


Give a reason for your answer in each of 1-13. Assume 
that all variables represent integers. 


1. 


-i ~ -i 
G N 3 0 


of ws Oo o PS YON 


Is 52 divisible by 13? 

Does 7|56? 

Does 5|0? 

Does 3 divide (3k + 1)(3k + 2)(3k + 3)? 
Is 6m(2m + 10) divisible by 4? 

Is 29 a multiple of 3? 

Is —3 a factor of 66? 

Is 6a(a + b) a multiple of 3a? 

Is 4 a factor of 2a-34b? 

Does 7|34? 


. Does 13|73? 
. Ifn = 4k +1, does 8 divide n? — 1? 
. Ifn = 4k +3, does 8 divide n? — 1? 


14. 


Fill in the blanks in the following proof that for all 
integers a and b, if a|b then a|(—d). 


Proof: Suppose a and b are any integers such that 
_@ | By definition of divisibility, there exists an 
integer r such that . By substitution, 


b = —(ar) = a(-r). 


Lett = _©__. Then ris an integer because 

t = (—1)-r, and both —1 and r are integers. Thus, 
by substitution, —b = at, where t is an integer, and 
so by definition of divisibility, (d) , as was to be 
shown. 


Prove statements 15 and 16 directly from the definition 
of divisibility. 


15. 


H 16. 


17. 


For all integers a, b, and c, if a|b and a|c then 
al(b+o). 


For all integers a, b, and c, if a|b then a|c then 
a|(b—c). 


For all integers a, b, c, and d, if a|c and b|d then 
ab|cd. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


198 CHAPTER4 ELEMENTARY NUMBER THEORY AND METHODS OF PROOF 


18. Consider the following statement: The negative of 
any multiple of 3 is a multiple of 3. 
a. Write the statement formally using a quantifier 
and a variable. 
b. Determine whether the statement is true or 
false and justify your answer. 


34. Consider a string consisting of a’s, b’s, and c’s 
where the number of b’s is three times the num- 
ber of a’s and the number of c’s is five times the 
number of a’s. Prove that the length of the string is 
divisible by 3. 


35. Two athletes run a circular track at a steady 
pace so that the first completes one round in 8 
minutes and the second in 10 minutes. If they 
both start from the same spot at 4 P.M., when 
will be the first time they return to the start 
together? 


19. Show that the following statement is false: For all 
integers a and b, if 3|(a + b) then 3|(a — b). 


For each statement in 20-32, determine whether the 
statement is true or false. Prove the statement directly 
from the definitions if it is true, and give a counterex- 


ample if it is false. 


H 20. 


The sum of any three consecutive integers is divis- 
ible by 3. 


36. 


It can be shown (see exercises 44—48) that an 
integer is divisible by 3 if, and only if, the sum of 
its digits is divisible by 3; an integer is divisible 
by 9 if, and only if, the sum of its digits is divis- 


21. a product of any two even integers is a multiple ible by 9; an integer is divisible by 5 if, and only 
as if, its right-most digit is a 5 or a 0; and an integer 
H 22. A necessary condition for an integer to be divisible is divisible by 4 if, and only if, the number 
by 6 is that it be divisible by 2. formed by its right-most two digits is divisible by 
be bs : = 4. Check the following integers for divisibility by 
23. A sufficient condition for an integer to be divisible 3.4.5 and9 
by 8 is that it be divisible by 16. a. 637,425,403,705,125 
24. For all integers a, b, and c, if a|b and a|c then b. 12,858,306,120,312 
a| (2b — 3c). c. 517,924,440,926,512 
d. 14,328,083,360,232 
25. For all integers a, b, and c, if a is a factor of c and 
b is a factor of c then ab is a factor of c. 37. Use the unique factorization theorem to write the 
H 26. Foralli badeu eani following integers in standard factored form. 
; we all integers a, b, and c, if ab|c then a|c an a. 1,176 b. 5.733 c. 3,675 
c. 
H For all i b ande ifalbt oth p 38. Letn = 8,424. 
27. For all integers a, b, and c, if a|(b + c) then a| a. Write the prime factorization for n. 
or alc. b. Write the prime factorization for n. 
28. For all integers a, b, and c, if a|bc then a|b or alc. H c. Is n divisible by 20? Explain. 
H d. What is the least positive int that 
29. For all integers a and b, if a|b then a° |b’. ER ee 
8,424 -m is a perfect square? 
y ` 2 
30. For all integers a and n, if a|n“ and a <n then 39. Suppose that in standard factored form 
a|n. a = pi' p? . .. p, where k is a positive integer; pı, 
31. For all integers a and b, if a| 10b then a|10 or a|b. P2- - - , Pk are prime numbers; and e}, €z, ..., €x 
J A astoa chain in which Jih are positive integers. 
oa a SR T EWA ene ie a. What is the standard factored form for a°? 
SUD Om 11S Ever W Cadi CUSTOMEr Wio Mazes b. Find the least positive integer k such that 
a purchase. If some of the numbers on the card add 4.25 Qo: eis 
. ; 2°-3°-7-11°-k is a perfect cube (that is, it 
up to 100, then the customer wins $100. A certain : ‘ : 
é ive equals an integer to the third power). Write the 
customer receives a card containing the numbers . 
resulting product as a perfect cube. 
72, 21, 15, 36, 69, 81, 9, 27, 42, and 63. 40. a. Ifa and b are integers and 12a = 25b, does 
Will the customer win $100? Why or why not? 12|b? does 25|a? Explain. 
: : Lo. . b. If x and y are integers and 10x = 9y, does 
33. Is it possible to have a combination of nickels, 


dimes, and quarters that add up to $4.72? Explain. 
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H 41. How many zeros are at the end of 45°-88°7 Explain 
how you can answer this question without actually 
computing the number. (Hint: 10 = 2-5.) 


42. If nis an integer and n > 1, then n! is the product 
of n and every other positive integer that is less 
than n. For example, 5! = 5-4-3-2-1. 

a. Write 6! in standard factored form. 

b. Write 20! in standard factored form. 

c. Without computing the value of (20!) 
determine how many zeros are at the end of 
this number when it is written in decimal form. 
Justify your answer. 


H* 43. Ata certain university 2/3 of the mathematics 
students and 3/5 of the computer science students 
have taken a discrete mathematics course. The 
number of mathematics students who have taken 
the course equals the number of computer science 
students who have taken the course. If there are at 
least 100 mathematics students at the university, 
what are the least possible number of mathematics 
students and the least possible number of com- 
puter science students at the university? 


Definition: Given any nonnegative integer n, the decimal 
representation of n is an expression of the form 


dy dy—1*** dyd\do, 


where k is a nonnegative integer, do, di, do,..., dy (called 
the decimal digits of n) are integers from 0 to 9 inclu- 
sive, dk # 0 unless n = 0 and k = 0, and 


n= dy 10" +dp_1-108! + +++ +d>-10° +d, +10 + dp. 


(For example, 2,503 = 2-10°+5-107+0-10+3.) 


44. Prove that if n is any nonnegative integer whose 
decimal representation ends in 0, then 5|n. (Hint: 
If the decimal representation of a nonnegative 
integer n ends in do, then n = 10m + dp for some 
integer m.) 


45. Prove that if n is any nonnegative integer whose 
decimal representation ends in 5, then 5|n. 


46. Prove that if the decimal representation of 
a nonnegative integer n ends in dıdọ and if 


ANSWERS FOR TEST YOURSELF 


1. n equals d times some integer and d # 0 (Or: there is an 
integer r such that n = dr and d #0) 2.n;d 3.a;b 
4. A is not an integer 5. the sentence “a divides b”; the 


H* 47. 


* 48 


* 49. 


50. 


4| (10d, + do), then 4|n. (Hint: If the decimal 
representation of a nonnegative integer n ends 
in ddo, then there is an integer s such that 

n = 100s + 10d, + do.) 


Observe that 


7,524 = 7-1,000+5-100+2-10+4 

= 7(999 + 1)+5(99 + 1) +2(9+ 1) +4 

= (7-999 + 7) + (5-99 +5) +(2:94+2)+4 

= (7:999 + 5:99 + 2:9) +(7+5+2+4) 

= (7-111-94+5-11:-9+2-9)+(7+54+2+4) 

= (7-111 +5-114+2)-9+(7+5+2+4) 

= (an integer divisible by 9) 

+ (the sum of the digits of 7,524). 
Since the sum of the digits of 7,524 is divisible by 
9, 7,524 can be written as a sum of two integers 
each of which is divisible by 9. It follows from 
exercise 15 that 7,524 is divisible by 9. 
Generalize the argument given in this example 

to any nonnegative integer n. In other words, prove 
that for any nonnegative integer n, if the sum of 


the digits of n is divisible by 9, then n is divisible 
by 9. 


Prove that for any nonnegative integer n, if the 
sum of the digits of n is divisible by 3, then n is 
divisible by 3. 


Given a positive integer n written in decimal form, 
the alternating sum of the digits of n is obtained 
by starting with the right-most digit, subtracting 
the digit immediately to its left, adding the next 
digit to the left, subtracting the next digit, and 

so forth. For example, the alternating sum of the 
digits of 180,928 is 8-2+9-0+8-1 = 22. 
Justify the fact that for any nonnegative integer n, 
if the alternating sum of the digits of n is divisible 
by 11, then n is divisible by 11. 


The integer 123,123 has the form abc,abc, where 
a, b, and c are integers from 0 through 9. Consider 
all six-digit integers of this form. Which prime 
numbers divide every one of these integers? Prove 
your answer. 


number obtained when a is divided by b 6. a divides b 


and 


b divides c; a divides c__7. divisible by some prime 


number 8. prime; a product of prime numbers; order 
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CH Direct Proof and Counterexample V: Division into 
Cases and the Quotient-Remainder Theorem 


Be especially critical of any statement following the word “obviously.” 
—Anna Pell Wheeler, 1883—1966 


When you divide 11 by 4, you get a quotient of 2 and a remainder of 3. 


2 <— quotient 
alt 

8 

3 <— remainder 


Another way to say this is that 11 equals 2 groups of 4 with 3 left over: 


XXX XXX XXX 


T T 


2 groups of 4 3 left over 
Or, 
11 =2-44+3. 
2 groups of 4 3 left over 


The number left over (3) is less than the size of the groups (4) because if 4 or more were 
left over, another group of 4 could be formed. 

The quotient-remainder theorem says that when any integer n is divided by any positive 
integer d, the result is a quotient q and a nonnegative integer remainder r that is smaller than d. 


Theorem 4.5.1 The Quotient-Remainder Theorem 


Given any integer n and positive integer d, there exist unique integers q and r such 
that 


n=dqtr and O=r<d. 


The proof that there exist integers g and r with the given properties is in Section 5.4; the 
proof that g and r are unique is outlined in exercise 21 in Section 4.8. 
If n is positive, the quotient-remainder theorem can be illustrated on the number line 


as follows: 
+ +0 d 2d 3d- ++ +++ qdn--:-. 
——————$—$—— <q] —— > 
yd 


r 


If n is negative, the picture changes. Since n = dq +r, where r is nonnegative, d must be 
multiplied by a negative integer q to bring dq either exactly to n (in which case r = 0) or to 
a point below n (in which case the positive integer r is added to bring dg + r back up to n). 
This is illustrated as follows: 


- +s qd ne + + + + + 33d 2d -d O+ +: 
SSS ee 
ae | 


r 
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Example 4.5.1 


Note In 1801 Carl Fried- 
rich Gauss introduced 

the word modulus and its 
abbreviation mod with the 
meaning shown here. 


Example 4.5.2 


The Quotient-Remainder Theorem 

For each of the following values of n and d, find integers q and r such that n = dq + r and 
Osr<d. 

a. n= 54,d = 4 b. n= —54,d = 4 c. n = 54, d = 70 
Solution 

a. 54 = 4:13 + 2; hence q = 13 andr = 2. 

b. —54 = 4: (—14) + 2; hence q = —14 andr = 2. 

c. 54 = 70:0 + 54; hence q = 0 and r = 54. E 


div and mod 


A number of computer languages have built-in functions that enable you to compute values of 
the quotients and remainders for the quotient-remainder theorem. In Python n div d is written 
n // d and n mod dis written n % d, and for all integer inputs both operators give the values 
that satisfy the quotient-remainder theorem. In C, C#, and Java, n div d is written n / d and 
n mod d is written n % d. For all nonnegative integer inputs for n and positive integer inputs 
for d, both operators give the values that satisfy the quotient-remainder theorem, but for nega- 
tive integer inputs for n or d the resulting values differ from their mathematical counterparts. 


Definition 


Given an integer n and a positive integer d, 


n div d = the integer quotient obtained 
when n is divided by d, and 


n mod d = the nonnegative integer remainder obtained 
when n is divided by d. 


Symbolically, if n and d are integers and d > 0, then 
ndivd=q and nmodd=r & n=dq+r, 


where q and r are integers and 0 S r < d. 


It follows from the quotient-remainder theorem that n mod d equals one of the integers 
from 0 through d — 1 (since the remainder of the division of n by d must be one of these inte- 
gers). Also a necessary and sufficient condition for an integer n to be divisible by an integer 
d is that n mod d = 0. You are asked to prove this in the exercises at the end of this section. 


Computing div and mod by Hand or with a Four-Function Calculator 


Compute 32 div 9 and 32 mod 9 by hand or with a four-function calculator. 


Solution Performing the division by hand gives the following results: 


3 <32div9 


9[32 
27 
“5 <32mod9 
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To use a four-function calculator to compute n div d for a nonnegative integer n and a posi- 
tive integer d, just divide n by d and ignore the part of the answer to the right of the decimal 
point. To compute n mod d, substitute n div d in place of g and n mod d in place of r in the 
equation n = dq + r. The result is 


n = d: (n div d)+n mod d. 
Solving for n mod d gives 
n modd = n-— d: (n div d). 


Thus when you use a four-function calculator to divide 32 by 9, you obtain an expression 
like 3.555555556. Discarding the fractional part gives 32 div 9 = 3, and so 


32 mod 9 = 32 — 9 - (32 div 9) = 32-27 = 5. E 


Computing the Day of the Week 


Suppose today is Tuesday, and neither this year nor next year is a leap year. What day of 
the week will it be 1 year from today? 


Solution There are 365 days in a year that is not a leap year, and each week has 7 days. 
Now 


365 div 7 = 52 and 365 mod7= 1 


because 365 = 52-7 + 1. Thus 52 weeks, or 364 days, from today will be a Tuesday, and 
so 365 days from today will be 1 day later, namely, Wednesday. 

More generally, if DayT is the day of the week today and DayN is the day of the week 
in N days, then 


DayN = (DayT + N) mod 7, 4.5.1 
where Sunday = 0, Monday = 1,..., Saturday = 6. E 


Solving Problems about mod 


a. Prove that if n is a positive integer, then n mod 10 is the digit in the ones place in the 
decimal representation for n. (See Section 2.5 or the preamble to exercises 44—49 in 
Section 4.4 for discussion about the decimal representation of integers.) 


b. Suppose m is an integer. If m mod 11 = 6, what is 4m mod 11? 


Solution 
a. Proof: Suppose n is any positive integer. The decimal representation for n is 
dj dy, ... dy dı do, where do, di, dh, . . . , dg are integers from 0 to 9 inclusive, d, # 0 


unless n = 0 and k = 0, 
n = dp 10° +d,_,-10° | +--+ +d):10? + d,-10 +d), 
and dọ is the digit in the ones place. Factoring out 10 from all but the final term gives 
n = 10: (dp: 10"! + dy_,- 10" 2 +--+ + dy: 10! +d)) + do. 
Thus n = 10-(an integer) + dp, and so n mod 10 = do, which is the digit in the ones 
place in the decimal representation for n. 


b. Because m mod 11 = 6, the remainder obtained when m is divided by 11 is 6. This 
means that there is some integer g so that 


m= llq +6. 
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Thus 4m = 44q + 24 = 44q +22 +2 = 11(4q+2)+2. 


Since 4q + 2 is an integer (because products and sums of integers are integers) and since 
2 < 11, the remainder obtained when 4m is divided by 11 is 2. Therefore, 


4m mod 11 = 2. |_| 


Representations of Integers 


In Section 4.1 we defined an even integer to have the form twice some integer. At that time 
we could have defined an odd integer to be one that was not even. Instead, because it was 
more useful for proving theorems, we specified that an odd integer has the form twice 
some integer plus 1. The quotient-remainder theorem brings these two ways of describing 
odd integers together by guaranteeing that any integer is either even or odd. To see why, 
let n be any integer, and consider what happens when n is divided by 2. By the quotient- 
remainder theorem (with d = 2), there exist unique integers g and r such that 


n=2qtr and 0=r<2. 


But the only integers that satisfy 0 = r < 2 are r = 0 and r = 1. It follows that given any 
integer n, there exists an integer q with 


n=2q+0O or n=2q+1. 


In the case that n = 2q +0 = 24, n is even. In the case that n = 2q + 1, n is odd. Hence 


Note The fact that any n is either even or odd, and, because of the uniqueness of q and r, n cannot be both even 
integer is either even or and odd. 

odd is called the parity The parity of an integer refers to whether the integer is even or odd. For instance, 5 has 
property. odd parity and 28 has even parity. 


CU eee Consecutive Integers Have Opposite Parity 
Prove that given any two consecutive integers, one is even and the other is odd. 


Solution Two integers are called consecutive if, and only if, one is one more than the 
other. So if one integer is m, the next consecutive integer is m + 1. 

To prove the given statement, you can divide the analysis into two cases: case 1, where 
the smaller of the two integers is even, and case 2, where the smaller of the two integers 
is odd. 


Theorem 4.5.2 The Parity Property 


Any two consecutive integers have opposite parity. 


Proof: 

Suppose that two [particular but arbitrarily chosen] consecutive integers are given; 
call them m and m+ 1. [We must show that one of m and m+ 1 is even and that the 
other is odd.] By the parity property, either m is even or m is odd. [We break the proof 
into two cases depending on whether m is even or odd.] 


Case 1 (m is even): In this case, m = 2k for some integer k, and so m+ 1 = 2k+1, 
which is odd [by definition of odd]. Hence in this case, one of m and m+ 1 is even 
and the other is odd. 


(continued on page 204) 
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Case 2 (m is odd): In this case, m=2k+1 for some integer k, and so 
m+] = (2k+1)+1 = 2k+2 = 2(k+1). But k+ 1 is an integer because it is a sum 
of two integers. Therefore, m + 1 equals twice some integer, and thus m + 1 is even. 
Hence in this case also, one of m and m+ 1 is even and the other is odd. 


It follows that regardless of which case actually occurs for the particular m and m + 1 that 
are chosen, one of m and m + 1 is even and the other is odd. [This is what was to be shown.] 


The division into cases in a proof is like the transfer of control for an if-then-else statement 
in a computer program. If m is even, control transfers to case 1; if not, control transfers to 
case 2. For any given integer, only one of the cases will apply. You must consider both cases, 
however, to obtain a proof that is valid for an arbitrarily given integer whether even or not. 

There are times when division into more than two cases is called for. Suppose that at 
some stage of developing a proof, you know that a statement of the form 


A, or Az or A3 or... Or Ån 


is true, and suppose you want to deduce a conclusion C. By definition of or, you know that 
at least one of the statements A; is true (although you may not know which). In this situa- 
tion, you should use the method of division into cases. First assume A, is true and deduce 
C; next assume A; is true and deduce C; and so forth, until you have assumed A, is true and 
deduced C. At that point, you can conclude that regardless of which statement A; happens 
to be true, the truth of C follows. 


Method of Proof by Division into Cases 


To prove a statement of the form “If A, or A, or... or A,, then C,’ prove all of the 
following: 


If Aj, then C, 
If A>, then C, 


If A„, then C. 


This process shows that C is true regardless of which of Aj, Ao,..., A, happens to 
be the case. 


Proof by division into cases is a generalization of the argument form shown in Ex- 
ample 2.3.7, whose validity you were asked to establish in exercise 21 of Section 2.3. This 
method of proof was combined with the quotient-remainder theorem for d = 2 to prove 
Theorem 4.5.2. Allowing d to take on additional values makes it possible to obtain a vari- 
ety of other results. We begin by showing what happens when a = 4. 


Representing Integers mod 4 


Show that any integer can be written in one of the four forms 
n=4q or n=4q+1 or n=4q+2 or n=4q+3 


for some integer q. 
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Example 4.5.7 


Note Another way to 

state this fact is that if you 
square an odd integer and 
divide by 8, you will always 
get a remainder of 1. Try a 
few examples! 


Note Desperation can 
spur creativity. When you 
have tried all the obvious 
approaches without suc- 
cess and you really care 
about solving a problem, 
you reach into the odd cor- 
ners of your memory for 
anything that may help. 


Solution Given any integer n, apply the quotient-remainder theorem to n with the divi- 
sor equal to 4. This implies that there exist an integer quotient g and a remainder r such 
that 


n=4qt+r and 0=r<4. 
But the only nonnegative remainders r that are less than 4 are 0, 1, 2, and 3. Hence 
n=4q or n=4q+1 or n=4q+2 or n=4q+3 
for some integer q. In other words, n mod 4 equals 0, 1, 2, or 3. a 


The next example illustrates how the alternative representations for integers mod 4 can 
help establish a result in number theory. The solution is broken into two parts: a discussion 
and a formal proof. These correspond to the stages of actual proof development. Very few 
people, when asked to prove an unfamiliar theorem, immediately write down the kind of 
formal proof you find in a mathematics text. They may first check some examples to ex- 
plore whether the theorem is believable. If it passes that test, they often need to experiment 
with several possible approaches before finding one that works. A formal proof is much 
like the ending of a mystery story—the part in which the action of the story is systemati- 
cally reviewed and all the loose ends are carefully tied together. 


The Square of an Odd Integer 
Prove: The square of any odd integer has the form 8m + 1 for some integer m. 


Solution If checking some examples convinces you that the statement may be true, be- 
gin to develop a proof by asking, “Where am I starting from?” and “What do I need to 
show?” To help answer these questions, introduce variables to rewrite the statement more 
formally. 


Formal Restatement: Y odd integer n, 3 an integer m such that n? = 8m + 1. 
From this, you can immediately identify the starting point and what is to be shown. 


Starting Point: Suppose n is a particular but arbitrarily chosen odd integer. 


To Show: 3an integer m such that n? = 8m + 1. 


This looks tough. Why should there be an integer m with the property that n = 8m+1? 
That would say that (n? — 1)/8 is an integer, or that 8 divides no-l. Perhaps you could 
make use of the fact that n? — 1 = (n—1)(n+ 1). Does 8 divide (n — 1)(n + 1)? Since n is 
odd, both (n — 1) and (n + 1) are even. That means that their product is divisible by 4. But 
that’s not enough. You need to show that the product is divisible by 8. 

You could try another approach by arguing that since n is odd, you can represent it as 
2q +1 for some integer q. Then n? = (2q + 1)? = 4q°+4q+ 1 = 4(q¢? +) + 1. It is clear 
from this analysis that n? can be written in the form 4m+1, but it may not be clear that it 
can be written as 8m + 1.* 

Yet another possibility is to use the result of Example 4.5.6. That example showed 
that any integer can be written in one of the four forms 4q, 4q + 1, 4q + 2, or 4g + 3. Two 
of these, 4q + 1 and 4q +3, are odd. Thus any odd integer can be written in the form 
4q + 1 or 4q + 3 for some integer q. You could try breaking into cases based on these two 
different forms. 


*See exercise 18 for a different perspective about this approach. 
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It turns out that this last possibility works! In each of the two cases, the conclusion 
follows readily by direct calculation. The details are shown in the following formal 
proof: 


Theorem 4.5.3 


The square of any odd integer has the form 8m + 1 for some integer m. 


Proof: Suppose n is a [particular but arbitrarily chosen] odd integer. By the quotient- 
remainder theorem with the divisor equal to 4, n can be written in one of the forms 


4q or 4q+1 or 4q+2 or 4q+3 


for some integer q. In fact, since n is odd and 4g and 4q + 2 are even, n must have 
one of the forms 
4q+1 or 4q+3. 


Case 1(n = 4q + 1 forsome integer q): [We must find an integer m such thatn? = 8m+1.] 
Since n = 4q +1, 


n= (4q + iy by substitution 
= (4q+ 1)(4q+ 1) by definition of square 
= 16¢°+8q+1 
= 8(2q7 +q)+1 by the laws of algebra. 


Let m = 2q° + q. Then m is an integer since 2 and q are integers and sums and prod- 
ucts of integers are integers. Thus, substituting, 


n? =8m+1 where mis an integer. 


Case2(n = 4 q + 3forsome integer q): [We mustfindan integer m such thatn’ = 8m + 1.] 
Since n = 4q +3, 


n = (4q + ay by substitution 
= (4q + 3)(4q + 3) by definition of square 
= 16q°+24q+9 


16q° + 24g + (8+ 1) 
8(2q" $3g+1)+1 by the laws of algebra. 


[The motivation for the choice of algebra steps was the desire to write the expression in 
the form 8-(some integer) + 1.] 


Let m = 2q +3q + 1. Then m is an integer since 1, 2, 3, and q are integers and sums 
and products of integers are integers. Thus, substituting, 


n? =8m+1 where misan integer. 


Cases 1 and 2 show that given any odd integer, whether of the form 4q +1 or 
4q +3, n? = 8m + 1 for some integer m. [This is what we needed to show.] 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


4.5 DIRECT PROOF AND COUNTEREXAMPLE V: DIVISION INTO CASES AND THE QUOTIENT-REMAINDER THEOREM 207 


Note that the result of Theorem 4.5.3 can also be written, “For any odd integer n, n 


mod 8 = 1.” 

In general, according to the quotient-remainder theorem, if an integer n is divided by an 
integer d, the possible remainders are 0, 1, 2,..., (d — 1). This implies that n can be written 
in one of the forms 


dq, dq+1,dq+2,...,dq+(d—1) for some integer q. 


Many properties of integers can be obtained by giving d a variety of different values and 
analyzing the cases that result. 


Absolute Value and the Triangle Inequality 


The triangle inequality is one of the most important results involving absolute value. It has 
applications in many areas of mathematics. 


Definition 


For any real number x, the absolute value of x, denoted |x|, is defined as follows: 


x br sr 
ee 
xe tie < O 


The triangle inequality says that the absolute value of the sum of two numbers is less than 
or equal to the sum of their absolute values. We give a proof based on the following two facts, 
both of which are derived using division into cases. We state both as lemmas. A lemma is 
a statement that does not have much intrinsic interest but is helpful in deriving other results. 


Lemma 4.5.4 


For every real number r, —|r| <r |r]. 


Proof: Suppose r is any real number. We divide into cases according to whether 
r=0,r>0,orr<0. 


Case 1 (r = 0): In this case, by definition of absolute value, r| = r = Q. Since 0 = —0, 
we have that —0 |r| = 0 =r =|r|, and so it is true that 


-pises 


Case 2 (r > 0): In this case, by definition of absolute value, r| = r, Also, since r is 


positive and — |r| is negative, — |r| < r. Thus it is true that 


-jjer 


Case 3 (r < 0): In this case, by definition of absolute value, |r| = —r. Multiplying 
both sides by —1 gives that — |r| = r. Also, since r is negative and |r| is positive, 
r < |r|. Thus it is also true in this case that 


-Ir =r = rl. 
Hence, in every case, 
-Ir =r= || 


[as was to be shown]. 
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Lemma 4.5.5 


For every real number r, |—r| = |r]. 


Proof: Suppose r is any real number. By Theorem T23 in Appendix A, if r > 0, then 
—r <0, and if r < 0, then —r > 0. Thus 


=r if —r>0 


| = r| = 0 if—r=0 by definition of absolute value 
=(=) if-r<0 
=p if -r> 
: 0 because —(—r) = r by Theorem T4 in 
= 0 if r=0 Appendix A, and when —r = 0, then r = 0 
if —r<0 
=r if r<0 because, by Theorem T24 in Appendix A, 
= 0 if r=0 when —r > 0, then r < 0, when —r <0, 
if r>0 thenr >0 
= r if r20 by reformatting the previous result 
-r if r<0 


by definition of absolute value. 


Lemmas 4.5.4 and 4.5.5 now provide a basis for proving the triangle inequality. 


Theorem 4.5.6 The Triangle Inequality 


For all real numbers x and y, |x+y| = |x| + ly]. 


Proof: Suppose x and y are any real numbers. 
Case 1 (x + y = 0): In this case, |x +y| = x+ y, and so, by Lemma 4.5.4, 
x= |x| and y= |y]. 
Hence, by Theorem T26 of Appendix A, 
lary] = x+y = |x| + |yl. 


Case 2 (x +y <0): In this case, lx+ y] ==(x+y)=(=x)+(-y); and so, by 
Lemmas 4.5.4 and 4.5.5, 
=x £ |~x|= |x| and -y=|=y| = |yl. 


It follows, by Theorem T26 of Appendix A, that 
|x+y] = C+») = |x| + [yl 


Hence in both cases |x + y| = |x| + |y| fas was to be shown]. 
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The quotient-remainder theorem says that for all 
integers n and d with d = 0, there exist q 
and r such that and 


. Ifn and d are integers with d > 0, n div d is 


and n mod d is 


. The parity of an integer indicates whether the 


integer is 


According to the quotient-remainder theorem, if 
an integer n is divided by a positive integer d, the 


possible remainders are . This implies that 
n can be written in one of the forms for 
some integer q. 


. To prove a statement of the form “If A; or A 


and and 


or A3, then C,’ prove 


. The triangle inequality says that for all real num- 


bers x and y, 


For each of the values of n and d given in 1-6, find inte- 


gers q and r such that n = dq+rand0 =r <d. 


1. n=70,d=9 2. n=62,d=7 

3. n = 36, d = 40 4. n=3,d=l1l1 

5. n= —45,d = 11 6. n= —27,d=8 
Evaluate the expressions in 7-10. 

7. a. 43 div9 b. 43 mod 9 

8. a. SO div7 b. 50 mod 7 

9. a. 28 div5 b. 28 mod 5 
10. a. 30 div 2 b. 30 mod 2 

11. Check the correctness of formula (4.5.1) given in 


15. 


16. 


Example 4.5.3 for the following values of DayT 
and N. 

a. DayT = 6 (Saturday) and N = 15 

b. DayT = 0 (Sunday) and N = 7 

c. DayT = 4 (Thursday) and N = 12 


. Justify formula (4.5.1) for general values of DayT 


and N. 


. Ona Monday a friend says he will meet you again 


in 30 days. What day of the week will that be? 


. If today is Tuesday, what day of the week will it be 


1,000 days from today? 


January 1, 2000, was a Saturday, and 2000 was a 
leap year. What day of the week will January 1, 
2050, be? 


Suppose d is a positive and n is any integer. If d|n, 
what is the remainder obtained when the quotient- 
remainder theorem is applied to n with divisor d? 


17. 


18. 


19. 


20. 


21. 


22. 


Prove directly from the definitions that for every 
integer n, n’—n +3 is odd. Use division into two 
cases: n is even and n is odd. 


a. Prove that the product of any two consecutive 
integers is even. 

b. The result of part (a) suggests that the second 
approach in the discussion of Example 4.5.7 
might be possible after all. Write a new proof 
of Theorem 4.5.3 based on this observation. 


Prove directly from the definitions that for all 
integers m and n, if m and n have the same 
parity, then 5m + 7n is even. Divide into two 
cases: m and n are both even and m and n are 
both odd. 


Suppose a is any integer. If a mod 7 = 4, what 
is 5a mod 7? In other words, if division of a by 
7 gives a remainder of 4, what is the remainder 
when 5a is divided by 7? Your solution should 
show that you obtain the same answer no matter 
what integer you start with. 


Suppose b is any integer. If b mod 12 = 5, what 
is 8b mod 12? In other words, if division of b by 
12 gives a remainder of 5, what is the remainder 
when 8b is divided by 12? Your solution should 

show that you obtain the same answer no matter 
what integer you start with. 


Suppose c is any integer. If c mod 15 = 3, what 
is 10c mod 15? In other words, if division of c by 
15 gives a remainder of 3, what is the remainder 
when 10c is divided by 15? Your solution should 
show that you obtain the same answer no matter 
what integer you start with. 
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23. 


24. 


25. 


H 26. 


H 27. 


H 28. 


H 29. 


30. 
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Prove that for every integer n, if n mod 5 = 3 then 
n’ mod 5 = 4. 


Prove that for all integers m and n, if m mod 5 = 2 
and n mod 5 = 1 then mn mod 5 = 2. 


Prove that for all integers a and b, if a mod 7 = 5 
and b mod 7 = 6 then ab mod 7 = 2. 


Prove that a necessary and sufficient condition for 
an integer n to be divisible by a positive integer d 
is that n mod d = 0. 


Use the quotient-remainder theorem with divisor 
equal to 2 to prove that the square of any integer 
can be written in one of the two forms 4k or 4k + 1 
for some integer k. 


a. Prove: Given any set of three consecutive inte- 
gers, one of the integers is a multiple of 3. 

b. Use the result of part (a) to prove that any product 
of three consecutive integers is a multiple of 3. 


a. Use the quotient-remainder theorem with 
divisor equal to 3 to prove that the square of 
any integer has the form 3k or 3k + 1 for some 
integer k. 

b. Use the mod notation to rewrite the result of 


part (a). 


a. Use the quotient-remainder theorem with divi- 
sor equal to 3 to prove that the product of any 
two consecutive integers has the form 3k or 
3k + 2 for some integer k. 

b. Use the mod notation to rewrite the result of 


part (a). 


In 31-33, you may use the properties listed in 
Example 4.3.3. 


31. 


32. 


33. 


H 34. 


a. Prove that for all integers m and n, m + n and 
m— nare either both odd or both even. 

b. Find all solutions to the equation m —n = 56 
for which both m and n are positive integers. 

c. Find all solutions to the equation m-n = 
for which both m and n are positive integers. 


88 


Given any integers a, b, and c, if a — b is even and 
b — c is even, what can you say about the parity of 
2a — (b + c)? Prove your answer. 


Given any integers a, b, and c, if a — b is odd and 
b—c is even, what can you say about the parity of 
a— c? Prove your answer. 


Given any integer n, if n > 3, couldn, n + 2, and 
n + 4 all be prime? Prove or give a counterexample. 
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Prove each of the statements in 35-43. 


35. 


H 36. 


H 37. 
38. 


H 39. 


40 


H 41. 
42. 


H 43. 


44 


45. 


* 46. 


The fourth power of any integer has the form 8m 
or 8m + 1 for some integer m. 


The product of any four consecutive integers is 
divisible by 8. 


For any integer n, n? +5 is not divisible by 4. 


For every integer m, m = 5k, or m = 5k+ 1, or 
m? = 5k + 4 for some integer k. 


Every prime number except 2 and 3 has the form 
6q + 1 or 6q + 5 for some integer q. 


. If n is any odd integer, then nf mod 16 = 1. 


x|-|y| = [xy]. 


For all real numbers r and c with c = 0, -cSrSc 
if, and only if, |r| < c. (Hint: Proving A if, and 
only if, B requires proving both if A then B and if 
B then A.) 


For all real numbers x and y, 


For all real numbers a and b, ||a| —|b|| = |a- b|. 
. A matrix M has 3 rows and 4 columns. 
Ai 42 43 Ay4 
ai an az a4 
431 432, 4@33 34 


The 12 entries in the matrix are to be stored in 
row major form in locations 7,609 to 7,620 in a 
computer’s memory. This means that the entries in 
the first row (reading left to right) are stored first, 
then the entries in the second row, and finally the 
entries in the third row. 

a. Which location will a> be stored in? 

b. Write a formula (in i and j) that gives the 
integer n so that aj is stored in location 
7,609 + n. 

c. Find formulas (in n) for r and s so that a, is 
stored in location 7,609 + n. 


Let M be a matrix with m rows and n col- 

umns, and suppose that the entries of M are 
stored in a computer’s memory in row ma- 

jor form (see exercise 44) in locations N, 
N+1,N+2,...,N+mn-— 1. Find formulas in k 
for r and s so that a, is stored in location N + k. 


If m, n, and d are integers, d > 0, and m mod d = 
nmod d, does it necessarily follow that m = n? 
That m — n is divisible by d? Prove your answers. 
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* 47. If m,n, and d are integers, d > 0, and d|(m—n), * 49. If m,n, a, b, and d are integers, d > 0, and m 
what is the relation between m mod d and n mod d = a and n mod d = b, is (mn) mod 
mod d? Prove your answer. d = ab? Is (mn) mod d = ab mod d? Prove your 

answers. 

* 48. If m,n, a, b, and d are integers, d > 0, and m mod 
d= aand n modd = b,is (m+n) mod d = a+b? 50. Prove that if m, d, and k are integers and d > 0, 
Is (m+n) mod d = (a+b) mod d? Prove your then (m + dk) mod d = m mod d. 
answers. 


ANSWERS FOR TEST YOURSELF 


1. integers; n =dq+r;0Sr<d_ 2. the quotient obtained (d— 1); dq, dqt+1,dq+2,...,dq+(d—-1) 5. A, 
when n is divided by d; the nonnegative remainder obtained then C; If A>, then C; If A3,thenC 6. |x+y| < |x| + |y] 
when n is divided by d 3. odd or even 4.0, 1, 2,..., 


EXA Direct Proof and Counterexample VI: 
Floor and Ceiling 


Proof serves many purposes simultaneously. In being exposed to the scrutiny and 
judgment of a new audience, [a] proof is subject to a constant process of criticism 
and revalidation. Errors, ambiguities, and misunderstandings are cleared up by 
constant exposure. Proof is respectability. Proof is the seal of authority. 

Proof, in its best instances, increases understanding by revealing the heart of the 
matter. Proof suggests new mathematics. The novice who studies proofs gets closer to 
the creation of new mathematics. Proof is mathematical power, the electric voltage of 
the subject which vitalizes the static assertions of the theorems. 

Finally, proof is ritual, and a celebration of the power of pure reason. 

—Philip J. Davis and Reuben Hersh, The Mathematical Experience, 1981 


Imagine a real number sitting on a number line. The floor and ceiling of the number are the 
integers to the immediate left and to the immediate right of the number (unless the number 
is an integer, in which case its floor and ceiling both equal the number itself). Many com- 
puter languages have built-in functions that compute floor and ceiling automatically. These 
functions are very convenient to use when writing certain kinds of computer programs. 
In addition, the concepts of floor and ceiling are important in analyzing the efficiency of 
many computer algorithms. 


Definition 


Given any real number x, the floor of x, denoted |x], is defined as follows: 
|x] = that unique integer n such thatn Sx < n+l. 
Symbolically, if x is a real number and n is an integer, then 


IxjJ=n S nsx<ntl. 


floor of x = |x| 
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Definition 
Given any real number x, the ceiling of x, denoted [x], is defined as follows: 

[x] = that unique integer n such that n— 1 < x =n. 
Symbolically, if x is a real number and n is an integer, then 


HS = Slan 


i 


ceiling of x = [x] 


Computing Floors and Ceilings 


Note On some calcula- Compute |x| and [x] for each of the following values of x: 
tors |x| is denoted INT (x). a 25/4 b. 0.999 c. —2.01 
Solution 


a. 25/4 = 6.25 and 6 < 6.25 < 7; hence |25/4| = 6 and [25/4] = 7. 
b. 0 < 0.999 < 1; hence |0.999| = 0 and [0.999] = 1. 
c. =3 < —2.01 < —2; hence|—2.01| = —3 and[—2.01] = —2. a 


Seu awa An Application 


The 1,370 students at a college are given the opportunity to take buses to an out-of-town 
event. Each bus holds a maximum of 40 passengers. 


a. For reasons of economy, the leader of the event will send only full buses. What is the 
maximum number of buses the event leader will send? 


b. If the event leader is willing to send one partially filled bus, how many buses will be 
needed to allow all the students to take the trip? 


Solution 
a. [1370/40] = [34.25] = 34 b. [1370/40] = [34.25] = 35 a 
Some General Values of Floor 
If k is an integer, what are |k] and |k + 1/2]? Why? 
Solution Suppose k is an integer. Then 
[k] = k because k is an integer and k Sk<k+1, 


and 


1 1 
[e+ $| = k because kis an integer andk Sk+5 <k+1. m 
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Disproving an Alleged Property of Floor 


Is the following statement true or false? 


For all real numbers x and y, |x + y] = |x| + |y]. 


Solution The statement is false. As a counterexample, take x = y = 5 . Then 


1 1 
Lx] + ly! pun 0+0 =0, 


whereas 


1 1 
Ix+ y} RA [J= 1. 
Hence it is not always the case that |x + y] # |x] + Ly]. 

To arrive at this counterexample, you could have reasoned as follows: Suppose x and 
y are real numbers. Must it necessarily be the case that |x + y| = |x] +y], or could x and 
y be such that |x +y] # |x] +y]? Imagine values that the various quantities could take. 
For instance, if both x and y are positive, then |x] and |y] are the integer parts of |x] and |y] 
respectively; just as 


3 3 
25 =2+ 5 
i 
integer part fractional part 
so is 
x = |x] + fractional part of x 
and 
y = |y] + fractional part of y, 


where the term fractional part is understood here to mean the part of the number to the 
right of the decimal point when the number is written in decimal notation. Thus if x and 
y are positive, 


x+y =|x]+|y]+ the sum of the fractional parts of x and y. 
But also 
x+y =|x+y]+ the fractional part of (x + y). 


These equations show that if there exist numbers x and y such that the sum of the fractional 
parts of x and y is at least 1, then a counterexample can be found. As previously indicated, 
there do exist such x and y; for instance, x = 5 and y = 5. E 


The analysis of Example 4.6.4 indicates that if x and y are positive and the sum of their 
fractional parts is less than 1, then |x + y| = |x] + |y]. In particular, if x is positive and m is a 
positive integer, then |x + m] = |x] + |m] = |x| + m. (The fractional part of m is 0; hence the 
sum of the fractional parts of x and m equals the fractional part of x, which is less than 1.) It 
turns out that you can use the definition of floor to show that this equation holds for every 
real number x and for every integer m. 


Proving a Property of Floor 


Prove that for every real number x and for every integer m, |x + m| = |x] + m. 
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Solution Begin by supposing that x is a particular but arbitrarily chosen real number and 
that m is a particular but arbitrarily chosen integer. You must show that |x + m] = |x] + m. 
Since this is an equation involving |x] and |x + m], it is reasonable to give one of these quan- 
tities a name: Let n = |x]. By definition of floor, 


nis an integer and nS=x<ntl. 


This double inequality enables you to compute the value of |x + m] in terms of n by adding 
m to all sides: 


ntmSx+m<n+m+l1. 
Thus the left-hand side of the equation to be shown is 
lx+m|=n+m. 
On the other hand, since n = |x], the right-hand side of the equation to be shown is 
|x} +m=n+m 


also. Thus |x + m] = |x] + m. This discussion is summarized as follows: 


Theorem 4.6.1 


For every real number x and every integer m, |x + m] = |x] + m. 


Proof: Suppose any real number x and any integer m are given. [We must show that 
|x + m| = |x|+m.] Let n = |x]. By definition of floor, n is an integer and 


nsx<ntl. 
Add m to all three parts to obtain 
n+msxtm<n+m+1 


[since adding a number to both sides of an inequality does not change the direction of 
the inequality]. 

Now n + mis an integer [since n and m are integers and a sum of integers is an in- 
teger], and so, by definition of floor, the left-hand side of the equation to be shown is 


|Ix+m|=n+m. 
But n = |x]. Hence, by substitution, 


n+m=\|x|+m, 


which is the right-hand side of the equation to be shown. Thus |x + m] = |x| + m [as 
was to be shown]. 


The analysis of a number of computer algorithms, such as the binary search and merge 
sort algorithms, requires that you know the value of |n/2), where n is an integer. The for- 
mula for computing this value depends on whether n is even or odd. 
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Theorem 4.6.2 The Floor of n/2 


For any integer n, 


if n is even 


N| 
ES 
II 
S wls 


if n is odd. 


Proof: Suppose n is a [particular but arbitrarily chosen] integer. By the quotient- 
remainder theorem, either n is odd or n is even. 


Case 1 (n is odd): In this case, n = 2k + 1 for some integer k. [We must show that 
|n/2] = (n — 1)/2.] But the left-hand side of the equation to be shown is 


f 2k+1 Z 4 1 
= = +—|=|k+-|=k 
2 2 2 72 2 
because k is an integer and k S k + 1/2 < k + 1. And the right-hand side of the equa- 
tion to be shown is 


n—1 (@k+1)-1 2k 
2 2 2 


=k 


also. So since both the left-hand and right-hand sides equal k, they are equal to each 
other. That is, [5 | =” z l [as was to be shown]. 


Case 2 (n is even): In this case, n = 2k for some integer k. [We must show that 
|n/2] = n/2.] The rest of the proof of this case is left as an exercise. 


Given any integer n and any positive integer d, the quotient-remainder theorem guaran- 
tees the existence of unique integers g and r such that 


n=dq+r and O0=r<d. 


The following theorem states that the floor notation can be used to describe g and r as 


follows: 
n n 
q= H and r=n -a|"| 
d d 


Thus if, on a calculator or in a computer language, floor is built in but div and mod are not, 
div and mod can be defined as follows: For a nonnegative integer n and a positive integer d, 


n 


n div d = | and nmodd=n -a|"| 4.6.1 


Note that d divides n if, and only if, n mod d = 0. In floor notation this means that d divides 
nif, and only if, n = d-|n/d|. You are asked to prove this in exercise 33. 
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Theorem 4.6.3 
If 7 is any integer and d is a positive integer, and if q = |n/d|and r = n—d-|n/dl, then 


n=dq+r and O0=r<d. 


Proof: Suppose n is any integer, dis a positive integer, q = |n/d|, andr = n—d-|n/dl. 
[We must show that n = dq +r and0 =r < d.] By substitution, 


agtr=af'le(n i") e 


So it remains only to show that 0 = r < d. But q = |n/d]. Thus, by definition of 


floor, 
a ey] 
g= d q . 
Then 
dq=n<dq+d by multiplying all parts by d 
and so 
Os=n-dq<d by subtracting dq from all parts. 
But 
d2) d 
r= == i ` 
F q 
Hence 


Osr<d by substitution. 


[This is what was to be shown.] 


Se ea Computing div and mod 
Use the floor notation to compute 3,850 div 17 and 3,850 mod 17. 


Solution By formula (4.6.1), 


3,850 div 17 = |3,850/17| =|226.4705882...| = 226 
3,850 mod 17 = 3,850 — 17-|3,850/17| 
= 3,850 = 17-226 


= 3,850 — 3,842 = 8. a 
TEST YOURSELF 
1. Given any real number x, the floor of x is the 2. Given any real number x, the ceiling of x is the 
unique integer n such that ___. unique integer n such that 
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EXERCISE SET 4.6 


Compute |x| and [x] for each of the values of x in 1-4. 


1. 37.999 2. 17/4 13. Prove that if n is any even integer, then |n/2] = n/2. 
3. —14.00001 4. —32/5 14. Show that the following statement is false. 
5. Use the floor notation to express 259 div 11 and For all real numbers x and y, |x — y] = |x|—Ly). 

259 mod 11. 


Some of the statements in 15-22 are true and some are 

6. If k is an integer, what is [k]? Why? false. Prove each true statement and find a counterexam- 

2 ikia inern viari [k 4 1» Why? i for each ms statement, but do not use Theorem 4.6.1 

your proofs. 

8. Seven kilograms of raw material are needed 
to manufacture each unit of a certain product. 
Express the number of units that can be produced 16. For every real number x, [x7| = |x)’. 
from n kilograms of raw material using either the 
floor or the ceiling notation. Which notation is 
more appropriate? n/3 ifn mod 3 = 0 

\In/3)=4(n—-1)/3 ifnmod3 =1 

(n—2)/3 ifn mod 3 = 2. 


15. For every real number x, |x — 1] = |x]— 1. 


H 17. For every integer n, 


9. Boxes, each capable of holding 36 units, are 
used to ship a product from the manufacturer to 


a wholesaler. Express the number of boxes that H 18. For all real numbers x and y, [x + y] = [x] + [y]. 
would be required to ship n units of the product 
using either the floor or the ceiling notation. H 19. For every real number x, [x — 1] = [x]— 1. 
Which notation is more appropriate? 20. For all real numbers x and y, [xy] = [x] [y]. 
10. If 0 = Sunday, 1 = Monday, 2 = Tuesday, ..., 21. For every odd integer n, [n/2] = (n + 1)/2. 
6 = Saturday, then January 1 of year n occurs ; 
on the day of the week given by the following 22. For all real numbers x and y, [xy] = [x]: Ly}. 
formula: Prove each of the statements in 23-33. 
(» n É = | É -= J % É = 3) mod? 23. For any real number x, if x is not an integer, then 
4 100 400 [x] +|-x] = —1. 
a. Use this formula to find January 1 of 24. For any integer m and any real number x, if x is 
i. 2050 ii. 2100 iii. the year of your birth. not an integer, then |x]+|m— x] = m— 1. 
Hb. Interpret the different components of this 
formula. H 25. For every real number x, ||x/2]/2| = |x/4]. 
11. State a necessary and sufficient condition for the 26. For every real number x, if x —|x] < 1/2 then 
floor of a real number to equal that number. [2x] = 2lx]. 
H 12. Let S be the statement: For any odd integer n, 27. For every real number x, if x —Lx] = 1/2 then 
\n/2| = (n— 1)/2. Then S is true, but the following [2x] = 2[x] +1. 
“proof” is incorrect. Find the mistake. H 28. For any odd integer n, 
“Proof: Suppose n is any odd integer. Then nw at i Ae 
n = 2k+ 1 for some integer k. Consequently, "| = ( 5 )( 5 
2k+1|_ k+l _ 2k _, - 
a) 2 iy pi 29. For any odd integer n, 
But n = 2k + 1. Solving for k gives k = (n—1)/2. n2 n2+3 
Hence, by substitution, |n/2] = (n — 1)/2.” H = I 
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n 33. A necessary and sufficient condition for an integer 
2|” n to be divisible by a nonzero integer d is that 
n = |n/d|-d. In other words, for every integer n 


[5 n and nonzero integer d, 
6f a. if d|n, then n =|n/d|-d. 
” b. ifn = |n/d]:d then d|n. 
32. For every integer n, 2 = =| 


ANSWERS FOR TEST YOURSELF 


lnsx<nt+l1 2.n-1<xSn 


ZWA Indirect Argument: Contradiction and Contraposition 


Reductio ad absurdum... is one of a mathematician’s finest weapons. It is a far finer 
gambit than any chess gambit: a chess player may offer the sacrifice of a pawn or 
even a piece, but the mathematician offers the game. —G. H. Hardy, 1877-1947 


In a direct proof you start with the hypothesis of a statement and make one deduction after 
another until you reach the conclusion. Indirect proofs are more roundabout. One kind of 
indirect proof, argument by contradiction, is based on the fact that either a statement is true 
or it is false but not both. So if you can show that the assumption that a given statement is 
not true leads logically to a contradiction, impossibility, or absurdity, then that assumption 
must be false: and, hence, the given statement must be true. This method of proof is also 
known as reductio ad impossible or reductio ad absurdum because it relies on reducing a 
given assumption to an impossibility or absurdity. 

Argument by contradiction occurs in many different settings. For example, if a man ac- 
cused of holding up a bank can prove that he was someplace else at the time the crime was 
committed, he will certainly be acquitted. The logic of his defense is as follows: 


Suppose I did commit the crime. Then at the time of the crime, I would have had to be at the 
scene of the crime. In fact, at the time of the crime I was in a meeting with 20 people far from 
the crime scene, as they will testify. This contradicts the supposition that I committed the 
crime since it is impossible to be in two places at one time. Hence that supposition is false. 


Another example occurs in debate. One technique of debate is to say, “Suppose for a 
moment that what my opponent says is correct.” Starting from this supposition, the debater 
then deduces one statement after another until finally arriving at a statement that is com- 
pletely ridiculous and unacceptable to the audience. By this means the debater shows the 
opponent’s statement to be false. 

The point of departure for a proof by contradiction is the supposition that the statement 
to be proved is false. The goal is to reason to a contradiction. Thus proof by contradiction 
has the following outline: 


A Method of Proof by Contradiction 
Caution! People often . Suppose the statement to be proved is false. That is, suppose that the negation 
make mistakes when of the statement is true. 


they write a negation for 
a statement they want 
to prove by contradiction. 


. Show that this supposition leads logically to a contradiction. 


. Conclude that the statement to be proved is true. 
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There are no clear-cut rules for when to try a direct proof and when to try a proof by 
contradiction, but there are some general guidelines. Proof by contradiction is indicated 
if you want to show that there is no object with a certain property, or if you want to show 
that a certain object does not have a certain property. The next three examples illustrate 
these situations. 


There Is No Greatest Integer 


Use proof by contradiction to show that there is no greatest integer. 


Solution For this proof, the “certain property” is the property of being the greatest in- 
teger. So prove that there is no object with this property, begin by supposing the negation: 
that there is an object with the property. 


Starting Point: Suppose not. Suppose there is a greatest integer; call it N. This means that 
N = n for every integer n. 


To Show: This supposition leads logically to a contradiction. 


Theorem 4.7.1 


There is no greatest integer. 


Proof: [We take the negation of the theorem and suppose it to be true.] Suppose not. 
That is, suppose there is a greatest integer N. [We must deduce a contradiction.] Then 
N = n for every integer n. Let M = N+ 1. Now M is an integer since it is a sum of 
integers. Also M > WN since M = N+ 1. Thus M is an integer that is greater than 
N. So N is the greatest integer and N is not the greatest integer, which is a contra- 
diction. [This contradiction shows that the supposition is false and, hence, that the 
theorem is true.] 


After a contradiction has been reached, the logic of the argument is always the same: 
“This is a contradiction. Hence the supposition is false and the theorem is true.” Because 
of this, most mathematics texts end proofs by contradiction at the point at which the con- 
tradiction has been obtained. 

The contradiction in the next example is based on the fact that 1/2 is not an integer. 


No Integer Can Be Both Even and Odd 


The fact that no integer is both even and odd can also be deduced from the uniqueness part 
of the quotient-remainder theorem. A full proof of this part of the theorem is outlined in 
exercise 21 of Section 4.8. 


Theorem 4.7.2 


There is no integer that is both even and odd. 


Proof: [We take the negation of the theorem and suppose it to be true.] Suppose not. 
That is, suppose there is at least one integer n that is both even and odd. 
(continued on page 220) 
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[We must deduce a contradiction.] By definition of even, n = 2a for some integer a, 
and by definition of odd, n = 2b + 1 for some integer b. Consequently, 


2a = 2b+1 by equating the two expressions for n, 
and so 
2a—2b=1 
2(a—b)=1 
a—b = 1/2 by algebra. 
Now since a and b are integers, the difference a — b must also be an integer. But 
a—b = 1/2, and 1/2 is not an integer. Thus a — b is an integer and a — b is not an 


integer, which is a contradiction. /This contradiction shows that the supposition is 
false and, hence, that the theorem is true.] 


If you want to prove that a certain object does not have a certain property, you may need 
to assume that it does have the property and deduce a contradiction. The next example il- 
lustrates this strategy. It asks you to prove that a certain object (the sum of a rational and 
an irrational number) does not have the property being rational. 


See ae The Sum ofa Rational Number and an Irrational Number 


Use proof by contradiction to show that the sum of any rational number and any irrational 
number is irrational. 


Solution Begin by supposing the negation of what you are to prove. Be very careful when 
writing down what this means. If you take the negation incorrectly, the entire rest of the 
Caution! The negation proof will be flawed. In this example, the statement to be proved can be written formally as 


of “The sum of any ir- 

rational number and any 
rational number is irratio- s is irrational, then r + s is irrational. 
nal” is NOT “The sum of 
any irrational number and 


Y real numbers r and s, if r is rational and 


From this you can see that the negation is 


any rational number is Ja rational number r and an irrational number 


rational.” . . 
s such that r + s is rational. 


It follows that the starting point and what is to be shown are as follows: 


Starting Point: Suppose not. That is, suppose there is a rational number r and an irrational 
number s such that r + s is rational. 


To Show: This supposition leads to a contradiction. 


To derive a contradiction, you need to understand what you are supposing: that there are 
numbers r and s such that r is rational, s is irrational, and r+ s is rational. By definition of 
rational and irrational, this means there are convenient expressions that can be substituted 
for rand r + s, but all you can say about s is that it cannot be written as a quotient of integers 


for some integers a and b with b + 0, and 4.7.1 


for some integers c and d with d # 0. 4.7.2 
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Note There is no con- If you substitute (4.7.1) into (4.7.2), you obtain 
venient expression that a _¢ 
can be substituted for an b +ts= d 


irrational number. 
Subtracting a/b from both sides gives 


———— _ by rewriting c/d and a/b as equivalent fractions 


bc—ad by the rule for subtracting fractions 
= y g 
T bd with the same denominator. 


Now both bc — ad and bd are integers because products and differences of integers are 
integers, and bd # 0 by the zero product property. Hence s can be expressed as a quotient 
of two integers with a nonzero denominator, and so s is rational, which contradicts the sup- 
position that it is irrational. 

This discussion is summarized in a formal proof. 


Theorem 4.7.3 


The sum of any rational number and any irrational number is irrational. 


Proof: 

[We take the negation of the theorem and suppose it to be true.] Suppose not. That is, 
suppose there is a rational number r and an irrational number s such that r+ s is 
rational. [We must deduce a contradiction.] By definition of rational, r = a/b and 
r+s = c/d for some integers a, b, c, and d with b # 0 and d # 0. By substitution, 


a Cc 
=s =, 
b d 
and so 
C a 
VSRR by subtracting a/b from both sides 
d b 
bc — ad 
=< by the laws of algebra. 
bd 


Now bc —ad and bd are both integers [since a, b, c, and d are integers and since 
products and differences of integers are integers], and bd # 0 [by the zero product 
property]. Hence s is a quotient of the two integers bc — ad and bd with bd # 0. 
Thus, by definition of rational, s is rational, which contradicts the supposition that s 
is irrational. [Hence the supposition is false and the theorem is true.] 


Argument by Contraposition 


A second form of indirect argument, argument by contraposition, is based on the logical 
equivalence between a statement and its contrapositive. To prove a statement by contra- 
position, you take the contrapositive of the statement, prove the contrapositive by a direct 
proof, and conclude that the original statement is true. The underlying reasoning is that 
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since a conditional statement is logically equivalent to its contrapositive, if the contraposi- 
tive is true then the statement must also be true. 


Method of Proof by Contraposition 


1. Express the statement to be proved in the form 
Vx in D, if Pœ) then Qx). 


(This step may be done mentally.) 


2. Rewrite this statement in the contrapositive form 
Vx in D, if Qx is false then P(x) is false. 


(This step may also be done mentally.) 
3. Prove the contrapositive by a direct proof. 


a. Suppose x is a (particular but arbitrarily chosen) element of D such that Q(x) 
is false. 


b. Show that P(x) is false. 


If the Square of an Integer Is Even, Then the Integer Is Even 
Prove that for every integer n, if n’ is even then n is even. 
Solution First form the contrapositive of the statement to be proved. 
Contrapositive: For every integer n, if n is not even then n’ is not even. 


By the quotient-remainder theorem with divisor equal to 2, any integer is even or odd, and, 
by Theorem 4.7.2, no integer is both even and odd. So if an integer is not even, then it is 
odd. Thus the contrapositive can be restated as follows: 


Contrapositive: For every integer n, if n is odd then n’ is odd. 


A straightforward computation is the heart of a direct proof for this statement, as shown 
below. 


Proposition 4.7.4 


For every integer n, if n is even then n is even. 


Proof (by contraposition): Suppose n is any odd integer. [We must show that n 
is odd.] By definition of odd, n = 2k +1 for some integer k. By substitution and 
algebra, 


n = (2k+ 1% = 4k +4k+1 = 22K +2K +1. 


Now 2k + 2k is an integer because products and sums of integers are integers. So 
n =R- (an integer) + 1, and thus, by definition of odd, n? is odd [as was to be shown]. 


We used the word proposition here rather than theorem because although the word theo- 
rem can refer to any statement that has been proved, mathematicians often restrict it to 
especially important statements that have many and varied consequences. Then they use 
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the word proposition to refer to a statement that is somewhat less consequential but none- 
theless worth writing down. We will use Proposition 4.7.4 in Section 4.8 to prove that V2 
is irrational. E 


Relation between Proof by Contradiction 
and Proof by Contraposition 


Observe that any proof by contraposition can be recast in the language of proof by contra- 
diction. In a proof by contraposition, the statement 


Vx in D, if P(x) then Q(x) 
is proved by giving a direct proof of the equivalent statement 
Vx in D, if ~ Q(x) then ~P(a). 


To do this, you suppose you are given an arbitrary element x of D such that ~ Q(x). You 
then show that ~ P(x). This is illustrated in Figure 4.7.1. 


Suppose x is an arbitrary sequence of steps Bp 
Pa a ale 3 
element of D such that XQ(x). (x) 


FIGURE 4.7.1 Proof by Contraposition 


Exactly the same sequence of steps can be used as the heart of a proof by contradiction 
for the given statement. The only thing that changes is the context in which the steps are 
written down. 

To rewrite the proof as a proof by contradiction, you suppose there is an x in D such 
that P(x) and ~ Q(x). You then follow the steps of the proof by contraposition to deduce 
the statement ~P(x). But ~P(x) is a contradiction to the supposition that P(x) and ~ Q(x). 
(Because to contradict a conjunction of two statements, it is only necessary to contradict 
one of them.) This process is illustrated in Figure 4.7.2. 


Suppose Wx in D such same sequence of steps Contradiction: 
ee eee A 
that P(x) and Q(x). P(x) and MP(x) 


FIGURE 4.7.2 Proof by Contradiction 


As an example, here is a proof by contradiction of Proposition 4.7.4. 


Proposition 4.7.4 


For every integer n, if n’ is even then n is even. 


Proof (by contradiction): [We take the negation of the theorem and suppose it to be 
true.] Suppose not. That is, suppose there is an integer n such that n? is even and n is 
not even. [We must deduce a contradiction.] By the quotient-remainder theorem with 
divisor equal to 2, any integer is even or odd. Hence, since n is not even it is odd, and 
thus, by definition of odd, n = 2k + 1 for some integer k. By substitution and algebra, 


n = (2k+ 1% = 4k +4k+1 = 22K +2K +1. 


(continued on page 224) 
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Now 2k + 2k is an integer because products and sums of integers are integers. So 
r= 2-(an integer) + 1, and thus, by definition of odd, n? is odd. Therefore, n is 
both even and odd. This contradicts Theorem 4.7.2, which states that no integer can 
be both even and odd. [This contradiction shows that the supposition is false and, 
hence, that the proposition is true.] 


Note that when you use proof by contraposition, you know exactly what conclusion you 
need to show, namely, the negation of the hypothesis; whereas in proof by contradiction, 
it may be difficult to know what contradiction to head for. On the other hand, when you 
use proof by contradiction, once you have deduced any contradiction whatsoever, you are 
done. The main advantage of contraposition over contradiction is that you avoid having to 
take (possibly incorrectly) the negation of a complicated statement. The disadvantage of 
contraposition as compared with contradiction is that you can use contraposition only for a 
specific class of statements—those that are universal and conditional. The previous discus- 
sion shows that any statement that can be proved by contraposition can be proved by con- 
tradiction. But the converse is not true. Statements such as “V2 is irrational” (discussed in 
the next section) can be proved by contradiction but not by contraposition. 


Proof as a Problem-Solving Tool 


Direct proof, disproof by counterexample, proof by contradiction, and proof by contraposi- 
tion are all tools that may be used to help determine whether statements are true or false. 
Working with examples might have given you a sense that a statement of the form 


For all elements in a domain, if (hypothesis) then (conclusion), 


might be true. To explore further, imagine elements in the domain that satisfy the hypothesis. 
Ask yourself: Must they satisfy the conclusion? If you can see that the answer is “yes” in all 
cases, then the statement is true and your insight will form the basis for a direct proof. If after 
some thought it is not clear that the answer is “yes,” ask yourself whether there are elements of 
the domain that satisfy the hypothesis and not the conclusion. If you are successful in finding 
some, then the statement is false and you have a counterexample. On the other hand, if you are 
not successful in finding such elements, perhaps none exist. Perhaps you can show that assum- 
ing the existence of elements in the domain that satisfy the hypothesis and not the conclusion 
leads logically to a contradiction. If so, then the given statement is true and you have the basis 
for a proof by contradiction. Alternatively, you could imagine elements of the domain for 
which the conclusion is false and ask whether such elements also fail to satisfy the hypothesis. 
If the answer in all cases is “yes,” then you have a basis for a proof by contraposition. 

Solving problems, especially difficult problems, is rarely a straightforward process. At 
any stage of following the guidelines above, you might want to try the method of a previous 
stage again. If, for example, you fail to find a counterexample for a certain statement, your 
experience in trying to find it might help you decide to reattempt a direct argument rather 
than trying an indirect one. Psychologists who have studied problem solving have found 
that the most successful problem solvers are those who are flexible and willing to use a 
variety of approaches without getting stuck in any one of them for very long. Mathemati- 
cians sometimes work for months (or longer) on difficult problems. Don’t be discouraged 
if some problems in this book take you quite a while to solve. 

Learning the skills of proof and disproof is much like learning other skills, such as 
those used in a sport or playing a musical instrument. When you first start out, you may 
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feel bewildered by all the rules, and you may not feel confident as you attempt new things. 
But with practice the rules become internalized and you can use them in conjunction with 
all your other powers—of balance, coordination, judgment, aesthetic sense—to concen- 
trate on winning a competition or performing in public. 

Now that you have worked through several sections of this chapter, return to the idea 
that, above all, a proof or disproof should be a convincing argument. You need to know 
how direct and indirect proofs and counterexamples are structured. But to use this knowl- 
edge effectively, you must use it in conjunction with your imaginative powers, your intu- 
ition, and especially your common sense. 


TEST YOURSELF 


1. To prove a statement by contradiction, you sup- 3. To prove a statement of the form “Vx E D, if P(x) 
pose that _____ and you show that ____. then Q(x)” by contraposition, you suppose that 
and you show that 


2. A proof by contraposition of a statement of the 


form “Vx E D, if P(x) then Q(x)” is a direct proof 
Of. —— — 3 


EXERCISE SET 4.7 


1. Fill in the blanks in the following proof by contra- Proof (by contradiction): Suppose not. That is, 
diction that there is no least positive real number. suppose that there exist @ x and ©) y such 
that x — y is rational. By definition of rational, 
there exist integers a, b, c, and d with b # 0 and 
d# 0so that x = © andx—y = _@_. By 


Proof: Suppose not. That is, suppose that there 
is a least positive real number x. [We must deduce 
_@) ] Consider the number x/2. Since x is a 


positive real number, x/2 is also (b) In addi- substitution, 
tion, we can deduce that x/2 < x by multiplying Oe 
both sides of the inequality 1 < 2 by © and bd 
dividing . Hence x/2 is a positive real num- Adding y and subtracting a on both sides gives 
ber that is less than the least positive real number. 
Thisisa _© [Thus the supposition is false, and y= © by substitution 
so there is no least positive real number.] = ad = be 
2. Is t an irrational number? Explain. oe H 
3. Use proof by contradiction to show that for every — bd D a 


aE pe tee as ne divisible by 3. Now both ad — bc and bd are integers because 


4. Use proof by contradiction to show that for every products and differences of _(f)_ are _(8)_. And 

integer m, 7m + 4 is not divisible by 7. bd + 0 by the _(4)_. Hence y is a ratio of integers 
with a nonzero denominator, and thus y is G) 
by definition of rational. We therefore have both 
that y is irrational and that y is rational, which is a 
5. There is no greatest even integer. contradiction. [Thus the supposition is false and the 
statement to be proved is true.] 


Carefully formulate the negations of each of the state- 
ments in 5-7. Then prove each statement by contradiction. 


. There is no greatest negative real number. 
9. a. When asked to prove that the difference of any 


6 

7. There is no least positive rational number. ae : ; 
irrational number and any rational number is 

8 


. Fill in the blanks for the following proof that the irrational, a student began, “Suppose not. That 
difference of any rational number and any irrational is, suppose the difference of any irrational 
number is irrational. number and any rational number is rational.” 
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10. 


Tl. 


12. 


13. 


CHAPTER4 ELEMENTARY NUMBER THEORY AND METHODS OF PROOF 


What is wrong with beginning the proof in 
this way? (Hint: If needed, review the answer 
to exercise 11 in Section 3.2.) 

b. Prove that the difference of any irrational 
number and any rational number is irrational. 


Let S be the statement: For all positive real num- 
bers r and s, Vr + s # Vr + Vs. Statement S is 
true, but the following “proof” is incorrect. Find 
the mistake. 


“Proof by contradiction: Suppose not. That is, 
suppose that for all positive real numbers r and s, 
Vr+s = Vr+ Vs. This means that the equation 
will be true no matter what positive real numbers 
are substituted for r and s. So let r = 9 and s = 16. 
Then r and s are positive real numbers and 


Vr+s = V9+16 =V25 =5 
whereas 


VrtVs =V94+V16 =34+4=7. 


Since 5 # 7, we have that Vr+s + Vrt Vs, 
which contradicts the supposition that 

Vr+s = Vr+ Vs. This contradiction shows 
that the supposition is false, and hence statement 
S is true.” 


Let T be the statement: The sum of any two ra- 
tional numbers is rational. Then T is true, but the 
following “proof” is incorrect. Find the mistake. 


“Proof by contradiction: Suppose not. That is, 
suppose that the sum of any two rational numbers 
is not rational. This means that no matter what 
two rational numbers are chosen their sum is not 
rational. Now both 1 and 3 are rational because 

1 = 1/1 and 3 = 3/1, and so both are ratios of 
integers with a nonzero denominator. Hence, by a 
supposition, the sum of 1 and 3, which is 4, is not 
rational. But 4 is rational because 4 = 4/1, which 
is a ratio of integers with a nonzero denominator. 
Hence 4 is both rational and not rational, which is 
a contradiction. This contradiction shows that the 
supposition is false, and hence statement T is true.” 


Let R be the statement: The square root of any 
irrational number is irrational. 

a. Write a negation for R. 

b. Prove R by contradiction. 


Let S be the statement: The product of any ir- 
rational number and any nonzero rational number 
is irrational. 
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a. Write a negation for S. 
b. Prove S by contradiction. 


Let T be the statement: For every integer a, if a 
mod 6 = 3, then a mod 3 # 2. 
a. Write a negation for T. 


Hb. Prove T by contradiction. 


H 15. 


Do there exist integers a, b, and c such that a, b, 
and c are all odd and a? + b? = c°? Prove your 
answer. 


Prove each statement in 16-19 by contradiction. 


16. 


20. 


21. 


22. 


For all odd integers a and b, b-a #4. 
(Hint: — = (b + a)(b — a) and the only way to 
factor 4 is either 4 = 2:2 or 4 = 4-1.) 


. For all prime numbers a, b, and c, +P Ee. 


. Ifa and b are rational numbers, b # 0, and r is an 


irrational number, then a + br is irrational. 


. For any integer n, n? — 2 is not divisible by 4. 


Fill in the blanks in the following proof by 
contraposition that for every integer n, if 5/n 
then 5/n. 


Proof (by contraposition): [The contrapositive is: 
For every integer n, if 5|n then 5|n”.] Suppose n is 


any integer such that —® _. [We must show 


that ©) _] By definition of divisibility, 
n= ©) for some integer k. By substitution, 
n? =_(®_ = 5(5k°). But 5k’ is an integer be- 


cause it is a product of integers. Hence n = 
5- (an integer), and so (€) fas was to be 
shown]. 


Consider the statement “For every integer n, if n 

is odd then n is odd.” 

a. Write what you would suppose and what you 
would need to show to prove this statement by 
contradiction. 

b. Write what you would suppose and what you 
would need to show to prove this statement by 
contraposition. 


Consider the statement “For every real number r, 

if r is irrational then r is irrational.” 

a. Write what you would suppose and what you 
would need to show to prove this statement by 
contradiction. 

b. Write what you would suppose and what you 
would need to show to prove this statement by 
contraposition. 
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Prove each of the statements in 23-25 in two ways: 
(a) by contraposition and (b) by contradiction. 


23. The negative of any irrational number is irrational. 


24. The reciprocal of any irrational number is irrational. 
(The reciprocal of a nonzero real number x is 1/x.) 


H 25. For every integer n, if n’ is odd then n is odd. 


Use any method to prove the statements in 26-29. 
26. For all integers a, b, and c, if abc then afb. 


27. For all positive real numbers r and s, 


Vrts#Vrt+Vs. 


H 28. For all integers a, b, and c, if a | b and afi c, then 
al(b +c). 


H 29. For all integers m and n, if m + n is even then m 
and n are both even or m and n are both odd. 


30. a. Letn = 53. Find an approximate value for Vn 
and write a list of all the prime numbers less 
than or equal to Vn. Is the following statement 
true or false? When n = 53, n is not divisible 
by any prime number less than or equal to Vn. 

b. Suppose n is a fixed integer. Let S be the state- 
ment, “n is not divisible by any prime number 
less than or equal to Vn.” The following state- 
ment is equivalent to S: 


V prime number p, if p is less than or equal 
to Vn then n is not divisible by p. 


Which of the following are negations for S$? 


(i) Ja prime number p such that p = Vn 
and n is divisible by p. 
(ii) n is divisible by every prime number less 
than or equal to Vn. 
(iii) 4a prime number p such that p is a mul- 
tiple of n and p is less than or equal to Vn. 
(iv) nis divisible by some prime number that 
is less than or equal to Vn. 
(v) V prime number p, if p is less than or 
equal to Vn, then n is divisible by p. 


31. a. Prove by contraposition: For all positive 
integers n, r, and s, if rs =n, then r = Vn or 
s £ Vn. 
(Hint: Use Theorem T27 in Appendix A.) 


b. Prove: For each integer n > 1, if n is not prime 
then there exists a prime number p such that 


p = Vn andn is divisible by p. (Hint: Use the 
results of part (a), Theorems 4.4.1, 4.4.3, and 
4.4.4, and the transitive property of order.) 

c. State the contrapositive of the result of part (b). 
The results of exercise 31 provide a way to test 
whether an integer is prime. 


Test for Primality 


Given an integer n > 1, to test whether n is prime check 
to see if it is divisible by a prime number less than or 
equal to its square root. If it is not divisible by any of 
these numbers, then it is prime. 


32. 


33. 


H* 35. 


H* 36. 


Use the test for primality to determine whether the 
following numbers are prime or not. 
a. 667 b. 557 c. 527 d. 613 


The sieve of Eratosthenes, named after its inventor, 
the Greek scholar Eratosthenes (276—194 B.C.E.), 
provides a way to find all prime numbers less han 
or equal to some fixed number n. To construct it, 
write out all the integers from 2 to n. Cross out 
all multiples of 2 except 2 itself, then all multiples 
of 3 except 3 itself, then all multiples of 5 except 
5 itself, and so forth. Continue crossing out the 
multiples of each successive prime number up to 
Vn. The numbers that are not crossed out are all 
the prime numbers from 2 to n. Here is a sieve of 
Eratosthenes that includes the numbers from 2 to 
27. The multiples of 2 are crossed out with a /, 
the multiples of 3 with a \, and the multiples of 

5 with a —. 


23458 78 8M11K 1344 
$5 16 17 WW 19 26 N 2 23 24 25 2 2 


Use the sieve of Eratosthenes to find all prime 
numbers less than 100. 


. Use the test for primality and the result of exercise 


33 to determine whether the following numbers 


are prime. 
a. 9,269 b. 9,103 c. 8,623 d. 7,917 


Use proof by contradiction to show that every 
integer greater than 11 is a sum of two composite 
numbers. 


For all odd integers a, b, and c, if z is a solution of 
ax’ + bx +c = 0 then z is irrational. (In the proof, 
use the properties of even and odd integers that 
are listed in Example 4.3.3.) 
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ANSWERS FOR TEST YOURSELF 


1. the statement is false; this supposition leads to a contradiction ~Q(x) then ~P(x) 3. xis any [particular but arbitrarily 


2. the contrapositive of the statement, namely, Vx € D, if chosen] element of D for which Q(x) is false; P(x) is false 


EEJ Indirect Argument: Two Famous Theorems 


He is unworthy of the name of man who does not know that the diagonal of a square 
is incommensurable with its side. —Plato (ca. 428-347 B.C.E.) 


This section contains proofs of two of the most famous theorems in mathematics: that V2 
is irrational and that there are infinitely many prime numbers. Both proofs are examples 
of indirect arguments and were well known more than 2,000 years ago, but they remain 
exemplary models of mathematical argument to this day. 


The Irrationality of \/2 


When mathematics flourished at the time of the ancient Greeks, mathematicians believed 
that given any two line segments, say A: ______ and B: ______, a certain unit of length 
could be found so that segment A was exactly m units long and segment B was exactly n 
units long, for some integers m and n. (The segments were said to be commensurable with 
respect to this special unit of length.) Then the ratio of the lengths of A and B would be in 


the same proportion as the ratio of the integers m and n. Symbolically: 
lengthA m 


lengthB on’ 


Now it is easy to find a line segment of length V2; just take the diagonal of a unit square: 


1 


By the Pythagorean theorem, c = 17+ 1? = 2, and soc = V2. If the belief of the ancient 
Greeks were correct, there would be integers m and n such that 


length (diagonal) m 
length (side) n’ 


And this would imply that 


oai a 

But then V2 would be a ratio of two integers, or, in other words, \/2 would be rational. 
In the fourth or fifth century B.C.E., the followers of the Greek mathematician and phi- 
losopher Pythagoras discovered that V2 was not rational. This discovery was very upset- 


ting to them, for it undermined their deep, quasi-religious belief in the power of whole 
numbers to describe phenomena. 
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Euclid 
(fl. 300 B.c.E.) 


Note Strictly speaking, 
being able to assume that 
m and n have no common 
factors is a consequence 
of the “well-ordering 
principle for the inte- 
gers,” which is discussed 
in Section 5.4. 


Bettmann/Getty Images 
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The following proof of the irrationality of V2 was known to Aristotle and is similar to 
that in the tenth book of Euclid’s Elements of Geometry. The Greek mathematician Euclid 
is best known as a geometer, and knowledge of the geometry in the first six books of his 
Elements was considered an essential part of a liberal education for more than 2,000 years. 
However, books 7-10 of his Elements contain much that we would now call number theory. 

The proof begins by supposing the negation: V2 is rational. This means that there 
exist integers m and n such that V2 = m/n. Now if m and n have any common factors, 
these may be factored out to obtain a new fraction, equal to m/n, in which the numerator 
and denominator have no common factors. (For example, 18/12 = (6-3)/(6-2) = 3/2, 
which is a fraction whose numerator and denominator have no common factors.) Thus, 
without loss of generality, we may assume that m and n had no common factors in the first 
place. We will then derive the contradiction that m and n do have a common factor of 2. 
The argument makes use of Proposition 4.7.4: If the square of an integer is even, then that 
integer is even. 


Theorem 4.8.1 Irrationality of V2 
V2 is irrational. 


Proof (by contradiction): [We take the negation and suppose it to be true.] Suppose 
not. That is, suppose V2 is rational. Then there are integers m and n with no com- 
mon factors such that 


v2=" 4.8.1 


n 


[by dividing m and n by any common factors if necessary]. [We must derive a contradic- 
tion.] Squaring both sides of equation (4.8.1) gives 


Or, equivalently, 
m = w. 4.8.2 


Note that equation (4.8.2) implies that m? is even (by definition of even). It follows 
that m is even (by Proposition 4.7.4). We file this fact away for future reference and 
also deduce (by definition of even) that 


m= 2k for some integer k. 4.8.3 
Substituting equation (4.8.3) into equation (4.8.2), we see that 
m = (2k) = 4k = 20°. 
Dividing both sides of the right-most equation by 2 gives 
R=. 


Consequently, n’ is even, and so n is even (by Proposition 4.7.4). But we also know 
that m is even. /This is the fact we filed away.] Hence both m and n have a common 
factor of 2. But this contradicts the supposition that m and n have no common fac- 
tors. [Hence the supposition is false and so the theorem is true.] 
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Now that you have seen the proof that V2 is irrational, you can use the irrationality of 
V2 to derive the irrationality of certain other real numbers. 


Irrationality of 1+ 3V2 


Prove by contradiction that 1 + 3V2 is irrational. 


Solution The essence of the argument is the observation that if 1 + 3V2 could be writ- 
ten as a ratio of integers, then so could V2. But by Theorem 4.8.1, we know that to be 
impossible. 


Proposition 4.8.2 
1+3V2 is irrational. 


Proof: Suppose not. Suppose 1 + 3V2 is rational. [We must derive a contradiction. ] 
Then by definition of rational, 


a : ; 
14+3V2= A for some integers a and b with b # 0. 


It follows that 


a 
3V2 = 5 -1 by subtracting 1 from both sides 

a b 
Ee a by substitution 

b b 
_a— b by the rule for subtracting fractions 
E b with a common denominator. 

Hence 
a—b 


V2 = 


3b by dividing both sides by 3. 

But a — b and 3b are integers (since a and b are integers and differences and products 
of integers are integers), and 3b # 0 by the zero product property. Hence V2 is a 
quotient of the two integers a — b and 3b with 3b # 0, and so V2 is rational (by defi- 
nition of rational). This contradicts the fact that V2 is irrational. [The contradiction 
shows that the supposition is false.] Hence 1 + 3V2 is irrational. 


Are There Infinitely Many Prime Numbers? 


You know that a prime number is a positive integer that cannot be factored as a product 
of two smaller positive integers. Is the set of all such numbers infinite, or is there a larg- 
est prime number? The answer was known to Euclid, and a proof that the set of all prime 
numbers is infinite appears in Book 9 of his Elements of Geometry. 

Euclid’s proof requires one additional fact we have not yet established: If a prime num- 
ber divides an integer, then it does not divide the next successive integer. 
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Proposition 4.8.3 
For any integer a and any prime number p, if p|a then p/(a+ 1). 


Proof (by contradiction): Suppose not. That is, suppose there exist an integer a and 
a prime number p such that p|a and p|(a+ 1). Then, by definition of divisibility, 
there exist integers r and s such that a = pr anda + 1 = ps. It follows that 


1 = (a+ 1)—a = ps — pr = p(s —-r), 


and so (since s — r is an integer) p| 1. But, by Theorem 4.4.2, the only integer divisors 
of l are 1 and —1, and p > 1 because p is prime. Thus p = 1 and p > 1, whichis a 
contradiction. [Hence the supposition is false, and the proposition is true.] 


The idea of Euclid’s proof is this: Suppose the set of prime numbers were finite. Then 
you could take the product of all the prime numbers and add 1. By Theorem 4.4.4 this 
number must be divisible by some prime number. But by Proposition 4.8.3, this number is 
not divisible by any of the prime numbers in the set. Hence there must be a prime number 
that is not in the set of all prime numbers, which is impossible. 

The following formal proof fills in the details of this outline. 


Theorem 4.8.4 Infinitude of the Primes 


The set of prime numbers is infinite. 


Proof (by contradiction): Suppose not. That is, suppose the set of prime numbers is 
finite. [We must deduce a contradiction.] Then some prime number p is the largest of 
all the prime numbers, and hence we can list the prime numbers in ascending order: 


25, By Dy Uy. Mess Di 
Let N be the product of all the prime numbers plus 1: 
N= (2:3:5-7-11°-:p) +1 


Then N > 1, and so, by Theorem 4.4.4, N is divisible by some prime number q. Be- 
cause q is prime, q must equal one of the prime numbers 2, 3, 5, 7, 11,..., p. Thus, 
by definition of divisibility, q divides 2-3-5-7-11---p, and so, by Proposition 4.8.3, 
q does not divide (2-:3-5-7-11-+*:p)+ 1, which equals N. Hence N is divisible by q 
and N is not divisible by g, and we have reached a contradiction. [Therefore, the sup- 
position is false and the theorem is true.] 


The proof of Theorem 4.8.4 shows that if you form the product of all the prime numbers 
up to a certain point and add one, the result, N, is divisible by a prime not on the list. The 
proof does not show that N is itself prime. In exercise 30 at the end of the section you will 
find a value for N that is not prime, although it is divisible by a prime. 


When to Use Indirect Proof 


The examples in this section and Section 4.7 have not provided a definitive answer to 
the question of when to prove a statement directly and when to prove it indirectly. Many 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


232 CHAPTER4 ELEMENTARY NUMBER THEORY AND METHODS OF PROOF 


theorems can be proved either way. Usually, however, when both types of proof are pos- 
sible, indirect proof is clumsier than direct proof. In the absence of obvious clues suggest- 
ing indirect argument, try first to prove a statement directly. Then, if that does not succeed, 
look for a counterexample. If the search for a counterexample is unsuccessful, look for a 
proof by contradiction or contraposition. 


Open Questions in Number Theory 


The Royal Society 


b ‘d In this section we proved that there are infinitely many prime numbers. There is no known 
Ben Joseph Green formula for obtaining primes, but a few formulas have been found to be more successful at 
(born 1977) producing them than other formulas. One such is due to Marin Mersenne, a French monk 
who lived from 1588 to 1648. Mersenne primes have the form 2” — 1, where p is prime. Not 
all numbers of this form are prime, but because Mersenne primes are easier to test for primal- 
ity than are other numbers, most of the largest known prime numbers are Mersenne primes. 

An interesting question is whether there are infinitely many Mersenne primes. As of the 
date of publication of this book, the answer is not known, but new mathematical discoveries 
are being made every day and by the time you read this someone may have discovered the 
answer. Another formula that seems to produce a relatively large number of prime numbers 
is due to Fermat. Fermat primes are prime numbers of the form 27" + 1, where n is a positive 
integer. Are there infinitely many Fermat primes? Again, as of now, no one knows. Similarly 
unknown are whether there are infinitely many primes of the form n’ +1, where nisa posi- 
tive integer, and whether there is always a prime number between integers n and (n+ 1)”. 
Terence Chi-Shen Tao Another famous open question involving primes is the twin primes conjecture, which 
(born 1975) states that there are infinitely many pairs of prime numbers of the form p and p+ 2. As 
with other well-known problems in number theory, this conjecture has withstood computer 
testing up to extremely large numbers, and some progress has been made toward a proof. In 
2004, Ben Green and Terence Tao showed that for any integer m > 1, there is a sequence of 
m equally spaced integers all of which are prime. In 2013 Yitang Zhang proved that there 
are infinitely many pairs of prime numbers that differ by no more than 70,000,000. This 
is a lot more than 2, but Zhang’s was the first discovery of any fixed upper bound between 
infinitely many pairs of prime numbers. In 2014 a group of mathematicians working col- 
laboratively showed that the bound could be reduced from 70,000,000 to 246; still more 
than 2 but giving hope that a proof of the twin primes conjecture may be attainable. 

Related to the twin primes conjecture is a conjecture made by Sophie Germain, a 
French mathematician born in 1776, who made significant progress toward a proof of 
Fermat’s Last Theorem. Germain conjectured that there are infinitely many prime number 
Yitang Zhang pairs of the form p and 2p + 1. Initial values of p with this property are 2, 3, 5, 11, 23, 29, 
(Dorn 1333) 41, and 53, and computer testing has verified the conjecture for many additional values. In 
fact, as of the writing of this book, the largest prime p for which 2p + 1 is also known to be 
prime is 2618163402417 x 2'7°° — 1, This is a number with 388,341 decimal digits! But 
compared with infinity, any number, no matter how large, is less than a drop in the bucket. 

In 1844, the Belgian mathematician Eugéne Catalan conjectured that the only solu- 
tions to the equation x" — y” = 1, where x, y, n, and m are all integers greater than 1, is 
3? —2° = 1. This conjecture was finally proved by Preda Milhailescu in 2002. 

In 1993, while trying to prove Fermat’s Last Theorem, an amateur number theorist, 
Andrew Beal, became intrigued by the equation x” + y" = z‘, where no two of x, y, OF z 
have any common factor other than +1. When diligent effort, first by hand and then by 
computer, failed to reveal any solutions, Beal conjectured that no solutions exist. His con- 

; . . jecture has become known as the Beal conjecture, and he has offered a prize of $100,000 
Marie-Sophie Germain x 5 y 
(1776-1831) to anyone who can either prove or disprove it. 


Philippe Desmazes/AFP/Getty Images 
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TEST YOURSELF 


1. 


EXERCISE SET 4.8 


A calculator display shows that V2 = 1.414213562. 


1. 


4.8 
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These are just a few of a large number of open questions in number theory. Many people 
believe that mathematics is a fixed subject that changes very little from one century to the 
next. In fact, more mathematical questions are being raised and more results are being 
discovered now than ever before in history. 


The ancient Greeks discovered that in a right 
triangle where both legs have length 1, the ratio of 
the length of the hypotenuse to the length of one 
of the legs is not equal to a ratio of 


. One way to prove that V2 is an irrational number 


is to assume that V2 = m/n for some integers m 
and n that have no common factor greater than 1, 
use the lemma that says that if the square of an 


integer is even then 
that m and n 


, and eventually show 


One way to prove that there are infinitely 
many prime numbers is to assume that there is 
a largest prime number p, construct the number 

, and then show that this number has to 
be divisible by a prime number that is greater 
than 


Because 1.414213562 = a this suggests 


that V2 is a rational number, which contradicts 
Theorem 4.8.1. Explain the discrepancy. 


. Example 4.3.1(h) illustrates a technique for show- 


ing that any repeating decimal number is rational. 
A calculator display shows the result of a certain 
calculation as 40.72727272727. Can you be sure 
that the result of the calculation is a rational num- 
ber? Explain. 


. Could there be a rational number whose first tril- 


lion digits are the same as the first trillion digits 
of V2? Explain. 


A calculator display shows that the result of 
a certain calculation is 0.2. Can you be sure 
that the result of the calculation is a rational 
number? 


Let S be the statement: The cube root of every 
irrational number is irrational. This statement is 
true, but the following “proof” is incorrect. Ex- 
plain the mistake. 


“Proof (by contradiction): Suppose not. 
Suppose the cube root of every irrational number 
is rational. But 2V2 is irrational because it is 

a product of a rational and an irrational number, 
and the cube root of 2V2 is V2, which is irratio- 
nal. This is a contradiction, and hence it is not 
true that the cube root of every irrational number 
is rational. Thus the statement to be proved 

is true.” 


Determine which statements in 6-16 are true and which 
are false. Prove those that are true and disprove those 
that are false. 


6. 
7. 
8. 
9. 
10. 
1. 


12. 


13. 


14. 


15. 
H 16. 


17. 


18. 


6— 7V2 is irrational. 

3V2 —7 is irrational. 

V4 is irrational. 

\V2/6 is irrational. 

The sum of any two irrational numbers is irrational. 


The difference of any two irrational numbers is 
irrational. 


The positive square root of a positive irrational 
number is irrational. 


If r is any rational number and s is any irrational 
number, then r/s is irrational. 


The sum of any two positive irrational numbers is 
irrational. 


The product of two irrational numbers is irrational. 


If an integer greater than | is a perfect square, 
then its cube root is irrational. 


Consider the following sentence: If x is rational 
then Vx is irrational. Is this sentence always true, 
sometimes true and sometimes false, or always 
false? Justify your answer. 


a. Prove that for every integer a, if a’ is even then 
a is even. 
b. Prove that W/2 is irrational. 
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20. 


H 21. 


H 22. 
H 23. 
24. 


25. 


H 26. 


* 27. 


H 28. 


H 29. 


30. 
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. a. Use proof by contradiction to show that for any 


integer n, it is impossible for n to equal both 
3q, + rı and 3q, + rz, where qj, qo, F1, and rz are 
integers, 0 =r, < 3,0 S r, <3,andr, F r2. 

b. Use proof by contradiction, the quotient- 
remainder theorem, division into cases, and the 
result of part (a) to prove that for every integer 
n, if n? is divisible by 3 then n is divisible by 3. 

c. Prove that V3 is irrational. 


Give an example to show that if d is not prime and 
n? is divisible by d, then n need not be divisible by d. 


The quotient-remainder theorem says not only 
that there exist quotients and remainders but also 
that the quotient and remainder of a division are 
unique. Prove the uniqueness. That is, prove that 
if a and d are integers with d > 0 and if q), F1, q2, 
and r, are integers such that 


a=dq,;+r, where 0S1r,<d 
and 
a= dq, +r, where 0517, <d, 
then 
a= and =n. 


Prove that V5 is irrational. 
Prove that for any integer a, 9/ (a — 3). 


An alternative proof of the irrationality of V2 
counts the number of 2’s on the two sides of the 
equation 2n” = m and uses the unique factoriza- 
tion of integers theorem to deduce a contradiction. 
Write a proof that uses this approach. 


Use the proof technique illustrated in exercise 24 
to prove that if is any positive integer that is not 
a perfect square, then Vn is irrational. 


Prove that V2 + V3 is irrational. 


Prove that log;(2) is irrational. (Hint: Use the 
unique factorization of integers theorem.) 


Let N = 2-3-5-7+ 1. What remainder is obtained 
when N is divided by 2? 3? 5? 7? Justify your answer. 


Suppose a is an integer and p is a prime number 
such that p|a and p|(a +3). What can you deduce 
about p? Why? 


Let pı, P2, p3,-.- be a list of all prime numbers in 
ascending order. Here is a table of the first six: 
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Pi P2 P3 P4 
2 3 5 7 


Ds 
11 


P6 
13 


Ha. Let N, = p, M = Ppi 'Po, N3 = Pit Px P3- 


No = P1‘P2"P3" Pa" Ps" Po. Calculate N, Na, N3, 
Ng, Ns, and Ne. 

b. For each i = 1, 2, 3, 4, 5, 6, find whether N; is 
itself prime or just has a prime factor less than 
itself. (Hint: Use the test for primality from 
exercise 31 in Section 4.7 to determine your 
answers.) 


For exercises 31 and 32, use the fact that for every integer n, 


31. 


H* 32. 


H* 33. 


n! = n(n—1):::3:2-1. 


An alternative proof of the infinitude of the prime 
numbers begins as follows: 


Proof: Suppose there are only finitely many 
prime numbers. Then one is the largest. Call it 
p. Let M = p! +1. We will show that there is a 
prime number q such that q > p. Complete this 
proof. 


Prove that for every integer n, if n > 2 then there 
is a prime number p such that n < p < n!. 


Prove that if p4, p2, ..., and p, are distinct 
prime numbers with p; = 2 and n > 1, then 

Pi P2*** Pat 1 can be written in the form 4k + 3 
for some integer k. 


H 34. a. Fermat’s last theorem says that for every 


integer n > 2, the equation x" + y” = z” has 
no positive integer solution (solution for which 
x, y, and z are positive integers). Prove the 
following: If for every prime number p > 2, 
x? +y” = Z has no positive integer solution, 
then for any integer n > 2 that is not a power of 
2, x" + y” = z” has no positive integer solution. 
b. Fermat proved that there are no integers x, y, 
and z such that x“ + y = z“. Use this result to 
remove the restriction in part (a) that n not be 
a power of 2. That is, prove that if n is a power 
of 2 and n > 4, then x" + y” = z” has no posi- 
tive integer solution. 


For exercises 35-38 note that to show there is a unique 
object with a certain property, show that (1) there is an 
object with the property and (2) if objects A and B have 
the property, then A = B. 


35. 


Prove that there exists a unique prime number 
of the form n? — 1, where n is an integer that is 
greater than or equal to 2. 
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36. Prove that there exists a unique prime number number r. (Such a number is called an additive 
of the form n? + 2n—3, where n is a positive identity.) 
integer. . i 
2 38. Prove that there is at most one real number b with 
37. Prove that there is at most one real number a the property that br = r for every real number r. 
with the property that a +r = r for every real (Such a number is called a multiplicative identity.) 


ANSWERS FOR TEST YOURSELF 


1. two integers 2. the integer is even; have a common factor greater than 1 3.2-3-5-7-1l-::p+1;p 


EEJ Application: The Handshake Theorem 


The most important things in those first few seconds [of a job interview] are, basically, a 
firm handshake and a smile, good eye contact and really paying attention. —Pat Schaumann 


Consider the following curious problem: At a meeting attended by a group of people, four 
people shook hands with one other person, six people shook hands with two other people, 
two people shook hands with three other people, and the rest shook hands with four other 
people. No two people shook hands with each other more than once, and a total of 23 hand- 
shakes occurred. How many people attended the meeting? 

It turns out that a theorem about graphs helps to answer this question. Recall from 
Section 1.4 that if G is a graph and v is a vertex of G, then the degree of v, denoted deg(v), 
is the number of edges incident on v, with an edge that is a loop counted twice. 


Definition 


The total degree of a graph is the sum of the degrees of all the vertices of the graph. 


The Total Degree of a Graph 
Find the degree of each vertex of the graph G shown below, and then find the total degree 


of the graph. 
vi 
L a 
U4 
Solution 
deg(v;) = 1; deg(v2) = 4; deg(v3) = 2; deg(v4) = 3 
total degree of G = 1+4+2+3 = 10 E 


Note that in Example 4.9.1 the total degree of G, which is 10, equals twice the number 
of edges of G, which is 5. In fact, the total degree of any graph is twice the number of its 
edges. One way to see why this is so, at least for a graph without loops, is to imagine that 
the vertices of the graph represent people at a meeting and that each edge of the graph rep- 
resents a handshake between two people. Each person participates in a certain number of 
handshakes—perhaps many, perhaps none—but if the numbers for each person are added 
together, a total is obtained, and, because each handshake is experienced by two different 
people, the total will equal twice the number of handshakes. 
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The handshake analogy is such an attractive way to understand the graph theory result 
that the theorem describing it is known as the handshake theorem. As the proof demon- 
strates, the conclusion is true even for a graph that contains loops. 


Theorem 4.9.1 The Handshake Theorem 


If G is any graph, then the sum of the degrees of all the vertices of G equals twice 
the number of edges of G. Specifically, if the vertices of G are v4, v2,..., Vn, Where 
n is a nonnegative integer, then 
the total degree of G = deg(v,) + deg(v2) +::: + deg(v,) 
= 2-(the number of edges of G). 


Proof: Let G be a particular but arbitrarily chosen graph, and suppose that G has n 
vertices v4, V2, . . - , V, and m edges, where n is a positive integer and m is a nonnega- 
tive integer. We claim that each edge of G contributes 2 to the total degree of G. For 
suppose e is an arbitrarily chosen edge with endpoints v; and v, This edge contrib- 
utes 1 to the degree of v; and 1 to the degree of v;. As shown below, this is true even 
if i = j, because an edge that is a loop is counted twice in computing the degree of 
the vertex on which it is incident. 


iM i=j 
Therefore, e contributes 2 to the total degree of G. Since e was arbitrarily chosen, 


this shows that each edge of G contributes 2 to the total degree of G. Thus 


the total degree of G = 2- (the number of edges of G). 


The following corollary follows immediately from the handshake theorem. 


Corollary 4.9.2 


The total degree of a graph is even. 


Proof: By Theorem 4.9.1 the total degree of G equals 2 times the number of edges 
of G, which is an integer, and so the total degree of G is even. 


Determining Whether Certain Graphs Exist 


Draw a graph with the specified properties or show that no such graph exists. 
a. A graph with four vertices of degrees 1, 1, 2, and 3 
b. A graph with four vertices of degree 1, 1, 3, and 3 


Solution 


a. No such graph exists. By Corollary 4.9.2 the total degree of a graph is even. But 
a graph with four vertices of degrees 1, 1, 2, and 3 would have a total degree of 
1+1+2+3 = 7, which is odd. 
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Example 4.9.3 


Note In Section 5.4 you 
will have an opportunity 
to prove formally that 

any finite sum of even 
integers is even and that if 
a sum of n odd integers is 
even then n is even. 


4.9 APPLICATION: THE HANDSHAKE THEOREM 237 


b. Let G be any of the graphs shown below. 


a b ae———eb a b a b 
d é 
4 O—O Yo L. 
d c d c 
In each case, no matter how the edges are labeled, deg(a) = 1, deg(b) = 1, deg(c) = 3, 
and deg(d) = 3. a 


Application to an Acquaintance Graph 


Is it possible in a group of nine people for each to be friends with exactly five others in the 
group? 


Solution The answer is no. Imagine constructing an “acquaintance graph” in which each 
of the nine people is represented by a vertex and two vertices are joined by an edge if, and 
only if, the people they represent are friends. Suppose each of the people is friends with 
exactly five others. Then the degree of each of the nine vertices of the graph would be 
five, and so the total degree of the graph would be 45. But this contradicts Corollary 4.9.2, 
which says that the total degree of the graph is even. The contradiction shows that the 
supposition is false. Hence it is impossible for each person in a group of nine people to be 
friends with exactly five others in the group. 

The following proposition can be deduced from Corollary 4.9.2 using properties of even 
and odd integers. E 


Proposition 4.9.3 


In any graph there is an even number of vertices of odd degree. 


Proof: Suppose G is any graph, and suppose G has n vertices of odd degree and m 
vertices of even degree, where n is a positive integer and m is a nonnegative integer. 
[We must show that n is even.] Let E be the sum of the degrees of all the vertices of 
even degree, O the sum of the degrees of all the vertices of odd degree, and T the 
total degree of G. If u4, u2,..., Um are the vertices of even degree and v}, V2, ..., V, 
are the vertices of odd degree, then 


E= deg(u;) + deg(u2) mieng deg(u,n), 
O = deg(v,) + deg(v2)+:::+ deg(v,), and 
T = deg(u,) +--+ deg(u,,) + deg(v,) +:+:+ deg(v,) = E+ 0. 


Now T, the total degree of G, is an even integer by Corollary 4.9.2. Also E is even 
since either E is zero, which is even, or E is a sum of even numbers. Now since 


T=E+0, 
then 
O=T-—E. 
Hence O is a difference of two even integers, and so O is even. 
By assumption, deg(v;) is odd for every integer i = 1, 2,..., n. Thus O, an even 
integer, is a sum of the n odd integers deg(v,), deg(v2),..., deg(v,,). But if a sum of n 


odd integers is even, then n is even. Therefore, n is even /as was to be shown]. 
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Applying the Fact That the Number of Vertices with Odd Degree Is Even 
Is there a graph with ten vertices of degrees 1, 1, 2, 2, 2, 3, 4, 4, 4, and 6? 


Solution No. Such a graph would have three vertices of odd degree, which is impossible 
by Proposition 4.9.3. 

Note that this same result could have been deduced directly from Corollary 4.9.2 by 
computing the total degree (1 +1+2+2+2+3+4+4+4+6 = 29) and noting that 
it is odd. However, Proposition 4.9.3 gives the result without the need to perform this 
addition. a 


We can now show how to answer the question posed at the beginning of this section. 


How Many People Attended the Meeting? 


At a meeting attended by a group of people, four people shook hands with one other per- 
son, six people shook hands with two other people, two people shook hands with three 
other people, and the rest shook hands with four other people. No two people shook hands 
with each other more than once, and a total of 23 handshakes occurred. How many people 
attended the meeting? 


Solution Define a graph G by letting each vertex represent a person at the meeting and let- 
ting each edge represent one handshake between two people. Let x be the number of people 
who shook hands with four other people. Then 


the total degree of the graph = 4:1 +6:2+2-:3+x:4 = 22 + 4x 


because 4 people shook hands with 1 other person, 6 people shook hands with 2 other 
people, 2 people shook hands with 3 other people, and x people shook hands with 4 other 
people. In addition, since a total of 23 handshakes occurred, the graph has 23 edges. By 
the handshake theorem (Theorem 4.9.1), the total degree is twice the number of edges. 
Hence 


the total degree of the graph = 2:23 = 46. 


It follows that 


22 + 4x = 46. 
Thus 
4x = 46 — 22 = 24, 
and so 
er 
4 


In other words, six people at the meeting shook hands with four other people. Now the 
total number of people at the meeting is the sum of the number who shook hands with one 
other person, plus the number who shook hands with two other people, plus the number 
who shook hands with three other people, plus the number who shook hands with four 
other people. Therefore, 


the number of people at the meeting = 4+6+2+6= 18. E 
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Special Graphs 


One important class of graphs consists of those that do not have any loops or parallel edges. 
Such graphs are called simple. In a simple graph, no two edges share the same set of end- 
points, so specifying two endpoints is sufficient to determine an edge. 


Definition and Notation 


A simple graph is a graph that does not have any loops or parallel edges. In a simple 
graph, an edge with endpoints v and w is denoted {v, w}. 


Some Simple Graphs 


Draw all simple graphs with the four vertices {u, v, w, x} and two edges, one of which is 
{u, v}. 


Solution 


Since one edge of the graph is specified to be {u, v}, the possibilities for the other edge are 
{u, w}, {u, x}, {v, w}, {v, x}, and {w, x}. The resulting graphs are shown below. 


i i Fa a Uu e————__®U 
w ex we x w ex we x wW e————e X 
Seu eee Determining Whether Certain Simple Graphs Exist 

Draw a graph with the specified properties or show that no such graph exists. 


a. A simple graph with six vertices and sixteen edges. 


b. A simple graph with four vertices of degrees 1, 1, 3, and 3. 


Solution 
a. There is no simple graph with six vertices and sixteen edges. 
Proof (by contradiction): Suppose there is a graph G with six vertices and sixteen 


edges. According to the handshake theorem (Theorem 4.9.1), since G has sixteen 
edges its total degree is 2:16 = 32, and because G has six vertices, 


the total degree 32 5 1 
"3 


the average degree of each vertex = — = 
the number of vertices 6 


The only way this can happen is for at least one vertex—say v, to have degree greater 
than five. But since G has only six vertices, there are at most five other vertices to 
which v can be connected. Thus, in order for v to have degree greater than five, either 
there are at least two edges connecting v to one of the other vertices or there is a loop 
incident on v. In either case G would not be simple. Thus there is no simple graph that 
satisfies the given conditions. 


b. There is no simple graph with four vertices of degrees 1, 1, 3, and 3. 


You might first try the same approach as in the solution for part (a): Assume such a 
graph exists and divide its total degree by the number of edges. Since the total degree 
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is 1+1+3+3 = 8 and there are four vertices, the result is 8/4 = 2. But this, by 
itself, is not a problem. You can easily find examples of simple graphs with four verti- 
ces where the average number of edges per vertex is two. Nonetheless, as the follow- 
ing argument shows, you will not be able to find a simple graph with total degree of 
eight and four vertices of degrees 1, 1, 3, and 3. 


Proof (by contradiction): Suppose there is a simple graph G with four vertices 

of degrees 1, 1, 3, and 3. Call a and b the vertices of degree 1, and call c and d the 
vertices of degree 3. Since deg(c) = 3 and G does not have any loops or parallel edges 
(because it is simple), there must be edges that connect c to a, b, and d. 


But then deg(a) = 2 and deg(b) = 2, which contradicts the supposition that these ver- 
tices have degree 1. Hence the supposition is false, and consequently there is no simple 
graph with four vertices of degrees 1, 1, 3, and 3. E 


Another important class of graphs consists of those that are “complete” in the sense that 
each vertex in the graph is connected by exactly one edge to each other vertex in the 
graph. 


Note The K stands Definition 
for the German word 
komplett, which means 
“complete.” 


Let n be a positive integer. A complete graph on n vertices, denoted K„, is a simple 
graph with n vertices and exactly one edge connecting each pair of distinct vertices. 


See ees Complete Graphs on n Vertices: K;, Kz, K3, K4, K5 
The complete graphs K,, K2, K3, K4, and K; can be drawn as follows: 


U3 
Up 
V2 U3 
Uo U4 
e Vjpe=—————el2 Uy U3 Uy U4 Ui v5 
K; K, K, Ki K; 
E 
elu eee The Number of Edges of K, 
ata 1) 


Prove that for any positive integer n, a complete graph on n vertices has —3—— edges. 
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Solution 


Proof: Suppose n is a positive integer and K, is a complete graph on n vertices. Because 
each vertex of K,, is connected by exactly one edge to each of the other n — 1 vertices of K,,, 
the degree of each vertex of K,, is n — 1. Thus 


the number of the degree of 
the total degree of K, = . 


vertices of K, 


) = n(n- 1). 


each vertex of K, 


By the handshake theorem (Theorem 4.9.1), the total degree of K,, equals twice the number 
of its edges. So since the total degree is n(n — 1), 


n(n — 1) = 2: (the number of edges of K,,) 
Dividing by 2 gives that 


n(n — 1) 


the number of edges of K, = 5 


In another class of graphs, called complete bipartite graphs, the vertex set can be separated 
into two subsets so that each vertex in one of the subsets is connected by exactly one edge 
to each vertex in the other subset but no vertex is connected by an edge to any other vertex 
in its own subset. 


Definition 


Let m and n be positive integers. A complete bipartite graph on (m, n) vertices, 
denoted K,,,,, is a simple graph whose vertices are divided into two distinct subsets, 
V with m vertices and W with n vertices, in such a way that 


1. every vertex of K,,,,, belongs to one of V or W, but no vertex belongs to both V 
and W; 


2. there is exactly one edge from each vertex of V to each vertex of W; 
3. there is no edge from any one vertex of V to any other vertex of V; 


4. there is no edge from any one vertex of W to any other vertex of W. 


Complete Bipartite Graphs: K3 > and K3 3 


The complete bipartite graphs K3 and K3 3 are illustrated below. 


In exercise 23, you are asked to find the number of edges of K, 
positive integers. 


where m and n are any 


m,n? 
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TEST YOURSELF 


1. 
2. 


EXERCISE SET 4.9 


CHAPTER4 ELEMENTARY NUMBER THEORY AND METHODS OF PROOF 


The total degree of a graph is defined as 


The handshake theorem says that the total degree 
of a graph is 


In any graph the number of vertices of odd degree 
is 


A simple graph is —___ 


A complete graph on n vertices is a 


6. 


A complete bipartite graph on (m, n) vertices is a 
simple graph whose vertices can be divided into 
two distinct, non-overlapping sets, say V with m 
vertices and W with m vertices, in such a way that 
(1) there is from each vertex of V to each 
vertex of W, (2) there is from any one ver- 
tex of V to any other of V, and (3) there is 
from any one vertex of W to any other vertex 
of W. 


In 1 and 2 find the degree of each vertex and the total de- 
gree of the graph. Check that the number of edges equals 
one-half of the total degree. 


1. 


—2 9 


v. v, 
2 4 
U6 


vi 
V3 Us 


. A graph has vertices of degrees 0, 2, 2, 3, and 9. 


How many edges does the graph have? 


A graph has vertices of degrees 1, 1, 4, 4, and 6. 
How many edges does the graph have? 


In each of 5-13 either draw a graph with the specified 
properties or explain why no such graph exists. 


5. 
6. 
7. 
8. 
9. 


10. 


1. 


Graph with five vertices of degrees 1, 2, 3, 3, and 5. 


Graph with four vertices of degrees 1, 2, 3, and 3. 
Graph with four vertices of degrees 1, 1, 1, and 4. 
Graph with four vertices of degrees 1, 2, 3, and 4. 


Simple graph with four vertices of degrees 1, 2, 3, 
and 4. 


Simple graph with five vertices of degrees 2, 3, 3, 
3, and 5. 


Simple graph with five vertices of degrees 1, 1, 1, 
2, and 3. 
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12. 


13. 


14. 


15. 


16. 


Simple graph with six edges and all vertices of 
degree 3. 


Simple graph with nine edges and all vertices of 
degree 3. 


At a party attended by a group of people, two 

people knew one other person before the party, 

and five people knew two other people before the 

party. The rest of the people knew three other 

people before the party. A total of 15 pairs of 

people knew each other before the party. 

a. How many people attending the party knew 
three other people before the party? 

b. How many people attended the party? 


A small social network contains three people who 
are network friends with six other people in the 
network, one person who is network friend with 
five other people in the network, and five people 
who are network friends with four other people 
in the network. The rest are network friends with 
three other people in the network. The network 
contains 41 pairs of network friends. 
a. How many people are network friends with 
three other people in the network? 
b. How many people are in the network? 


a. In a group of 15 people, is it possible for each 
person to have exactly 3 friends? Justify your 
answer. (Assume that friendship is a symmet- 
ric relationship: If x is a friend of y, then y is a 
friend of x.) 

b. Ina group of 4 people, is it possible for each 
person to have exactly 3 friends? Justify your 
answer. 


17. 


18. 


H 19. 


20. 


21. 


H 22. 


23. 


In a group of 25 people, is it possible for each to 
shake hands with exactly 3 other people? Justify 
your answer. 


Is there a simple graph, each of whose vertices has 
even degree? Justify your answer. 


Suppose that G is a graph with v vertices and e 
edges and that the degree of each vertex is at least 
din and at most dmax- Show that 

y i jeg ag V 

p mın 2 max 

a. Draw Ke, a complete graph on six vertices. 

b. Use the result of Example 4.9.9 to show that 
the number of edges of a simple say with n 
vertices is less than or equal to —— 

a. Ina simple graph, must every vertex have 
degree that is less than the number of vertices 
in the graph? Why? 

b. Can there be a simple graph that has four verti- 
ces all of different degrees? Why? 

c. For any integer n = 5, can there be a simple 
graph that has n vertices all of different de- 
grees? Why? 


In a group of two or more people, must there 
always be at least two people who are acquainted 
with the same number of people within the group? 
Why? 


Recall that K, „ denotes a complete bipartite graph 

on (m, n) vertices. 

a. Draw Ky». 

b. Draw K; 3. 

c. Draw K3 4. 

d. How many vertices of K„ „ have degree m? 
degree n? 

e. What is the total degree of K,,, ,? 


vi 


U4 


Uy 


US 


4.9 APPLICATION: THE HANDSHAKE THEOREM 243 


disjoint nonempty subsets V; and V3 such that 
vertices in V; may be connected to vertices in 

V>, but no vertices in V, are connected to other 
vertices in V; and no vertices in V, are connected 
to other vertices in V2. For example, the bipartite 
graph G illustrated in (i) can be redrawn as shown 
in (ii). From the drawing in (ii), you can see that 
G is bipartite with mutually disjoint vertex sets 

Vi = {v;, v3, vs} and V2 = {v2, v4, V6}. 


(i) v2 (ii) Le N a 3i 
svi Va. i 
vi V3 i i I 
\ \ i 
1 v3 V4 1 
i i 
V6 U4 i I 
ye Ne 20: sl 
N me 
Us se ae 


Find which of the following graphs are bipartite. 
Redraw the bipartite graphs so that their bipartite 
nature is evident. 


Hi ) V 
v4 U3 V3 


v3 d. Uo U3 
v3 vı U4 
Us 
V6 
V6 
Vg 
V2 f. v2 
U3 
vı U3 
U4 
Us U4 


zh 


Find a formula in terms of m and n for the 
number of edges of K, n. Justify your answer. 


24. A (general) bipartite graph G is a simple graph 
whose vertex set can be partitioned into two 


ANSWERS FOR TEST YOURSELF 


25. Suppose r and s are any positive integers. Does 


there exist a graph G with the property that G 
has vertices of degrees r and s and of no other 
degrees? Explain. 


1. the sum of the degrees of all the vertices of the graph 


2. equal to the number of edges of the graph 3. an even 
number 


4. a graph with no loops or parallel edges 


5. simple graph with n vertices whose set of edges contains 
exactly one edge for each pain of vertices 
edge, no edge 


6. one edge; no 
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EA Application: Algorithms 


Begin at the beginning . . . and go on till you come to the end: then stop. 
—Lewis Carroll, Alice’s Adventures in Wonderland, 1865 


In this section we will show how the number theory facts developed in this chapter form 
the basis for some useful computer algorithms. 

The word algorithm refers to a step-by-step method for performing some action. Some 
examples of algorithms in everyday life are food preparation recipes, directions for as- 
sembling equipment or hobby kits, sewing pattern instructions, and instructions for filling 
out income tax forms. Part of elementary school mathematics is devoted to learning algo- 
rithms for doing arithmetic such as multidigit addition and subtraction, multidigit (long) 
multiplication, and long division. 

The idea of a computer algorithm is credited to Ada Augusta, Countess of Lovelace. 
Trained as a mathematician, she became very interested in Charles Babbage’s design for an 
Lady Lovelace “Analytical Engine,” a machine similar in concept to a modern computer. Lady Lovelace 
(1815-1852) extended Babbage’s explorations of how such a machine would operate, recognizing that 
its importance lay “in the possibility of using a given sequence of instructions repeatedly, 
the number of times being either preassigned or dependent on the results of the computa- 
tion.” This is the essence of a modern computer algorithm. 


Hulton Archive/Getty Images 


An Algorithmic Language 


The algorithmic language used in this book is a kind of pseudocode, combining elements 
of Python, C, (ous and Java, and ordinary, but fairly precise, English. We will use some of 
the formal constructs of computer languages—such as assignment statements, loops, and 
so forth—but we will ignore the more technical details, such as the requirement for explicit 
end-of-statement delimiters, the range of integer values available on a particular installa- 
tion, and so forth. The algorithms presented in this text are intended to be precise enough 
to be easily translated into virtually any high-level computer language. 

In high-level computer languages, the term variable is used to refer to a specific storage 
location in a computer’s memory. To say that the variable x has the value 3 means that the 
memory location corresponding to x contains the number 3. A given storage location can 
hold only one value at a time. So if a variable is given a new value during program execu- 
tion, then the old value is erased. The data type of a variable indicates the set in which the 
variable takes its values, whether the set of integers, or real numbers, or character strings, 
or the set {0, 1} (for a Boolean variable), and so forth. 

An assignment statement gives a value to a variable. It has the form 


X= e, 


where x is a variable and e is an expression. This is read “x is assigned the value e” or “let 
x be e” When an assignment statement is executed, the expression e is evaluated (using 
the current values of all the variables in the expression), and then its value is placed in the 
memory location corresponding to x (replacing any previous contents of this location). 

Ordinarily, algorithm statements are executed one after another in the order in which 
they are written. Conditional statements allow this natural order to be overridden by us- 
ing the current values of program variables to determine which algorithm statement will 
be executed next. Conditional statements are denoted either 


a. if (condition) or b. if (condition) then s; 


then sı 
else sz 
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where condition is a predicate involving algorithm variables and where sı and s are al- 
gorithm statements or groups of algorithm statements. We generally use indentation to 
indicate that statements belong together as a unit. When ambiguity is possible, however, we 
may explicitly bind a group of statements together into a unit by preceding the group with 
the word do and following it with the words end do. 


Execution of an if-then-else statement occurs as follows: 


1. The condition is evaluated by substituting the current values of all algorithm vari- 
ables appearing in it and evaluating the truth or falsity of the resulting statement. 


2. If condition is true, then s4 is executed and execution moves to the next algo- 
rithm statement following the if-then-else statement. 


3. If condition is false, then s) is executed and execution moves to the next algo- 
rithm statement following the if-then-else statement. 


Execution of an if-then statement is similar to execution of an if-then-else statement, 
except that if condition is false, execution passes immediately to the next algorithm state- 
ment following the if-then statement. 

Often condition is called a guard because it is stationed before s; and s, and restricts 
access to them. 


Execution of if-then-else and if-then Statements 


Consider the following algorithm segments: 


a. ifx>2 b. y:=0 
then y:=x+1 if x > 2 then y := 2* 
else do x := x— 1 
y:= 3-x end do 
What is the value of y after execution of these segments for the following values of x? 
i x=5 i. x=2 
Solution 


a. (i) Because the value of x is 5 before execution, the guard condition x > 2 is true at 
the time it is evaluated. Hence the statement following then is executed, and so the 
value of x + 1 = 5+ 1 is computed and placed in the storage location correspond- 
ing to y. So after execution, y = 6. 

(ii) Because the value of x is 2 before execution, the guard condition x > 2 is false at 
the time it is evaluated. Hence the statement following else is executed. The value 
of x — 1 = 2— 1 is computed and placed in the storage location corresponding to 
x, and the value of 3-x = 3-1 is computed and placed in the storage location cor- 
responding to y. So after execution, y = 3. 

b. (i) Since x = 5 initially, the condition x > 2 is true at the time it is evaluated. So the 
statement following then is executed, and y obtains the value P= 92. 

(ii) Since x = 2 initially, the condition x > 2 is false at the time it is evaluated. Execu- 
tion, therefore, moves to the next statement following the if-then statement, and 
the value of y does not change from its initial value of 0. E 


Iterative statements are used when a sequence of algorithm statements is to be execut- 
ed over and over again. We will use two types of iterative statements: while loops and 
for-next loops. 
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A while loop has the form 


while (condition) 
[statements that make up 
the body of the loop] 
end while 


where condition is a predicate involving algorithm variables. The word while marks the 
beginning of the loop, and the words end while mark its end. 


Execution of a while loop occurs as follows: 

1. The condition is evaluated by substituting the current values of all the algorithm 
variables and evaluating the truth or falsity of the resulting statement. 

2. If condition is true, all the statements in the body of the loop are executed in 
order. Then execution moves back to the beginning of the loop and the process 
repeats. 

3. If condition is false, execution passes to the next algorithm statement following 
the loop. 


The loop is said to be iterated (IT-a-rate-ed) each time the statements in the body of the 
loop are executed. Each execution of the body of the loop is called an iteration (it-er-AY- 
shun) of the loop. 


Sete Tracing Execution of a while Loop 


Trace the execution of the following algorithm segment by finding the values of all the 
algorithm variables each time they are changed during execution: 


i:=1l,s:=0 

while (i = 2) 
Si=sti 
i:=it+l 

end while 


Solution Since iis given an initial value of 1, the condition i = 2 is true when the while 
loop is entered. So the statements within the loop are executed in order: 


s=O+1=1 and i=1+1=2. 


Then execution passes back to the beginning of the loop. 
The condition i = 2 is evaluated using the current value of i, which is 2. The condition 
is true, and so the statements within the loop are executed again: 


s=14+2=3 and 1=2+4+1=3. 


Then execution passes back to the beginning of the loop. 

The condition i = 2 is evaluated using the current value of i, which is 3. This time the 
condition is false, and so execution passes beyond the loop to the next statement of the 
algorithm. 

This discussion can be summarized in a table, called a trace table, that shows the cur- 
rent values of algorithm variables at various points during execution. The trace table for 
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a while loop generally gives all values immediately following each iteration of the loop. 
(‘After the zeroth iteration” means the same as “before the first iteration.’”) 


Trace Table 
Iteration Number 
0 1 2 
i 1 2 3 
Variable Name 
sS 0 1 3 


The second form of iteration we will use is a for-next loop. A for-next loop has the 
following form: 


for variable := initial expression to final expression 


[statements that make up 
the body of the loop] 
next (same) variable 


A for-next loop is executed as follows: 


. The for-next loop variable is set equal to the value of initial expression. 


. Acheck is made to determine whether the value of variable is less than or equal 
to the value of final expression. 


. If the value of variable is less than or equal to the value of final expression, 
then the statements in the body of the loop are executed in order, variable is 
increased by 1, and execution returns back to step 2. 


. If the value of variable is greater than the value of final expression, then execu- 
tion passes to the next algorithm statement following the loop. 


See aes Trace Table for afor-next Loop 


Convert the for-next loop shown below into a while loop. Construct a trace table for the 


loop. 
for i := 1 to 4 
x:=ř 
next 7 


Solution The given for-next loop is equivalent to the following: 


i:=1 
while (i = 4) 
ee 
AL 
i:=it+1 
end while 
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Its trace table is as follows: 


Trace Table 
Iteration Number 
0 1 2 3 4 
x 1 4 9 16 
Variable Name 
I 1 2 3 4 5 5 


A Notation for Algorithms 


Note Programming lan- We will express algorithms as subroutines that can be called upon by other algorithms 
guages have various terms as needed and used to transform a set of input variables with given values into a set of 
for subroutines: proce- output variables with specific values. The output variables and their values are assumed 


dures, functions, routines, to be returned to the calling algorithm. For example, the division algorithm specifies a 
and subprograms. 


procedure for taking any two positive integers as input and producing the quotient and 
remainder of the division of one number by the other as output. Whenever an algorithm 
requires such a computation, the algorithm can just “call” the division algorithm to do 
the job. 

We generally include the following information when describing algorithms formally: 


1. The name of the algorithm, together with a list of input and output variables. 
2. A brief description of how the algorithm works. 


3. The input variable names, labeled by data type (whether integer, real number, and so 
forth). 


4. The statements that make up the body of the algorithm, possibly with explanatory 
comments. 


5. The output variable names, labeled by data type. 


You may wonder where the word algorithm came from. It evolved from the last part of 
the name of the Persian mathematician Abu Ja’far Mohammed ibn Misa al-Khowarizmi. 
During Europe’s Dark Ages, the Arabic world enjoyed a period of intense intellectual 
al-Khowârizmî activity. One of the great mathematical works of that period was a book written by al- 
eae) Khowéarizmi that contained foundational ideas for the subject of algebra. The translation of 
this book into Latin in the thirteenth century had a profound influence on the development 
of mathematics during the European Renaissance. 


Eduard Kim/Shutterstock.com 


The Division Algorithm 


For an integer a and a positive integer d, the quotient-remainder theorem guarantees the 
existence of integers q and r such that 


a=dqtr and 0=r<d. 


In this section, we give an algorithm to calculate g and r for given a and d where a is 
nonnegative. (The extension to negative a is left to the exercises at the end of this sec- 
tion.) The following example illustrates the idea behind the algorithm. Consider trying 
to find the quotient and the remainder of the division of 32 by 9, but suppose that you 
do not remember your multiplication table and have to figure out the answer from basic 
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principles. The quotient represents that number of 9’s that are contained in 32. The re- 
mainder is the number left over when all possible groups of 9 are subtracted. Thus you 
can calculate the quotient and remainder by repeatedly subtracting 9 from 32 until you 
obtain a number less than 9: 


32-9 = 23 = 9, and 


32-9-9 = 14 = 9, and 
32-—9-9-9=5 <9. 


This shows that 3 groups of 9 can be subtracted from 32 with 5 left over. Thus the quotient 
is 3 and the remainder is 5. 


Algorithm 4.10.1 Division Algorithm 


[Given a nonnegative integer a and a positive integer d, the aim of the algorithm is 
to find integers q and r that satisfy the conditions a = dq+r and O Sr < d. This 
is done by subtracting d repeatedly from a until the result is less than d but is still 
nonnegative. 


0<a—d—d-—d--:--d=a-dq<d. 


The total number of d’s that are subtracted is the quotient q. The quantity a — dq equals 
the remainder r.] 
Input: a [a nonnegative integer], d [a positive integer] 


Algorithm Body: 


r:=a,q:=0 

[Repeatedly subtract d from r until a number less than d is obtained. Add 1 to q each 
time d is subtracted.] 

while (r = d) 


r:i=r-d 
q:=qtl 
end while 


[After execution of the while loop, a = dq +r.] 


Output: q, r [nonnegative integers] 


Note that the values of g and r obtained from the division algorithm are the same as 
those computed by the div and mod functions built into a number of computer languages. 
That is, if g and r are the quotient and remainder obtained from the division algorithm with 
input a and d, then the output variables q and r satisfy 


q=adivd and r=amodd. 


The next example asks for a trace table for the division algorithm. 


Tracing the Division Algorithm 
Trace the action of Algorithm 4.10.1 on the input variables a = 19 and d = 4. 


Solution Make a trace table as shown on the next page. The column under the kth itera- 
tion gives the states of the variables after the kth iteration of the loop. 
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Iteration Number 


0 1 2 3 4 
a 19 
d 4 
Variable Name 
r 19 15 11 7 3 
q 0 1 2 3 4 - 


The Euclidean Algorithm 


The greatest common divisor of two integers a and b is the largest integer that divides 
both a and b. For example, the greatest common divisor of 12 and 30 is 6. The Euclidean 
algorithm provides a very efficient way to compute the greatest common divisor of two 
integers. 


Definition 


Let a and b be integers that are not both zero. The greatest common divisor of a 
and b, denoted ged(a, b), is that integer d with the following properties: 


1. dis a common divisor of both a and b. In other words, 


dja and d|b. 


2. For every integer c, if c is a common divisor of both a and b, then c is less than or 
equal to d. In other words, 


for every integer c, if c|a and c|b then c = d. 


Calculating Some gcd’s 


a. Find gcd(72, 63). 
b. Find ged(10”°, 6°). 


c. In the definition of greatest common divisor, gcd(0, 0) is not allowed. Why not? What 
would gcd(0, 0) equal if it were found in the same way as the greatest common divi- 
sors for other pairs of numbers? 


Solution 


a. 72 = 9-8 and 63 = 9-7. So 9|72 and 9|63, and no integer larger than 9 divides both 
72 and 63. Hence gcd(72, 63) = 9. 


b. By the laws of exponents, 10” = 2.5% and 6 = 9-3"? = 9 5? Tt follows 
that 
2110 and 26”, 
and by the unique factorization of integers theorem, no integer larger than 2” divides 


both 107° and 6” (because no more than twenty 2’s divide 10%, no 3’s divide 107°, and 
no 5’s divide 6°’). Hence ged(10”", 6°°) = 2”. 


c. Suppose gcd(0, 0) were defined to be the largest common factor that divides 0 and 0. 
The problem is that every positive integer divides 0 and there is no largest integer. So 
there is no largest common divisor! E 
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Calculating gcd’s using the approach illustrated in Example 4.10.5 works only when the 
numbers can be factored completely. By the unique factorization of integers theorem, all 
numbers can, in principle, be factored completely. But, in practice, even using the highest- 
speed computers, the process is unfeasibly long for very large integers. Over 2,000 years 
ago, Euclid devised a method for finding greatest common divisors that is easy to use and 
is much more efficient than either factoring the numbers or repeatedly testing both num- 
bers for divisibility by successively larger integers. 

The Euclidean algorithm is based on the following two facts, which are stated as 
lemmas. 


Lemma 4.10.1 


If r is a positive integer, then gcd(r, 0) = r. 


Proof: Suppose r is a positive integer. [We must show that the greatest common divisor 
of both r and 0 is r.] Certainly, r is a common divisor of both r and 0 because r divides 
itself and also r divides 0 (since every positive integer divides 0). Also no integer 
larger than r can be a common divisor of r and 0 (since no integer larger than r can 
divide r). Hence r is the greatest common divisor of r and 0. 


The proof of the second lemma is based on a clever pattern of argument that is used 
in many different areas of mathematics: To prove that A = B, prove that A = B and that 
BSA. 


Lemma 4.10.2 


If a and b are any integers not both zero, and if q and r are any integers such that 


a=bqtr, 
then 
gcd(a, b) = gced(b, r). 


Proof: /The proof is divided into two sections: (1) proof that gcd(a, b) = gcd(b, r), and 
(2) proof that gcd(b, r) = gcd(a, b). Since each gcd is less than or equal to the other, the 
two must be equal.] 
1. gcd(a, b) = ged(b, r): 
a. [We will first show that any common divisor of a and b is also a common divi- 
sor of b and r.] 
Let a and b be integers, not both zero, and let c be a common divisor of 
a and b. Then c|a and c|b, and so, by definition of divisibility, a = nc and 
b = mc, for some integers n and m. Substitute into the equation 


a=bq+r 
to obtain 
nc = (mc)q +r. 


(continued on page 252) 
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Then solve for r: 


r =nc—(mc)q = (n—ma)c. 


Now n — mq is an integer, and so, by definition of divisibility, c|r. Because 
we already know that c|b, we can conclude that c is a common divisor of b 
and r [as was to be shown]. 
b. [Next we show that gcd(a, b) = gced(b, r).] 

Now the greatest common divisor of a and b is defined because a and 
b are not both zero. Also, by part (a), every common divisor of a and b 
is acommon divisor of b and r, and so the greatest common divisor of a 
and b is a common divisor of b and r. But then gcd(a, b) (being one of the 
common divisors of b and r) is less than or equal to the greatest common 
divisor of b and r: 


gcd(a, b) = gcd(b, r). 
2. gcd(b, r) S gcd(a, b): 


The second part of the proof is very similar to the first part. It is left as an 
exercise. 


The Euclidean algorithm can be described as follows: 


Euclidean Algorithm Description 


1. Let A and B be integers with A > B = 0. 


2. To find the greatest common divisor of A and B, first check whether B = 0. If it 
is, then ged(A, B) = A by Lemma 4.10.1. If it isn’t, then B > 0 and the quotient- 
remainder theorem can be used to divide A by B to obtain a quotient g and a 
remainder r: 


A = Bq+r whereO=r<B. 


By Lemma 4.10.2, gcd(A, B) = gcd(B, r). Thus the problem of finding the great- 
est common divisor of A and B is reduced to the problem of finding the greatest 
common divisor of B and r. 

[What makes this information useful is the fact that the larger number of the pair 
(B, r) is smaller than the larger number of the pair (A, B). The reason is that the value 
of r found by the quotient-remainder theorem satisfies 


0sr<B. 


And, since by assumption B < A, we have that 
Note Strictly speaking, 


the fact that the repeti- OSr<B<A.] 

tions eventually termi- 

nate is justified by the 3. Now just repeat the process, starting again at (2), but use B instead of A and 
well-ordering principle r instead of B. The repetitions are guaranteed to terminate eventually with 
for the integers, which is r = 0 because each new remainder is less than the preceding one and all are 
discussed in Section 5.4. nonnegative. 
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By the way, it is always the case that the number of steps required in the Euclidean al- 
gorithm is at most five times the number of digits in the smaller integer. This was proved 
by the French mathematician Gabriel Lamé (1795-1870). 

The following example illustrates how to use the Euclidean algorithm. 


Seley Hand-Calculation of gcd’s Using the Euclidean Algorithm 
Use the Euclidean algorithm to find gcd(330, 156). 
Solution 


1. Divide 330 by 156: 


2 < quotient 


156| 330 
312 
“18 <— remainder 


Thus 330 = 156-2 + 18, and hence gcd(330, 156) = gcd(156, 18) by Lemma 4.10.2. 
2. Divide 156 by 18: 


8 < quotient 


18] 156 
144 


12 <— remainder 


Thus 156 = 18-8 + 12, and hence ged(156, 18) = gcd(18, 12) by Lemma 4.10.2. 
3. Divide 18 by 12: 


1 < quotient 


12| 18 
12 


6 <— remainder 


Thus 18 = 12-1 +6, and hence ged(18, 12) = gcd(12, 6) by Lemma 4.10.2. 
4. Divide 12 by 6: 


2 < quotient 


6| 12 
12 


0 <remainder 


Thus 12 = 6:2 + 0, and hence gcd(12, 6) = gcd(6, 0) by Lemma 4.10.2. 
Putting all the equations above together gives 
gcd(330, 156) = gcd(156, 18) 
= gcd(18, 12) 
= gcd(12, 6) 
= gcd(6, 0) 
=6 by Lemma 4.10.1. 


Therefore, gcd(330, 156) = 6. E 
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The following is a version of the Euclidean algorithm written using formal algorithm 
notation. 


Algorithm 4.10.2 Euclidean Algorithm 


[Given two integers A and B with A > B = 0, this algorithm computes gcd(A, B). It is 
based on two facts: 

1. gcd(a, b) = gcd(b, r) if a, b, q, and r are integers witha = b-q+rand0=r<b. 
2. gcd(a, 0) = a.] 


Input: A, B [integers with A > B = 0] 
Algorithm Body: 


a:=A,b:=B,r:=B 
[If b + 0, compute a mod b, the remainder of the integer division of a by b, and set r 
equal to this value. Then repeat the process using b in place of a and r in place of b.] 
while (b # 0) 
r:=amodb 
[The value of a mod b can be obtained by calling the division algorithm.] 
a:=b 
b:=r 
end while 
[After execution of the while loop, gcd(A, B) = a.] 
gcd:=a 


Output: ged /a positive integer] 


A Trace Table for the Euclidean Algorithm 


Construct a trace table for Algorithm 4.10.2 using A = 330 and B = 156, the same num- 
bers as in Example 4.10.6. 


Solution 


A | 330 
B 156 
a 330 | 156 | 18 12 
b 156 | 18 12 6 
r 156 | 18 12 6 


DISS 


TEST YOURSELF 


1. When an algorithm statement of the form x : = e if (condition) 
is executed, 


then sı 


2. Consider an algorithm statement of the following else s, 
form. 
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When such a statement is executed, the truth or 
falsity of the condition is evaluated. If condition is 
true, _______. If condition is false, 


3. Consider an algorithm statement of the following 
form. 
while (condition) 
[statements that make up the body of the loop] 
end while 


When such a statement is executed, the truth or 
falsity of the condition is evaluated. If condition is 
true, _______. If condition is false, 


4. Consider an algorithm statement of the following 
form. 
for variable : = initial expression to final 
expression. 
[statements that make up the body of the loop] 
next (same) variable 


EXERCISE SET 4.10 
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When such a statement is executed, variable is set 
equal to the value of the initial expression, and a 
check is made to determine whether the value of 
variable is less than or equal to the value of final 
expression. If so, _____. If not, 


5. Given a nonnegative integer a and a positive inte- 
ger d the division algorithm computes 


6. Given integers a and b, not both zero, gcd(a, b) is 
the integer d that satisfies the following two condi- 
tions: and 


7. If ris a positive integer, then gcd(r, 0) = 


8. If a and b are integers not both zero and if q and r 
are nonnegative integers such that a = bq + r then 
gcd(a, b) = ____. 


9. Given positive integers A and B with A > B, the 
Euclidean algorithm computes 


Find the value of z when each of the algorithm segments 
in 1 and 2 is executed. 


1. i:=2 2. i:=3 
if (i > 3 ori $0) if (i = 3 ori > 6) 
then z := 1 then z := 2 
else z := 0 else z := 0 


3. Consider the following algorithm segment: 
if x-y > 0 then do y := 3-x 
x:=x+ 1 end do 
Zi=xy 
Find the value of z if prior to execution x and y 
have the values given below. 
a x=2,y=3 
b. x=1,y=1 
Find the values of a and e after execution of the loops in 4 
and 5 by first making trace tables for them. 


4. a:=2 5. e:=2,f:=0 
for i := 1 to3 for k:= 1 to3 
a:=3at+l e:=erk 
next i f:=etf 
next k 


Make a trace table to trace the action of Algorithm 4.10.1 
for the input variables given in 6 and 7. 


6. a=26,d=7 7. a=59,d = 13 


8. The following algorithm segment makes change; 
given an amount of money A between 1¢ and 99¢, 


it determines a breakdown of A into quarters (q), 
dimes (d), nickels (n), and pennies (p). 


q:= Adiv 25 
A:=Amod 25 
d:=Adiv 10 
A:=Amod 10 
n:=Adiv5 

p:=Amod5 


a. Trace this algorithm segment for A = 69. 
b. Trace this algorithm segment for A = 87. 


Find the greatest common divisor of each of the pairs of 
integers in 9-12. (Use any method you wish.) 


9. 27 and 72 
11. 7 and 21 


10. 5and9 
12. 48 and 54 


Use the Euclidean algorithm to hand-calculate the greatest 
common divisors of each of the pairs of integers in 13-16. 


13. 1,188 and 385 14. 509 and 1,177 


15. 832 and 10,933 16. 4,131 and 2,431 


Make a trace table to trace the action of Algorithm 4.10.2 
for the input variables given in 17-19. 


17. 1,001 and 871 
18. 5,859 and 1,232 
19. 1,570 and 488 
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common divisors. (After all, division is repeated 
subtraction.) It is based on the following lemma. 


Definition: Integers a and b are said to be relatively 
prime if, and only if, their greatest common divisor is 1. 


In 20 and 21 trace the action of Algorithm 4.10.2 to deter- 
mine whether the integers are relatively prime. 


Lemma 4.10.3 
If a=b>O0, then gcd(a, b) = gcd(b, a — b). 


20. 4.617 and 2.563 21. 34.391 and 6.728. Algorithm 4.10.3 Computing gcd’s by Subtraction 
? ? ? [Given two positive integers A and B, variables a and b are 
H 22. Prove that for all positive integers a and b, a | bif, set equal to A and B. Then a repetitive process begins. If 


23. 


24. 


H 25. a. Prove: Ifa and d are positive integers and q ita =b thena: = a-b 
and r are integers such that a = dq + r and iseh = bsa 
0< r< d, then end while 
-a = d—-(q4+ 1) + (d-7) if a = 0 then gcd := b 
else gcd := a 
and 0<d-r<d. [After execution of the if-then-else statement, gcd = 
b. Indicate how to modify Algorithm 4.10.1 to ged(A, B).] 
allow for the input a to be negative. Output: gcd [a positive integer] 
26. a. Prove that if a, d, q, and r are integers such 
that a = dq+rand0=r<d, then a. Prove Lemma 4.10.3. 
b. Trace the execution of Algorithm 4.10.3 for 
gla eni Fem edid, A = 630 and B = 336. 
b. In a computer language with a built-in-floor c. Trace the execution of Algorithm 4.10.3 for 
function, div and mod can be calculated as A = 768 and B = 348. 
follows: 
Exercises 28-32 refer to the following definition. 
adivd=|a/d| and a mod d=a-—|a/d|-d. 
: . Definition: The least common multiple of two nonzero 
Rewrite the steps of Algorithm 4.10.2 for a integers a and b, denoted Iem(a, b), is the positive integer 
computer language with a built-in floor func- c such that 
tion but without div and mod. a. alc and b|c 
27. An alternative to the Euclidean algorithm uses us ns positive integers m, if a|m and b|m, then 
subtraction rather than division to compute greatest a8 


and only if, gcd(a, b) = a. (Note that to prove “A 
if, and only if, B,’ you need to prove “if A then B” 
and “if B then A.”) 


a. Prove that if a and b are integers, not both zero, 
and d = gcd(a, b), then a/d and b/d are integers 
with no common divisor that is greater than 1. 

b. Write an algorithm that accepts the numerator 
and denominator of a fraction as input 
and produces as output the numerator and 
denominator of that fraction written in lowest 
terms. (The algorithm may call upon the 
Euclidean algorithm as needed.) 


Complete the proof of Lemma 4.10.2 by proving 
the following: If a and b are any integers with 
b + 0 and q and r are any integers such that 


a = bq+r. 


then gcd(b, r) = gcd(a, b). 
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a # 0, and b + Q, then the larger of a and b is set equal to 
a—b (ifa = b)ortob—a (ifa < b), and the smaller of a 
and b is left unchanged. This process is repeated over and 
over until eventually a or b becomes 0. By Lemma 4.10.3, 
after each repetition of the process, 


gcd(A, B) = gcd(a, b). 
After the last repetition, 
gcd(A, B) = ged(a,0) or gcd(A, B) = gcd(0, b) 


depending on whether a or b is nonzero. But by Lemma 
4.10.1, 


gcd(a, 0) =a and gcd(0, b) = b. 
Hence, after the last repetition, 
gcd(A, B)=aifa+0 or ged(A, B) = b ifb #0.] 
Input: A, B [positive integers] 


Algorithm Body: 
a:=A,b:=B 
while (a # 0 and b # 0) 


28. Find 
a. Icm(12, 18) 
b. Icm(2?-3-5, 27-37) 
c. Icm(2800, 6125) 


29. Prove that for all positive integers a and b, 
gcd(a, b) = lcm(a, b) if, and only if, a = b. 


ANSWERS FOR TEST YOURSELF 
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30. Prove that for all positive integers a and b, a|b if, 
and only if, Icm(a, b) = b. 


31. Prove that for all integers a and b, 
gcd(a, b)|Icm(a, b). 


32. Prove that for all positive integers a and b, 
gcd(a, b): lcm(a, b) = ab. 


1. the expression e is evaluated (using the current values of 
all the variables in the expression), and this value is placed 
in the memory location corresponding to x (replacing 

any previous contents of the location) 2. statement s4 is 
executed; statement s is executed 3. all statements in the 
body of the loop are executed in order and then execution 
moves back to the beginning of the loop and the process 
repeats; execution passes to the next algorithm statement 


following the loop 4. the statements in the body of the 
loop are executed in order, variable is increased by 1, and 
execution returns to the top of the loop; execution passes to 
the next algorithm statement following the loop 5. integers 
q and r with the property that n = dq +r and0 Sr < d 

6. d divides both a and b; if c is a common divisor of both 

a and b, thenc Sd 7.r 8.gcd(b,r) 9. the greatest 
common divisor of A and B (Or: gcd(A, B)) 
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SEQUENCES, 


Note The symbol “.. .” 
is called an ellipsis. It 
is shorthand for “and so 
forth.” 


Note Strictly speaking, 
the true value of A, is less 
than 2 when k is large, 
because ancestors from 
one branch of the family 
tree may also appear on 


other branches of the tree. 
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MATHEMATICAL 
INDUCTION, AND 
RECURSION 


One of the most important tasks of mathematics is to discover and characterize regular 
patterns, such as those associated with processes that are repeated. The main mathemati- 
cal structure used in the study of repeated processes is the sequence, and the main math- 
ematical tool used to verify conjectures about sequences is mathematical induction. In 
this chapter we introduce the notation and terminology of sequences, show how to use 
both ordinary and strong mathematical induction to prove properties about them, illus- 
trate the various ways recursively defined sequences arise, describe a method for obtain- 
ing an explicit formula for a recursively defined sequence, and explain how to verify the 
correctness of such a formula. We also discuss a principle—the well-ordering principle 
for the integers—that is logically equivalent to the two forms of mathematical induction, 
and we show how to adapt mathematical induction to prove the correctness of computer 
algorithms. In the final section we discuss more general recursive definitions, includ- 
ing those for Boolean expressions and recursive functions, and we introduce a variation 
of mathematical induction, called structural induction, which is especially important in 
computer science. 


Sequences 


A mathematician, like a painter or poet, is a maker of patterns. 
—G. H. Hardy, A Mathematician’s Apology, 1940 


Imagine that a person decides to count his ancestors. He has two parents, four grandpar- 
ents, eight great-grandparents, and so forth. These numbers can be written in a row as 


2, 4, 8, 16, 32, 64, 128,... 


To express the pattern of the numbers, suppose that each is labeled by an integer giving 
its position in the row. 


Position in the row 1 2 3 4 5 6 Tess 


Number of ancestors 2 4 8 16 32 64 128... 


The number corresponding to position 1 is 2, which equals 2'. The number correspond- 
ing to position 2 is 4, which equals 2°. For positions 3, 4, 5, 6, and 7, the corresponding 
numbers are 8, 16, 32, 64, and 128, which equal 2 24, 2. a and z. respectively. For a 
general value of k, let A, be the number of ancestors in the kth generation back. The pattern 
of computed values strongly suggests the following for each k: 


A= oO" 
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Definition 


A sequence is a function whose domain is either all the integers between two given 
integers or all the integers greater than or equal to a given integer. 


We typically represent a sequence as a set of elements written in a row. In the sequence 
denoted 


Am» Am+1> An+2>+++> Am 


each individual element a, (read “a sub k”) is called a term. The k in a, is called a sub- 
script or index, m (which may be any integer) is the subscript of the initial term, and n 
(which must be an integer that is greater than or equal to m) is the subscript of the final 
term. The notation 


Am, Am+1> AIm+2>+++ 


denotes an infinite sequence. An explicit formula or general formula for a sequence is a 
rule that shows how the values of a, depend on k. 

The following example shows that it is possible for two different formulas to give se- 
quences with the same terms. 


CASAN Finding Terms of Sequences Given by Explicit Formulas 


Define sequences a, a2, a3, . . . and bo, b3, b4, . . . by the following explicit formulas: 


k 
a= LI for every integer k = 1, 


~] 
b; = . i for every integer i = 2. 
i 


Compute the first five terms of both sequences. 


Solution 

oe gall 

“a~ pe 2 a a 

2 2 3k 2 

a =~ == b, =—— == 

2+1 3 3 3 

2828 4,814 

464° 4 - 4 a 

4 4 5-1 4 

a; = — =- b; = —— =- 

4+1 5 5 3 

5 3 6=1._-3 

as = —_ = — be — a M 

SFL -6 6 6 
As you can see, the first terms of both sequences are L, Z, 3. t Š; in fact, it can be shown 
that all terms of both sequences are identical. E 


The next example shows that an infinite sequence may have a finite number of values. 
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Geuuceaee An Alternating Sequence 


Compute the first six terms of the sequence co, C1, C2, . . . defined as follows: 


c =(- 1ř for every integer j = 0. 


Solution cy =(-1}ł=1 
q =(-1)} =-1 
Q=(-l’=1 
c,=(-1P =-1 
co =(-14 = 1 


cs =(-1) =-1 


Thus the first six terms are 1, —1, 1, — 1, 1, — 1. By exercises 10 and 11 of Section 4.2, even 
powers of — 1 equal | and odd powers of — 1 equal —1. It follows that the sequence oscil- 
lates endlessly between | and —1. a 


In Examples 5.1.1 and 5.1.2 the task was to compute the first few terms of a sequence 
given by an explicit formula. The next example treats the question of how to find an explicit 
formula for a sequence with given initial terms. Any such formula is a guess, but it is useful 
to be able to make such guesses. 


Ce Finding an Explicit Formula to Fit Given Initial Terms 


Find an explicit formula for a sequence with the following initial terms: 


Solution Denote the general term of the sequence by a, and suppose the first term is a}. 
Next observe that the denominator of each term is a perfect square. Thus the terms can be 
rewritten as 


1 6D. L ey 1 1) 


1?’ 22 $ 37 42 P 5? 62 
t t t t t t 
a, dz a, ag as 6 


Now note that the denominator of each term equals the square of the subscript of that term, 
and that the numerator equals +1. Hence 


Also the numerator oscillates back and forth between +1 and —1; it is +1 when k is odd 
and —1 when k is even. To achieve this oscillation, insert a factor of (— i (or (— i) 
into the formula for az. [For when k is odd, k+ 1 is even and thus (— 1? = +1; and when 
kis even, k+ 1 is odd and thus (—1)**' = —1.] Consequently, an explicit formula that gives 
the correct first six terms is 


for every integer k = 1. 
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Note that making the first term ap) would have led to the alternative formula 
g 0 
zA = 
Caution! Itis also pos- Ok = (k+ for every integer k = 0. 
sible for two sequences . . . . 
to start off with the same You should check that this formula also gives the correct first six terms. E 


initial values but diverge 
later on. See exercise 5 at 
the end of this section. 


Summation Notation 


Consider again the example in which A; = g: represents the number of ancestors a person 
has in the kth generation back. What is the total number of ancestors for the past six gen- 
erations? The answer is 


A, +A +A3 +44 +A; +A = 2! +2 +2 +242 +26 = 126. 


It is convenient to use a shorthand notation to write such sums. In 1772 the French math- 
ematician Joseph Louis Lagrange introduced the capital Greek letter sigma, ©, to denote 
the word sum (or summation), and defined the summation notation as follows: 


Definition 


n 
If m and n are integers and m < n, the symbol > a,, read the summation from k 


equals m to n of a-sub-k, is the sum of all the terms am Grae anra Chas WWE 
say that am + G41 + 4m+2 + ' `- +a, is the expanded form of the sum, and we write 
n 
Da z Cg r Gael TOAT pi Pll 


k=m 


Historical/Getty Images 


Joseph Louis Lagrange 
(1736-1813) 


We call k the index of the summation, m the lower limit of the summation, and n the 
upper limit of the summation. 


Computing Summations 


Let a; = —2, a = —1, a3 = 0, a4 = 1, andas = 2. Compute the following: 
5 2 2 
a. Ya b. Sa C. S ax 
k=1 k=2 k=1 
Solution 


= a; +a + a3 + a4 + as — (—2)+(-1)+0+1+2=0 


a = -l1 


5 
a. Sa 
k=1 
2 
b. Sa 
k=2 

2 
Cc. X ax = a. +a. = ag +a, = —14+1=0 E 
Oftentimes, the terms of a summation are expressed using an explicit formula. For in- 


stance, it is common to see summations such as 


: 2 1) 
kK 3 
{Ko 2 


k= 
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CUA When the Terms of a Summation Are Given by a Formula 
5 
Compute Şe. 


k=1 
5 

Solution ZE = 1+2? EP +a +5? = 55, m 
k=1 


When the upper limit of a summation is a variable, an ellipsis is used to write the sum- 
mation in expanded form. 


Changing from Summation Notation to Expanded Form 


n (—| l 
Write = C ) in expanded form: 
a l 


Solution 
A pi 1)° D! D 13 1)" 
C Fal as ) £ J. ) pragt ) 
itl 0+1 1+1 2+1 3+1 n+1 
1 -1 =f —1)”" 
_ af ere. l purgi ) 
1 2 3 4 n+l 
— 1)" 
Se et! = 
3 n+1 


Changing from Expanded Form to Summation Notation 
Express the following using summation notation: 


1 2; 3 n+1 
+ + , 
n n+l n+2 2n 


: ; : i+ : 
Solution The general term of this summation can be expressed as itl for each integer 
i from 0 to n. Hence 


LS, Sed ed 


+ + brett = eee a 
n n+l n+2 2n Stri 


In Examples 5.1.6 and 5.1.7, the top index n of the summation is a free variable because 
it may be replaced by any integer greater than or equal to the bottom index, and each such 
replacement leads to a different summation. For any particular summation the top index 
acts like a constant. Thus when the top index also appears in the terms of the summation, 
as in Example 5.1.7, its value does not change from term to term. By contrast, the index 
variable in these examples is bound by the summation symbol. It must take every value 
from the bottom limit to the top limit in succession. The binding of an index variable in 
a summation is similar to the binding of a variable in a quantified statement or of a local 
variable in a computer program. 

Writing a summation in expanded form helps relate it to our previous experience of 
working with sums. But for small values of n the expanded form may be misleading. For 
instance, consider trying to evaluate the following expression for n = 1: 


1+2 +3 + n. 
A It may be tempting to write that when n = 1, 1+2 +3 + n equals 
Caution! 5 
Don’t write this > 174+27+3?+---+1*. This is wrong! 
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Example 5.1.8 


Example 5.1.9 
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The reason is that 17+27+3*+--:+n? is simply a way of representing the sum of 
squares of consecutive integers starting with 1° and ending with n’. Thus, when n = 1 the 
sum starts and ends with 1, and so it is just 1°. If n = 2 the sum is 1° + a and if n = 3 the 
sum is 17 +27+ 3°, 


Evaluating a, +a, +a, +: : : + a, for Small n 
What is the value of 2? +2! +2? +- -- +2” when n = 0,n = 1, andn = 2? 


Solution When you evaluate a summation like 2° +2! +2? +--+ 2” for small values of 
n, you can avoid a mistake by imagining it in summation notation. For instance, 


Pe 2 eS yt 
i=0 


0 
So when n = 0, 2° +2! +2? +: -+ +2” has the value X 2' = 2° = 1. 
i=0 


1 
When n = 1, 2°+2' +2? +-+: +2" has the value X 2 = 2° +2' = 142. 


i=0 


2 
When n = 2, 2° +2' +2? +: +2" has the value X, 2f = 2° +2'+2°=1+2+4. m 
i=0 
A more mathematically precise definition of summation, called a recursive definition, 
is the following:* If m is any integer, then 
m n n-1 
Sa =a, and X a= Sata, foreveryinteger n > m. 


k=m k=m k=m 


When solving problems, it is often useful to rewrite a summation using the recursive form 
of the definition, either by grouping summands using a single summation sign or by sepa- 
rating off the final term of a summation. 


Using a Single Summation Sign and Separating Off a Final Term 


n 
a. Write 52 +2"*! asa single summation. 
k=0 
n+l 
b. Rewrite Da a by separating off the final term. 
i=1 


Solution 
n n+1 
ay 2 = (22 22a yO 
k=0 k=0 
aI] 1 1 1 1 1 n] 1 
b. = >+5+ ae + = + E 
> rr E n (ntl) 2? (n+1)° 


In certain sums each term is a difference of two quantities. When you write such 
sums in expanded form, you sometimes see that all the terms cancel except the first 
and the last. 


*Other recursively defined sequences are discussed later in this section and, in greater detail, in Section 5.6. 
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A Telescoping Sum 


Some sums can be transformed so that successive cancellation of terms collapses the final 
result like a telescope. For instance, observe that for every integer k = 1, 


1 1 _(k+D-k_ 1 
k k+l k(k+ 1) k(k+1) 
a 1 
Use this identity to find a simple expression for È ESN 


Solution 


n 1 n 1 1 
> kk+1) sia z) 


n+1 


Product Notation 


The notation for the product of a sequence of numbers is analogous to the notation for their 
sum. The Greek capital letter pi, L, denotes a product. For example, 


5 


Ig = G)a703040s. 
k=1 


Definition 
n 
If m and n are integers and m = n, the symbol | [ap read the product from k 
k=m 
equals m to n of a-sub-k, is the product of all the terms am, 4m+1; á4m+2: - -> An- 
We write 


n 
[a = An? Am+1°4m+2° °° An 


k=m 


A recursive definition for the product notation is the following: If m is any integer, then 


m n eA 
[flan and [|a= (Ta) ʻa, for every integer n > m. 
k=m k=m k=m 


Computing Products 


Compute the following products: 
5 
: k b. —— 
i II Hoa 


Solution 
k 1 if 
k+1 1+1 2 


5 
a. [k= 1-2-3-4-5=120 bv TT 
k=1 


1 
k=1 
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Properties of Summations and Products 


The following theorem states general properties of summations and products. The proof of 
the theorem is discussed in Section 5.6. 


Theorem 5.1.1 


If Qins Aint ts Am+2+-- and Dy, Dyn +1, Om+25 ++. are sequences of real numbers and c is 
any real number, then the following equations hold for any integer n = m: 


il. at S bk = Sat bp 


k=m k=m k=m 


DERG: Sa = Se a; generalized distributive law 


k=m k=m 
3 (Kta) (L1) = [][@-20.- 
k=m k=m k=m 


Using Properties of Summation and Product 


Let ap = k+ 1 and b; = k—1 for every integer k. Write each of the following expressions 
as a single summation or product: 


k=m k=m 
k=m k=m 
Solution 
Sate: X b= Skt a2 k= 1) by substitution 
k=m k=m k=m k=m 
-S++ (k— 1) by Theorem 5.1.1 (2) 
k=m 
= Si ((k+ I) +2-(k-D) by Theorem 5.1.1 (1) 
k=m 
= > (3k—1) by algebraic simplification 
k=m 
b. (TI) $ (ra) = (iie D) š (I1«- D) by substitution 
k=m k=m k=m k=m 
= [ħ[&+ 1):(k-1)) by Theorem 5.1.1 (3) 
k=m 
= He =f) by algebraic simplification E 
k=m 
Change of Variable 
3 
Observe that Se = 1+2? +3? 
k=1 
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3 
and also that >` P= 17+274+37, 


{=i 
3 3 

Hence ye = Sr’. 
i=l =I 


The symbol used to represent an index of a summation is an example of a local variable, of- 
ten called a dummy variable, because, as illustrated above, it can be replaced by any other 
symbol as long as the replacement is made in each location where it occurs. Outside of that 
context (both before and after), the symbol may have another meaning entirely. In the same 
way, a symbol used to represent a variable in a universally or existentially quantified state 
can be replaced by any other symbol as long as the replacements are made consistently. 

The appearance of a summation can be altered by more complicated changes of variable 
as well. For example, observe that 


4 
Sg- D = 2-19 +6-D°+(4-1P 
j=2 


A general procedure to transform the first summation into the second is illustrated in 
Example 5.1.13. 


Transforming a Sum by a Change of Variable 


Transform the following summation by making the specified change of variable: 
ol 
tion: X —— h iable: j=k+1 
summation pares i change of variable: j 


Solution First calculate the lower and upper limits of the new summation: 


Whenk=0, j=k+1=0+1=1. 
Whenk=6, j=k+1=6+1=7. 


Thus the new sum goes from j = 1 to j = 7. 
Next calculate the general term of the new summation. You will need to replace each 
occurrence of k by an expression in j: 


Since j =k+1,thenk = j-1. 
1 1 1 
k+1 G-)4+1 j 


Hence 


Finally, put the steps together to obtain 


1 
Er SL 
mkt j=1J E 


Equation (5.1.1) can be transformed further by noting that because the j in the right- 
hand summation is a dummy variable, it may be replaced by any other variable name, as 
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long as the substitution is made in every location where j occurs. In particular, it is legal to 
substitute k in place of j to obtain 


1 
= Èr 5.1.2 


Putting equations (5.1.1) and (5.1.2) together gives 


Ti 
> 
Il 


Sometimes it is necessary to shift the limits of one summation in order to add it to an- 
other. An example is the algebraic proof of the binomial theorem, given in Section 9.7. A 
general procedure for making such a shift when the upper limit is part of the summand is 
illustrated in the next example. 


When the Upper Limit Appears in the Expression to Be Summed 


ntl 


Rewrite the summation > LE so that the lower limit becomes 0 and the upper limit 
k=1 
becomes n but the index of the summation remains k. 


a. First, transform the summation by making the change of variable j = k—1. 


b. Second, transform the summation obtained in part (a) by changing all j’s to k’s. 


Solution 


a. The index variable k is bound by the summation symbol to take each of the values 
from | ton + 1 in succession. 
When k = 1, then j = 1 — 1 = 0, and when k = n +1, then j = (n+ 1)—-1 =n. 
So the new lower limit is 0 and the new upper limit is n. 
Now n is a constant with respect to the terms of the sum. Unlike k, its value does 
not change from one term to the next. In addition, since j = k— 1, then k=j+1. 
Thus 


k j+1 
n+k n+(j+1) 


and so the general term of the new summation is 


j+1 
n+(j+1) 
Therefore, 
n+1 k n i+] 
yy SS 5.13 
Qonutk fontGdt+y 


b. Changing all the j’s to k’s in the right-hand side of equation (5.1.3) gives 


"o j+l n k+l 
y= 5.14 
n+(j+1) n+(k+1) 


j=0 k=0 


Combining equations (5.1.3) and (5.1.4) results in 


ntl k n k+l 
A An 5 


k=1 k=0 
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Factorial and “n Choose r” Notation 


The product of all consecutive integers up to a given integer occurs so often in mathematics 
that it is given a special notation—factorial notation. 


Definition 


For each positive integer n, the quantity n factorial denoted n!, is defined to be the 
product of all the integers from 1 to n: 


al S asa = I)yee9 Bowe il. 


Zero factorial, denoted 0!, is defined to be 1: 


0! = 1. 


The definition of zero factorial as 1 may seem odd, but, as you will see when you read 
Chapter 9, it is convenient for many mathematical formulas. 


The First Ten Factorials 


0=1 1!1=1 
21=2:1=2 3!=3-2-1=6 
4) = 4-3-2-1 = 24 5! = 5-4-3-2-1 = 120 


6! = 6-5-4:3-2-1 = 720 7! = 7-6-5:4-3-2-1 = 5,040 
8! =8-7:6:5-4:3:2-1 9! =9-8-7:6:5-4-3-2-1 
= 40,320 = 362,880 E 


As you can see from the example above, the values of n! grow very rapidly. For instance, 
40! = 8.16 X 10%, which is a number that is too large to be computed exactly using the 
standard integer arithmetic of the machine-specific implementations of many computer 
languages. (The symbol = means “is approximately equal to.”) 

A recursive definition for factorial is the following: Given any nonnegative integer n, 


A f ifn=0 
ni = 


an- D! ifn= 
Caution! Note that maa=. iia =l 
n:(n— 1)! is to be inter- 


The next example illustrates the usefulness of the recursive definition for making 
preted as n-[(n— DI 


computations. 


Computing with Factorials 


Simplify the following expressions: 


8! 5! 1 1 (n+ 1)! n! 
a. = b. = tS d. e. 

7! 2!+3! 2!-4! 31-3! n! (n— 3)! 
Solution 

8! 8H 
a — = LA =8 

7! 1 

5! 5:4371 5-4 
= 3 = 10 
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1 1 Ei 1 3 1 4 by multiplying each numerator and denominator by just 
C. 21-4! F 31-3! 21-4! ` 3 + 31-3! i 4 what is necessary to obtain a common denominator 
3 4 

= F by rearranging factors 

3:214! 3!:4-3! 

3 4 

= + because 3-2! = 3! and 4-3! = 4! 

31-4! 31-4! 
_ 7 by the rule for adding fractions with a com- 
~ 144 mon denominator 


P (n+1)) (n+1)-nt 

i nio ní F 
n! n: (n— 1) (n— 2) (n35)! 

(n=3)! (n—3)! 


=n -3n +2n E 


n+1 


ê. =n-(n—1)-(n—2) 


An important use for the factorial notation is in calculating values of quantities, called 
n choose r, that occur in many branches of mathematics, especially those connected with 
the study of counting techniques and probability. 


Let n and r be integers with 0 = r <n. The symbol 


(") 


is read “n chooser” and represents the number of subsets of size r that can be chosen 
from a set with n elements. 


Observe that the definition implies that (’) will always be an integer because it is a num- 
ber of subsets. In Section 9.5 we will explore many uses of n choose r for solving problems 
involving counting, and we will prove the following computational formula: 


Formula for Computing (P) 


For all integers n and r with 0 S r S n, 


In this chapter, we show how to compute its values. Because n choose r is always an inte- 
ger, you can be sure that all the factors in the denominator of the formula will be canceled 
out by factors in the numerator. Many electronic calculators have keys for computing val- 
ues of (%). These are denoted in various ways such as nCr, C(n, r),"C,, and Cn, The letter 
C is used because the quantities (”) are also called combinations. Sometimes they are 
referred to as binomial coefficients because of the connection with the binomial theorem 
discussed in Section 9.7. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


270 CHAPTER5 SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION 


Example 5.1.17 


Example 5.1.18 


Computing (”) 


Use the formula for computing (’") to evaluate the following expressions: 


(a (" j ') 
b. Cc. 

4 n 
Solution 


B) 8! 
a. SS 
5) 58-5)! 


= ok f: p f cecal always cancel common factors before multiplying 
G-4-3-2-1)-G21) 
= 56. 


[A A L a 
‘\4]~ a-p 40 E320) 


The fact that 0! = 1 makes this formula computable. It gives the correct value because a 
set of size 4 has exactly one subset of size 4, namely itself. 


n+1 (n+1)! n+)! (nt+1)-nf 
j Horio wt MAMo 


n+1 a 
n 


Sequences in Computer Programming 


An important data type in computer programming consists of finite sequences. In com- 
puter programming contexts, these are usually referred to as one-dimensional arrays. For 
example, consider a program that analyzes the wages paid to a sample of 50 workers. Such 
a program might compute the average wage and the difference between each individual 
wage and the average. This would require that each wage be stored in memory for retrieval 
later in the calculation. To avoid the use of entirely separate variable names for all of the 
50 wages, each is written as a term of a one-dimensional array: 


W([L], W[2], W[3],..., W[50]. 


Note that the subscript labels are written inside square brackets. The reason is that until 
relatively recently, it was impossible to type actual dropped subscripts on most computer 
keyboards. 


Dummy Variable in a Loop 


The index variable for a for-next loop is a local, or dummy, variable. For example, the fol- 
lowing three algorithm segments all produce the same output: 


1. fori:= 1 ton 2. for j:= O0ton—1 3. fork :=2ton+1 
print a[i] print a[j + 1] print a[k — 1] 
next i next j next k 


The recursive definitions for summation, product, and factorial lead naturally to com- 
putational algorithms. For instance, here are two sets of pseudocode to find the sum of a[1], 
a(2],..., a[n]. The one on the left exactly mimics the recursive definition by initializing 
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the sum to equal a[1]; the one on the right initializes the sum to equal 0. In both cases the 
output is 2) alk]. 


s:=a{l] s:=0 

for k:=2ton for k:= 1 ton 
s := stalk] s := stalk] 

next k next k 


Application: Algorithm to Convert from Base 10 
to Base 2 Using Repeated Division by 2 


Section 2.5 contains some examples of converting integers from decimal to binary nota- 
tion. The method shown there, however, is only convenient to use with small numbers. A 
systematic algorithm to convert any nonnegative integer to binary notation uses repeated 
division by 2. 

Suppose a is a nonnegative integer. Divide a by 2 using the quotient-remainder theorem 
to obtain a quotient g[0] and a remainder r[0]. If the quotient is nonzero, divide by 2 again 
to obtain a quotient q[1] and a remainder r[1]. Continue this process until a quotient of 0 is 
obtained. At each stage, the remainder must be less than the divisor, which is 2. Thus each 
remainder is always either 0 or 1. The process is illustrated below for a = 38. (Read the 
divisions from the bottom up.) 


0 remainder = 1 = r[5] 


2/1 remainder = 0 = r[4] 


22 remainder = 0 = r[3] 

214 remainder = 1 = r[2] 

2 9 remainder = 1 = r[1] 

2 19 remainder = 0 = r[0] 


2 | 38 


The results of all these divisions can be written as a sequence of equations: 


38 = 19-2+0, 
19 =9-2+1, 
9=4-24+1, 
4=2-2+0, 
2=1-2+0, 
1=0-2+1. 


By repeated substitution, then 


38 = 19:2+0 
= (9:-2+1):2+0=9-27+1:2+0 
= (4-241)-27+1-2+0 =4-27+1-27+1-2+0 
= (2:2+0):2?+1-27+1-2+0 
= 2-24+40-23+1-27+1-2+0 
= (1:2+0)-:24+0-27+1-27+1:2+0 
1:2°+0-24+0-23+1-27+1:2+0. 
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Note that each coefficient of a power of 2 on the right-hand side of the previous page 
is one of the remainders obtained in the repeated division of 38 by 2. This is true for the 
left-most 1 as well, because 1 = 0-2 + 1. Thus 


38:19 = 100110, = ([5]r[4]r[3]r[2]7[1]7[0))>. 


In general, if a nonnegative integer a is repeatedly divided by 2 until a quotient of zero 
is obtained and the remainders are found to be r[0], 7[1],..., r[k], then by the quotient- 
remainder theorem each r[i] equals 0 or 1, and by repeated substitution from the theorem, 


a= rk] +2 lrk- 1] +--+ +2?-7f2]+2!-7f1] 4+ 2°70]. 51,5 
Thus the binary representation for a can be read from equation (5.1.5): 


ayy = (KAA 1] ++ A210). 


Converting from Decimal to Binary Notation Using Repeated Division by 2 


Use repeated division by 2 to write the number 29), in binary notation. 


Solution 
0 remainder = r[4] = 1 
211 remainder = r[3] = 1 
213 remainder = r[2] = 1 
2 7 remainder = r[1] = 0 
2 14 remainder = r[0] = 1 
2| 29 
Hence 29,9 = (r[4]r[3]r[2]r[1]r[0])> = 111012. m 


The procedure we have described for converting from base 10 to base 2 is formalized 
in the following algorithm: 


Algorithm 5.1.1 Decimal to Binary Conversion Using Repeated Division by 2 


[In Algorithm 5.1.1 the input is a nonnegative integer a. The aim of the algorithm is to 
produce a sequence of binary digits r[0], r[1], r[2], ..., r[k] so that the binary represen- 
tation of n is 


([A rik — 1] -- [2] 1]r10))>. 
That is, 


a = 2$- rk] +2! k—1] + +++ +27 +7[2] +2! [1] + 2°70]. 


Input: a [a nonnegative integer] 


Algorithm Body: 
g:i=a,i:=0 
[Repeatedly perform the integer division of q by 2 until q becomes 0. Store succes- 
sive remainders in a one-dimensional array r{0], r{\], r[2],..., r[k]. Even if the ini- 


tial-value of q equals 0, the loop should execute one time (so that r[0] is computed). 
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rli] := q mod 2 
g:=qdiv2 


i:=itl 
end while 
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Thus the guard condition for the while loop is i = 0 or q # 0.] 
while (i = 0 or q # 0) 


[r{i] and q can be obtained by calling the division algorithm.] 


[After execution of this step, the values of r[{0], r{\],..., rli- 1] are all 0’s and 1’s, 
and a = (r{i- 1r{i— 2] +++ r[2]r[1]r{(0])>.) 
Output: r[0], r[1], r[2],..., r[i— 1] [a sequence of integers] 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 
n 


1. The notation X} a; is read “___.” 


k=m 


2. The expanded form of X, a; is 


k=m 
3. The value of a; +a, +a3 + ++: +a, whenn = 2 is 


“cc 3 


n 
4. The notation [ [axis read “___.” 


k=m 


EXERCISE SET 5.1* 


5. If nis a positive integer, then n! = 


n 
6. Sa toy hy St es 
k=m 


k=m 
7. (1) 11) = 
k=m k=m 


Write the first four terms of the sequences defined by the 
formulas in 1-6. 


k 
1. ag = Tose for every integer k = 1. 
2. b;= a) for every integer j = 1. 
=) 
3. Ci 7 CU 


T for every integer i = 0. 
1\” : 
4. d,=1+ J for every integer m = 0. 
n . 
5. eg = He for every integer n = 0. 
n . 
6. f, = He for every integer n = 1. 


7. Let a, = 2k+ 1 and bj = (k— 1)? +k +2 for every 
integer k = 0. Show that the first three terms of 
these sequences are identical but that their fourth 
terms differ. 


Compute the first fifteen terms of each of the sequences 
in 8 and 9, and describe the general behavior of these 
sequences in words. (A definition of logarithm is given in 
Section 7.1.) 


8. g, = [log,n| for every integer n = 1. 


9. h, = nilog,n| for every integer n = 1. 
Find explicit formulas for sequences of the form aj, a2, 
a3, . . . with the initial terms given in 10-16. 
10. —1,1,—-1,1,—-1,1 


n. 0,1, —2,3, —4,5 
123456 
4 9 16’ 25’ 36’ 49 
11 11 11 11 11 1 
72 33 44 55 66 7 
14 9 16 25 36 
3° 9° 27 81° 243° 729 
12 34 56 
2S AS 6T 


12. 


13. 1 


14. 


15. 0, 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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16. 3,6, 12, 24, 48, 96 Write each of 43-52 using summation or product 
2n+(-1)"-1 notation. 
* 17. Consider the sequence defined by a, = ——--—— 
for every integer n = 0. Find an alternative explicit 43. 17-27 +3°?-47+5°-@ +7 
formula for a, that uses the floor notation. 44. = = D4 = b= =e -1 
18. Let dg = 2, a; = 3, a = —2, a3 = 1, a4 = 0, 45. ey Thee 1) (4 1) 
as = —1, and dg = —2. Compute each of the sum- 
mations and products below. 46. 2 3 + 4 5 re 6 
6 0 3 6 2 3-4 45 56 67 78 
a. a; b. a; ©: ay; d. a, @ ak 
> > > : II H 47. 1-r+r-r+r -r 
Compute the summations and products in 19-28. 48. 0-0 A-A- A-A A-^) 
3 4 3 34.534 23 3 
19. (k+ 1) 20. a 21. (Re +1) 49. 1° +2 +3 +: +n 
> II > 1 2. .- 3 n 


50. —+— +> ++ 
2! 31 4! (n+1)! 


4 1 0 
_1y ci „oj 
22. IK 1y 23. ZUEN 24. Z0 2 ETO T eee 


2 1 1 52 S a I a L 
25. {== 26. +3 ET 3! 4! n! 
ÚU- = ) n 
6 (4 1 5 i(i +2) Transform each of 53 and 54 by making the change of 
>| = 8. | [——— variable i = k+1. 
=—\n n+l zo (i71) (i+ 1) 5 n 
k(k— 1) 54. [| f 
Write the summations in 29-32 in expanded form. aes > ° +4 
k=0 k=1 
n+l] k+1 Transform each of 55-58 by making the change of vari- 
29. 2e 2)' 30. >i +1) 31. ry 32. Si!) ablef=i-1. 
i=l 
nth G— D? z i 
Evaluate the summations and products in 33-36 for the 3: > a 56. > ars 
ae : j=] SF 
indicated values of the variable. i = 
1 1 1 1 k i “n-itl 
33. Sta tat tane=l 57. 58. a 
: oS ne A (n-i? II n+ 
34. 1(1!) +22) + 3G!) + +++ +m@n!); m = 2 Write each of 59-61 as a single summation or product. 


s ( 1 )( 2 \ 3 J( k ses 59. 3D Ok— Z0- 
1+1/\2+1/\3+1 k+1 

(esye ( m-(m+1) ) 60. 2- ŞGÉ+4)+5: SR- I) 
36. 3-4 eee sm=1 


k=1 k=1 


6°7/\8-9 (m+ 2):(m+ 3) Pe (i k ) (=) 
Write each of 37-39 as a single summation. atk t1 zik +2 
ka 4 Zok m+1 
37. X Etk 38. > __ ta Compute each of 62-76. Assume the values of the 
s eeki m+2 : f 4 
i=l k=l variables are restricted so that the expressions are 
n m er defined. 
H 39. Sm+1)2 + (n+ 2)2 4! 6! 4! 
m= 2 63. — — 
0 62. 31 g! 64. 01 
Rewrite 40-42 by separating off the final term. i ! 
k+1 m+1 n+1 Ai (n = 1)! n: 
Ss 65. 66. 67. 
40. X i(i!) a. Ske 42. Sim(m+ 1) (n— 1)! (n+1)! (n—2)! 


i=1 k=1 m=1 
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(n+ 1)! n! n! Fill in the blanks below so that each algorithm 
68. (nly 69. (n—k)! 70. (n—k+1)! segment performs the same job as the one shown 
in the exercise statement. 
5 1 3 a. sum:=0 b. sum:=0 
71. 72. 73. 
J ) a) for i := 0 to for j := 2 to 
sum:=— sum:=—— 
5 n n+1 ; f 
74. 5 75. i 76. i next i next j 
n= n= 


Use repeated division by 2 to convert (by hand) the inte- 


77. a. Prove that n! + 2 is divisible by 2, for every : 
gers in 81-83 from base 10 to base 2. 


integer n = 2. 


b. Prove that n! + k is divisible by k, for every 81. 90 82. 98 83. 205 
integer n = 2 and k = 2,3,...,n. 
H c. Given any integer m = 2, is it possible to find a Make a trace table to trace the action of Algorithm 5.1.1 
sequence of m — 1 consecutive positive integers on the input in 84-86. 
none of which is prime? Explain your answer. 84. 23 85. 28 86. 44 


78. Prove that for all nonnegative integers n and r with ; . LS . 
a n= n 87. Write an informal description of an algorithm (us- 


r+l=n, +i p i : ing repeated division by 16) to convert a nonnega- 
i r tive integer from decimal notation to hexadecimal 


79. Prove that if p is a prime number and r is an inte- notation (base 16). 
ger with O < r < p, then P | is divisible by p. Use the algorithm you developed for exercise 87 to con- 
j vert the integers in 88-90 to hexadecimal notation. 
80. Suppose a[l], a[2], a[3],... , alm] is a one- 88. 287 89. 693 90. 2,301 
dimensional array and consider the following 
algorithm segment: 91. Write a formal version of the algorithm you devel- 


sum:= 0 oped for exercise 87. 


for k:= 1 tom 
sum := sum + a[k] 
next k 


ANSWERS FOR TEST YOURSELF 


1. the summation from k equals m to n of a-sub-k 


5.n'(n—1):::3:2-1(0r: n-(n—1)!) 6. S (a+ cby) 


k=m 


2. amt am+1 tamt +a, 3.a +a à 
4. the product from k equals m to n of a-sub-k 7. | [ab 


k=m 


EFJ Mathematical Induction |: Proving Formulas 


A good proof is one which makes us wiser. —I. Manin, A Course in Mathematical Logic, 1977 


In natural science courses, deduction and induction are presented as alternative modes of 
thought—deduction being to infer a conclusion from general principles using the laws of 
logical reasoning, and induction being to enunciate a general principle after observing it 
to hold in a large number of specific instances. Discovery of new mathematical facts often 
occurs through experimentation with examples, but mathematical induction as a proof 
technique is not inductive but deductive. Once proved by mathematical induction, a theo- 
rem is known just as certainly as if were proved by any other mathematical method. Thus, 
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in mathematics, inductive reasoning is used in the natural sciences sense, but only to make 
conjectures, not to prove them. For example, observe that 


row 1 l--= 
row 2 1-1 jae 2L 
2 3 3 

1 1 1 1 

row 3 1 1 1 = 
(aaa 


This pattern seems so unlikely to occur by pure chance that it is reasonable to conjecture 
(though it is by no means certain) that the pattern holds true in general. To use mathemati- 
cal induction to explore the conjecture, ask yourself: Is there something about the form of 
the pattern in one row that insures the pattern will be true in the next row? For instance, 
does the fact that the pattern is true in row 3 imply that it will also be true in a new row 4? 
To answer this question try substituting the right-hand side of the equation in row 3, namely 


L, into the expression (1 )(1 (1 HI 3). That is, 


mie (fot C-A-A-N- 


When you do this, you obtain 
1 1 1 1 1 1 14 
{= Ls = 1 1 . 


So the pattern does extend to row 4! Does the process also work for going from row 1 to 
row 2? Yes! When you substitute the right-hand side of the equation in row 1, namely 5, in 
place of (1 = 1) in the left-hand side of the equation in row 2, the result is 


ELE 


So the truth of the pattern in row 1 implies the truth of the pattern in row 2. Stop reading 
for a moment and use the same procedure to derive the truth of the pattern in row 3 from 
its truth in row 2 and then show that the same pattern extends to a new row 5. (This is ex- 
ercise 1 at the end of this section.) 

With this background you are ready to check out the general case. If you suppose that 
the pattern holds for an arbitrarily chosen row, can you show that the pattern holds for the 
next row? Suppose that k is any integer that is at least 2, and assume 


a eH 
(aalen) (ata) ae) 
a =a e ade er 
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Thus for any integer k greater than or equal to 2, 


«(ee (et) om (ean) 


This example illustrates the basic idea of the principle of mathematical induction. It shows 
that as long as the pattern holds in one row, then it has to hold in the next larger row. But 
this implies that the pattern holds in every row no matter how far down the table it might 
be. The reason is that since the pattern holds in row 1, it holds in row 2. And since it 
holds in row 2, it holds in row 3. And since it holds in row 3, it holds in row 4. And since it holds 
in row 4, it holds in row 5. And so on, and so on, forever! 


Principle of Mathematical Induction 


Let P(n) be a property that is defined for integers n, and let a be a fixed integer. Sup- 
pose the following two statements are true: 


1. P(@) is true. 
2. For every integer k = a, if P(x) is true then P(k + 1) is true. 


Then the statement 
for every integer n = a, P(n) 


is true. 


The first known use of mathematical induction occurs in the work of the Italian scientist 
Francesco Maurolico in 1575. In 1653 Blaise Pascal gave a clear description of the tech- 
nique, and in 1883 Augustus De Morgan (best known for De Morgan’s laws) gave the pro- 
cess the name mathematical induction. An equivalent logical inference rule, now known 
as the well-ordering principle (see Section 5.4), was used implicitly by mathematicians in 
ancient Greece, in the middle ages by Campanus of Novara, and in the seventeenth century 
by Pierre de Fermat, who called it the “method of infinite descent.” 

To visualize the idea of mathematical induction, imagine an infinite collection of domi- 
noes positioned one behind the other in such a way that if any given domino falls back- 
ward, it makes the one behind it fall backward also. (See Figure 5.2.1.) Then imagine that 
the first domino falls backward. What happens?... They all fall down! 


FIGURE 5.2.1 If the kth domino falls backward, it pushes the (k + 1)st domino backward also. 


To see the connection between this image and the principle of mathematical induction, 
let P(n) be the sentence “The nth domino falls backward.” It is given that for each k = 1, if 
P(k) is true (the kth domino falls backward), then P(k + 1) is also true (the (k + 1)st domino 
falls backward). It is also given that P(1) is true (the first domino falls backward). Thus by 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


278 CHAPTERS SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION 


the principle of mathematical induction, P(n) (the nth domino falls backward) is true for 
every integer n = 1. 

The validity of proof by mathematical induction is generally taken as an axiom. That is 
why it is referred to as the principle of mathematical induction rather than as a theorem. It 
is equivalent to the following property of the integers, which is easy to accept on intuitive 
grounds: 


Suppose S is any set of integers satisfying (1) a is in S, and (2) for every 
integer k = a, if kis in S then k + 1 is in S. Then S contains every inte- 
ger greater than or equal to a. 


To understand the equivalence of this formulation and the one given earlier, let S be the set 
of all integers for which P(n) is true. 

Proving a statement by mathematical induction is a two-step process. The first step is 
called the basis step, and the second step is called the inductive step. 


Method of Proof by Mathematical Induction 


Consider a statement of the form, “For every integer n = a, a property P(n) is true.” 
To prove such a statement, perform the following two steps: 


Step 1 (basis step): Show that P(a) is true. 


Step 2 (inductive step): Show that for every integer k = a, if P(k) is true then 
P(k+ 1) is true. To perform this step, 


suppose that P(x) is true, where k is any 
particular but arbitrarily chosen integer with k = a. 


[This supposition is called the inductive hypothesis./ 
Then 
show that P(k + 1) is true. 


The following example shows how to use mathematical induction to prove a formula for 
the sum of the first n integers. 


GBeuceeas Sum of the First n Integers 


Use mathematical induction to prove that 
_n(n+1) 
2 


E rA Se Pn for every integer n = 1. 


Solution To construct a proof by induction, you must first identify the property P(n). In 
this case, P(n) is the equation 


Note The property is n(n+ 1) 
just the equation. The 14+2+ s +n =—— 
proof will show that the 
equation is true for every [To see that P(n) is a sentence, note that its subject is “the sum of the integers from 1 ton” and 
integer n = 1. its verb is “equals.” ] 
In the basis step of the proof, you must show that the property is true for n = 1, or, in 
other words, that P(1) is true. Now P(1) is obtained by substituting 1 in place of n in P(n). 
The left-hand side of P(1) is the sum of all the successive integers starting at 1 and ending 
at 1. This is just 1. Thus P(1) is 


<— the property (P(n)) 


Note To write P(1), just i= 11+ 1) oan 
copy P(n) and replace 7° < basis (P(1)) 
each n by 1. 
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Of course, this equation is true because the right-hand side is 
11+1) 12 _ i 
2 2 i 
which equals the left-hand side. 
In the inductive step, you assume that P(k) is true, for a particular but arbitrarily chosen 
integer k with k = 1. /This assumption is the inductive hypothesis.] You must then show that 


P(k + 1) is true. What are P(k) and P(k + 1)? P(k) is obtained by substituting k for every n 
in P(n). Thus P(k) is 


Note To write P(k), just k(k+1) 
copy P(n) and replace Le 2be bk 5 | < inductive hypothesis (PW) 
each n by k. 


Similarly, P(k + 1) is obtained by substituting the quantity (k + 1) for every n that appears 
in P(n). Thus P(k + 1) is 


1424+ °° +(k+1)= a nee 


or, equivalently, 


Note To write P(k+ 1), (k+ 1)\(k+2) 
just copy P(n) and replace Lek 2ab er eck (Ker 1) = a i, 
each n by (k+ 1). 


<— to show (P(k + 1)) 


Now the inductive hypothesis is the supposition that P(k) is true. How can this sup- 
position be used to show that P(k + 1) is true? P(k + 1) is an equation, and the truth of an 
equation can be shown in a variety of ways. One of the most straightforward is to use the 
inductive hypothesis along with algebra and other known facts to separately transform the 
left-hand and right-hand sides until you see that they are the same. In this case, the left- 
hand side of P(k + 1) is 


1+2+-+-+(k+)), 


which equals 
The next-to-last term is k because the terms are 
successive integers and the last term is k + 1. 


(1+2+-::+kh+(k+1) 
By substitution from the inductive hypothesis, 


(1+2+ ++ +h +(k+1) 


k(k+ 1) since the inductive hypothesis says that 
=e  +&+1) 1424-044 = 28 

kk+1)  2(k+1) by multiplying the numerator and denominator 
= + of the second term by 2 to obtain a common 

2 2 denominator 
2 
Z kK+k 2k+2 by multiplying out the two numerators 
2 2 

ER E by adding fractions with the same denominator 

= a and combining like terms. 


2437 
So the left-hand side of P(k + 1) is ==. Now the right-hand side of P(k + 1) is 


(k+1Xk+2) K +3k+2 
2 2 
Thus the two sides of P(k + 1) are equal to each other, and so the equation P(k + 1) is true. 


by multiplying out the numerator. 
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This discussion is summarized as follows: 


Theorem 5.2.1 Sum of the First n Integers 


For every integer n = 1, 
n(n + 1) 
z 


1+2+ -+n = 


Proof (by mathematical induction): Let the property P(n) be the equation 


n(n+ 1) 
ae ale oe < Po) 
Show that P(1) is true: 
To establish P(1), we must show that 
104+ 1) 
= <— P(1) 
2 
But the left-hand side of this equation is 1 and the right-hand side is 
i+) 2 
2 2 


also. Hence P(1) is true. 

Show that for every integer k = 1, if P(k) is true then P(k +1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 1. That is:] 
Suppose that k is any integer with k = 1 such that 


k(k+ 1 
peiit eee =P) , 
2 inductive hypothesis 
[We must show that P(k + 1) is true. That is:] We must show that 
(k+ Dİ(k+1)+1] 


2 2 


1+2434+---+(k+1) = 


or, equivalently, that 
(k + 1)(k +2) 
B 


[We will show that the left-hand side and the right-hand side of P(k + 1) are equal to 
the same quantity and thus are equal to each other.] 


The left-hand side of P(k + 1) is 


LEZF3 +t (kF) = < P(k+1) 


1+2+3+- +(k+1) 
=] +2+3+--+k+(k+1) by making the next-to-last 


term explicit 
k(k + 1) j 
= 2 +(k+1) by substitution from the 
inductive hypothesis 


_ Kk+N) , 2k+ 1) 
5 2 
K+k 2k+2 
=+ 
2 2 
K+3k+2 
3 


by algebra. 
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And the right-hand side of P(k + 1) is 


(k+1Xk+2) K+3k+2 
2 2 l 
Thus the two sides of P(k + 1) are equal to the same quantity and so they are equal 
to each other. Therefore, the equation P(k + 1) is true fas was to be shown]. 


[Since we have proved both the basis step and the inductive step, we conclude that the 
theorem is true.] 


The story is told that one of the greatest mathematicians of all time, Carl Friedrich 
Gauss (1777-1855), was given the problem of adding the numbers from 1 to 100 by his 
teacher when he was a young child. The teacher had asked his students to compute the sum, 
supposedly to gain himself some time to grade papers. But after just a few moments, Gauss 
produced the correct answer. Needless to say, the teacher was dumbfounded. How could 
young Gauss have calculated the quantity so rapidly? In his later years, Gauss explained 
that he had imagined the numbers paired according to the following schema. 


ee eee 50 S51......98 99 100 


oa 


sum is 101 
sum is 101 
sum is 101 


The sum of the numbers in each pair is 101, and there are 50 pairs in all; hence the total 
sum is 50-101 = 5,050. 


Definition 


If a sum with a variable number of terms is shown to equal an expression that does 
not contain either an ellipsis or a summation symbol, we say that the sum is written 
in closed form. 


— nant) 
2 


For example, writing 1 +2+3-+:--+n 
in closed form. 


expresses the sum 1+2+3+°::+n 


eee) Applying the Formula for the Sum of the First n Integers 


a. Evaluate 2 + 4+6 +--+ 500. 

b. Evaluate 5+6+7+8+-::: +50. 

c. For an integer h = 2, write 1 +2 +3 +:-::+(h-— 1) in closed form. 
Solution 


a. 2+4+6+: +500 = 2 (1 +2+3+: +250) 


=2. 250:251 by applying the formula for the sum 
2 of the first n integers with n = 250. 


= 62,750 
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b. 5+6+7+8+ °° +50 = (142434 °°: +50)-(1+24+34+4) 


= 50-51 by applying the formula for the sum 
2 of the first n integers with n = 50. 


= 1,265 


(h-1) [(h —1)+ 1] by applying the formula for the sum 
= 2 of the first n integers with n = h — 1 


 (h=1)h 
= 3 


c 1+2+3+ +(A—1) 


since (h— 1)+1 = h. E 


The next example asks for a proof of another famous and important formula in math- 
ematics—the formula for the sum of a geometric sequence. In a geometric sequence, each 
term is obtained from the preceding one by multiplying by a constant factor. If the first 
term is 1 and the constant factor is r, then the sequence is 1, r, r^, P, ...,r',.... The sum 


of the first n terms of this sequence is given by the formula 


n , tty 


> 


i= 


r= 


for every integer n = 0 and every real number r not equal to 1. The expanded form of the 
formula is 
yr =j 
porter 4/7" = =f 
Note This is the defini- J 
tion of 0° given by Donald and because I = 1 and r! = r, the formula for n = 1 can be rewritten as 


E. Knuth in The Art of 


n+l 
Computer Programming, |rt iep — l 
Volume 1: Fundamental awe r r-1 ` 
Algorithms, 3rd Edi- P 
tion (Reading, Mass.: In some mathematical contexts 0° is regarded as indeterminate. In discrete mathematics 
n 
Addison-Wesley, 1997), we usually define 0° = 1 so that when we write X,” we do not need to give special atten- 
p. 57. tion to the case r = 0. i=0 


Gees) Sum of a Geometric Sequence 


P ntl E 
Prove that >, r= 


r 
i=0 


7=7 > for every integer n = 0 and every real number r except 1. 


Solution In this example the property P(n) is again an equation, although in this case it 
contains a real variable r. 


5 y = ——_. | ethe property (P(n)) 


Because r can be any real number other than 1, the proof begins by supposing that r is a 
particular but arbitrarily chosen real number not equal to 1. Then the proof continues by 
mathematical induction on n, starting with n = 0. In the basis step, you must show that P(0) 
is true; that is, you show the property is true for n = 0. So you substitute 0 for each n in P(n): 


0+1 
r =] 


0 
5 r = ———. | — basis (P(0)) 
a p=] 
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Note To write P(0), copy 
P(n) and replace each n 
by 0. 
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In the inductive step, you suppose k is any integer with k = 0 for which P(&) is true; that is, 
you suppose the property is true for n = k. So you substitute k for each n in P(n): 


< inductive hypothesis (P(k)) 


Then you show that P(k + 1) is true; that is, you show the property is true for n = k + 1. So 
you substitute k + 1 for each n in P(n): 


k+1 (k+1)+1 
i_?r ey 
Sie — = 
i=0 r-l 
or, equivalently, 
k+l t2 =] 
yr = ——.. |< to show (P(k+ 1)) 
i=0 r=1 


In the inductive step for this proof we use another common technique for showing that 
an equation is true: We start with the left-hand side and transform it step-by-step into the 
right-hand side. To do so, we use the inductive hypothesis together with algebra and other 
known facts. 


Theorem 5.2.2 Sum of a Geometric Sequence 


For any real number r except 1, and any integer n = 0, 
ael 1 


n i r 
Sr- 
E ral 
Proof (by mathematical induction): Suppose r is a particular but arbitrarily cho- 


sen real number that is not equal to 1, and let the property P(n) be the equation 


n p ail 


l 


<— P(n) 
i=0 pond " 


We must show that P(n) is true for every integer n = 0. We do this by mathematical 
induction on n. 

Show that P(0) is true: 

To establish P(O), we must show that 


0 ;_ Pt] 
5r oy, <— P(0) 
i=0 


The left-hand side of this equation is r? = 1 and the right-hand side is 


ply r-l 
= =1 


=i r—-l 


also because r! = r and, since r #1, r—1# 0. Hence P(0) is true. 


Show that for every integer k = 0, if P(k) is true then P(k + 1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 0. That is:] 


(continued on page 284) 
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Let k be any integer with k = 0, and suppose that 


Note To write P(k), copy ko, Pligg 
L . . : 
P(n) and replace each n ` r= ok <— P(k) inductive hypothesis 
by k. i=0 
[We must show that P(k + 1) is true. That is:] We must show that 
S , petit) _ 4 
a 
i=0 pl 
or, equivalently, that 
Note To write P(k+ 1), k+l kt2_ 
copy P(n) and replace 5 r= pma <—P(k+1) 
each n by k+1 i=0 . 


[We will show that the left-hand side of P(k + 1) equals the right-hand side.] 
The left-hand side of P(k + 1) is 


k+1 k 
i i Fil 
`r = Şr+r by writing the (k + 1)st term 
i=0 i=0 separately from the first k terms 
k+1 
I=] E ; ; 
= + 11 by substitution from the inductive 
r= hypothesis 
Śl] rt r—1) by multiplying the numerator and 
= =j Ps | denominator of the second term 


yi ki by (r — 1) to obtain a common 
(r* —1)+ t (r—1) denominator 
by adding fractions 


r—-1 
k+1 k+2__ k+l 
peeler =r by multiplying out and using the fact 
7 r—-1 that “t! -r = tl. r! = t? 
ft? 


= =. by canceling the pth 
p= 


which is the right-hand side of P(k + 1) /as was to be shown]. 


[Since we have proved the basis step and the inductive step, we conclude that the theorem 
is true.] 


Proving an Equality 


The proofs of the basis and inductive steps in Examples 5.2.1 and 5.2.3 illustrate two dif- 
ferent ways to show that an equation is true: (1) transforming the left-hand side and the 
right-hand side independently until they are seen to be equal, and (2) transforming one side 
of the equation until it is seen to be the same as the other side of the equation. 

Sometimes people use a method that they believe proves equality but that is actually in- 
valid. For example, to prove the basis step for Theorem 5.2.2, they perform the following steps: 


0 i #7 —1 
n 
= r-1 
0 r'-1 
p= 
r= 
r=] 
1= 
y=] 
1=1 
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The problem with this method is that starting from a statement and deducing a true conclu- 
sion does not prove that the statement is true. A true conclusion can also be deduced from a 
false statement. For instance, the steps below show how to deduce the true conclusion that 


Caution! Don’t try to 
1 = 1 from the false statement that 1 = 0: 


prove an equality by 


assuming it is true and 1=0 sce 
deducing a true statement; 0=1 
that form of argument is 
invalid! 1+0=0+1 
1=1 <— true 


When using mathematical induction to prove formulas, be sure to use a method that 
avoids invalid reasoning, both for the basis step and for the inductive step. 


Deducing Additional Formulas 


The formula for the sum of a geometric sequence can be thought of as a family of different 
formulas in r, one for each real number r except 1. 


Applying the Formula for the Sum of a Geometric Sequence 


In each of (a) and (b) below, assume that m is an integer that is greater than or equal to 3. 
Write each of the sums in closed form. 


a. 1+34+3°+:--+3"% 
bo es ee a 


Solution 
gona] =] by applying the formula for the sum of a 
24. ee e 
a. 1+3 +3" + +3 E F] geometric sequence with r = 3 andn = m—2 
ginal =j 
2 


b. 32 +37 +344: +3" = 3 (1+3 +3. +43") by factoring out 3° 


aul | 
=9- ee by part (a). ii) 


As with the formula for the sum of the first n integers, there is a way to think of the 
formula for the sum of the terms of a geometric sequence that makes it seem simple and 
intuitive. Let 


S,=l+rtPtet7, 


Then 
rS =rtrtrste te), 
and so 
rS -Sp = (HPHP rrt te +7) 
==, 5.2.1 
Now 
Sn — Sn = (r= Sp, $32 
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Equating the right-hand sides of equations (5.2.1) and (5.2.2) and dividing by r— 1 gives 
prl_4 


ral 


n 


This derivation of the formula is quite convincing. However, it is not as logically airtight 
as the proof by mathematical induction. To go from one step to another in the previous cal- 
culations, the argument is made that each term among those indicated by the ellipsis (. . .) 
has such-and-such an appearance and when these are canceled such-and-such occurs. But 
it is impossible actually to see each such term and each such calculation, and so the ac- 
curacy of these claims cannot be fully checked. With mathematical induction it is possible 
to focus exactly on what happens in the middle of the ellipsis and verify without doubt that 
the calculations are correct. 


TEST YOURSELF 


1. Mathematical induction is a method for proving (a) In the basis step one must show that 
that a property defined for integers n is true for all (b) In the inductive step one supposes that 
values of n that are ———— ~ . ———— fora particular but arbitrarily chosen 


a ; isd 
2. Let P(n) be a property defined for integers n and AU or ENRERE ee TS Suppo on 


‘ . see called the . One then has to show 
consider constructing a proof by mathematical in- 
i é E ” that 
duction for the statement “P(n) is true for all n = a. 
EXERCISE SET 5.2 
1. Use the technique illustrated at the beginning of 3. For each positive integer n, let P(n) be the formula 


this section to show that the statements in (a) and 
(b) are true. 


1242+- tea n(n + 1)Qn+ 1) 


I 1 I 1) 1 i 
a. If|1 5 1 3 1 ri 1 5] 5 then 
a. Write P(1). Is P(1) true? 
l 1 i 1 1 1 j 1 i 1\ 1 b. Write P(k). 
2 3 4 5 6 6 c. Write P(k+ 1). 
d. Ina proof by mathematical induction that the 
1 1 1 1 1 1 i 
b. If ( y(t J4 J 5) (i 5) 6 then formula holds for every integer n = 1, what 


must be shown in the inductive step? 


1 l 1 ! 1 l 1 l 1 ! 1 ! =l 4. For each integer n with n = 2, let P(n) be the 


n=1 


2. For each positive integer n, let P(n) be the formula 5 w= n(in— 1)(n+ 1) 
= <a — 


1+3+5+ + (2n-1) =n’, i=1 


a. Write P(1). Is P(1) true? a. Write P(2). Is P(2) true? 

b. Write P(k). b. Write P(k). 

c. Write P(k + 1). c. Write P(k+ 1). 

d. In a proof by mathematical induction that the d. In a proof by mathematical induction that the 
formula holds for every integer n = 1, what formula holds for every integer n = 2, what 
must be shown in the inductive step? must be shown in the inductive step? 
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5. Fill in the missing pieces in the following proof 
that 


7. For every integer n = 1, 


n(5n— 3) 
r 1+6+11+16+:+(5n- 4) = ——, 
1+3+5+ = +02n-1)=n 2 


for every integer n = 1. 
Proof: Let the property P(n) be the equation 


8. For every integer n = 0, 


14242 + +2 =at], 


1+3+5+ 0 +(2n-1)=n. —Pin) 


9. For every integer n = 3, 
Show that P(1) is true: To establish P(1), we must E 
show that when 1 is substituted in place of n, the pA A da AE =O 
left-hand side equals the right-hand side. But when 3 
n = 1, the left-hand side is the sum of all the odd 
integers from | to 2-1 — 1, which is the sum of the 
odd integers from | to | and is just 1. The right-hand 
side is _@) , which also equals 1. So P(1) is true. 
Show that for every integer k = 1, if P(k) is true 
then P(k + 1) is true: Let k be any integer with 


Prove each of the statements in 10-18 by mathematical 
induction. 


2_ n(n+1)(2n+ 1) 
6 


10. 17+2?+---+n 


, for every 


integer n = 1. 


n(n+ 1)}? 
k=l. n. 1+2 + = a , for every integer 
[Suppose P(k) is true. That is:] Suppose n= 1. 
1+3+5+ = +2k-1)= ©). -rw E. 1 no 
[This is the inductive hypothesis. ] 12. 1-2 2-3 n(n+ 1) n+l’ SS 

[We must show that P(k + 1) is true. That is:] We integer n = 1. 

m —1)\(n+1 
muse shy that 3. Sth = lle for every integer 

© = 0. <— P(k+1) i=l : 
n=2. 


Now the left-hand side of P(k + 1) is 
1+3+5+--:+(2(k+1)—-1) 


nt+1 
Sieg = n-2"*? +2, for every integer n = 0. 
i=l 


14. 


=1+34+5+-::+(2k+1)  byalgebra 


n 


=[1+3+54+-:: +(2k—-1)]+(2k+1) H 15. 5i) = (n+ 1)!— 1, for every integer n = 1. 
the next-to-last term is 2k — 1 because __©) _ F i i 1 i 
+ 
=? +(2k+1) by ®© 16. (1-—][1-)---(1-] = for every 
= (k +1) iih = A n an 
vies integer n = 2. 
which is the right-hand side of P(k + 1) [as was to i i i 
be shown]. 17. Ils i FF 3) = nT for every integer 
[Since we have proved the basis step and the induc- izo \2t t Gi Car?) 
n=0. 


tive step, we conclude that the given statement is 


true.] 

n 1 1 ; 
Note: This proof was annotated to help make its 18. i: z } =F for every integer n = 2. 
logical flow more obvious. In standard mathemati- a 


cal writing, such annotation is omitted. Hint: See the discussion at the beginning of this 


, : , section. 
Prove each statement in 6-9 using mathematical 


induction. Do not derive them from Theorem 5.2.1 or 
Theorem 5.2.2. 


19. (For students who have studied calculus) Use 


mathematical induction, the on rule from 
d(x) 


x 
calculus, and the facts that -4% = 1 and that 
x) = xaf to prove that for every integer n = 1, 
d(x") o 
dx 


6. For every integer n = 1, 


2+4+6+ =- +2n=R +n. 


n=l 
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Use the formula for the sum of the first n integers and/or Guess a general formula and prove it by mathemati- 
the formula for the sum of a geometric sequence to evalu- cal induction. 


ate the sums in 20-29 or to write them in closed form. . ; 
H 33. Find a formula in n, a, m, and d for the sum 


20. 4+8+12+16+::-+200 (a+ md) +(a+(m+1)d) + (a+ (m+ 2)d)+°°°+ 


21. 5+10+15+20+---+300 (a+ (m + n)d), where m and n are integers, n = 0, 


and a and d are real numbers. Justify your answer. 
22. a 3+4+5+6+:--+ 1000 


b. 3+44+5+6+:::+m 34. Find a formula in a, r, m, and n for the sum 


23. a. 7+8+9+10+-:-+600 ar” + art! + at? +- tar 
b. 7+84+9+4+10+°:-+k 


> 


where m and n are integers, n = 0, and a and r are 


24. 1+2+3+°:-+(k—1), where kis any integer with real numbers. Justify your answer. 
k=2., 
‘ 35 35. You have two parents, four grandparents, eight 
25. a. 1+ a 2 K e2 i great-grandparents, and so forth. 
b. 2+ 2 T oa apne ented a. If all your ancestors were distinct, what would 
C22 re + +2" be the total number of your ancestors for the 
26. 3+37+3°>+---+3”, where n is any integer with past 40 generations (counting your parents’ gen- 
n=l. eration as number one)? (Hint: Use the formula 
ee eee $ ; : ; for the sum of a geometric sequence.) 
27. 5+5 +5 +++" +5", where k is any integer with b. Assuming that each generation represents 
k= 3. 25 years, how long is 40 generations? 
1 1 1 . ae c. The total number of people who have ever lived 
~ 145454045 i 
side 2 2 2” Wiehe IS ay POSING Tate EE is approximately 10 billion, which equals 10'° 
29. 1—2+2?—23+---+(—1)" 2", where n is any people. Compare this fact with the answer to 
positive integer. part (a). What can you deduce? 
30. Observe that Find the mistakes in the proof fragments in 36-38. 
t+ 36. Theorem: For any integer n = 1, 
ll i FR peen a OTCE 
Ea Oe Sie 6 
I a a 2 “Proof (by mathematical induction): Certainly 
+ = the theorem is true for n = 1 because 17 = 1 and 
1:3 3:5 5:7 7 10+ DQ-14+1) _ , 
1 po 1 1 4 1 4 ~~~ = 1. So the basis step is true. For 
130355 557 7:9 F the inductive step, suppose that k is any integer with 
k(k + 1)(2k + 1 
Guess a general formula and prove it by mathemati- k=1,Ż = une no We must show 
cal induction. k+Ik+ D+ DQK+1) +1 
that (k +1)? = (K+ Dk + 1) - NK + 1) dos 
31. Compute values of the product 
1 1 1 1 H 37. Theorem: For any integer n = 0, 
14 14 Lae a E Le 
l 2 2 K 1424+242 = t], 
for small values of n in order to conjecture a general 
formula for the product. Prove your conjecture by “Proof (by mathematical induction): 
mathematical induction. Let the property P(n) be 
2 n _ antl 
H 32. Observe that 1+2+2 + +2 =2 =l. 
1=1 Show that P(0) is true: 
1-4=-(14+2) The left-hand side of P(0) is 1+2+27+-:-+2°=1 
1-449 =14243 and the right-hand side is 2*'— 1 = 2—1 = 1 also. 
1-44+9-16 = -(1+2+3+4) So P(O) is true.” 


1-—4+9-—164+25 = 142137445. 
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H 38. Theorem: For any integer n = 1, So 10!) =2!-1 
n and 1=1. 
ie) = Ger 11, Thus P(1) is true.” 
i=l 
“Proof (by mathematical induction): Let the *39. Use Theorem 5.2.1 to prove that if m and n are any 
property positive integers and m is odd, then Dea (n+k) 
ñ is divisible by m. Does the conclusion hold if m is 
P(n) be > ii!) =(n+1)!-1. even? Justify your answer. 
i=l 
Show that P(1) is true: When n = 1, H* 40. Use Theorem 5.2.1 and the result of exercise 10 to 
prove that if p is any prime number with p = 5, 
5 (iD =(+1!-1. then the sum of the squares of any p consecutive 
S integers is divisible by p. 


ANSWERS FOR TEST YOURSELF 


1. greater than or equal to some initial value 2. (a) P(a) is true (b) P(A) is true; inductive hypothesis; P(k + 1) is true 


EÆJ Mathematical Induction Il: Applications 


[Mathematical induction is] the standard proof technique in computer science. 
—Anthony Ralston, 1984 


In Section 5.2 we showed how to use mathematical induction to prove formulas. In this 
section we will show how to apply it in a broader variety of situations. 

As a first example consider the argument that the U.S. penny should be eliminated be- 
cause it isn’t profitable to produce. Due to inflation and the rising cost of metals, it actually 
costs more than one cent to produce a penny. If the penny were eliminated and another 
coin worth 3¢ were introduced, what prices could be paid using only 3¢ and 5¢ coins? The 
table below shows some examples. 


Number of Cents How to Obtain It 
3¢ 3¢ 
S¢ S¢ 
8¢ 3¢+5¢ 
9¢ 3¢+3¢+3¢ 
10¢ 5¢ + 5¢ 
ll¢ 3¢ +3¢ + 5¢ 
12¢ 3¢+3¢+3¢+3¢ 
13¢ 3¢+5¢+5¢ 
14¢ 3¢+3¢+3¢+5¢ 
15¢ S¢+5¢+5¢ 
16¢ 3¢+ 3¢+5¢+5¢ 
17¢ 3¢+3¢+3¢+3¢+5¢ 
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Can the table be continued indefinitely to show that every larger price could be paid 
with the two coins? This question is similar to the one raised in the opening example in 
Section 5.2, which involved showing that if the property illustrated in the table was true for 
an arbitrarily chosen row k, then it was also true for row k+ 1. To use a similar technique 
for this example, suppose that a collection of coins worth k¢ can be obtained using 3¢ and 
5¢ coins. The challenge is to show that a collection of coins worth (k + 1)¢ can be obtained 
using 3¢ and 5¢ coins. 

To meet the challenge observe that just one of two situations must occur: either the col- 
lection worth k¢ contains a 5¢ coin or it does not. If you can obtain k¢ using at least one 
5¢ coin, then you can obtain (k+ 1)¢ by replacing the 5¢ coin by two 3¢ coins, as shown 
in Figure 5.3.1. 


k¢ (k+ 1)¢ 


Replace a 5¢ coin by 


two 3¢ coins. 


Remove Add 
FIGURE 5.3.1 


On the other hand, if you can obtain k¢ without using a 5¢ coin, then you need to use 3¢ 
coins exclusively. If the total is more than 8¢, then three or more 3¢ coins must be included, 
in which case you can replace three of the 3¢ coins by two 5¢ coins to obtain a total of (k + 1)¢, 
as shown in Figure 5.3.2. 


(k+ 1)¢ 


kg 
O Replace three 3¢ coins 
O © by two 5¢ coins. 


Remove Add 
FIGURE 5.3.2 


This discussion shows that for any integer k that is at least 8, if you can obtain k¢ using 
3¢ and 5¢ coins, then you can obtain (k + 1)¢ using 3¢ and 5¢ coins. This is essentially the 
inductive step of a proof by mathematical induction, and the fact that you can obtain 8¢ 
using one 3¢ and one 5¢ coin provides the basis step for the induction. 

In Section 5.2 the properties proved by mathematical induction were all equations. In 
this section the properties are more general sentences. For this example the sentence is “ng 
can be obtained using 3¢ and 5¢ coins,” and the proof by mathematical induction shows 
that the sentence is true for every integer greater than or equal to 8. 
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Note To write P(8), just 
copy P(n) and replace 
each n by 8. 


Note To write P(k), just 
copy P(n) and replace 
each n by k. 


Note To write P(k+ 1), 
just copy P(n) and replace 
each n by k+ 1. 
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Proposition 5.3.1 


For every integer n = 8, n¢ can be obtained using 3¢ and 5¢ coins. 


Proof (by mathematical induction): 
Let the property P(n) be the sentence 


n¢ can be obtained using 3¢ and 5¢ coins. < P(n) 


Show that P(8) is true: 
P(8) is true because 8¢ can be obtained using one 3¢ coin and one 5¢ coin. 
Show that for every integer k = 8, if P(k) is true then P(k + 1) is also true: 


[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 8. That is:] 
Suppose that k is any integer with k = 8 such that 


k¢ can be obtained using 3¢ and 5¢ coins. <— P(k) 
inductive hypothesis 


[We must show that P(k + 1) is true. That is:] We must show that 


(k+ 1)¢ can be obtained using 3¢ and 5¢ coins. = <—P(k+1) 
Case 1 (There is a 5¢ coin among those used to make up the k¢.): In this case re- 
place the 5¢ coin by two 3¢ coins; the result will be (k + 1)¢. 


Case 2 (There is not a 5¢ coin among those used to make up the k¢.): In this case, 
because k = 8, at least three 3¢ coins must have been used. So remove three 3¢ coins 
and replace them by two 5¢ coins; the result will be (k + 1)¢. 

Thus in either case (k + 1)¢ can be obtained using 3¢ and 5¢ coins /as was to be shown]. 


[Since we have proved the basis step and the inductive step, we conclude that the propo- 
sition is true.] 


The basic outlines of the proofs in the remainder of this section are the same in all 
cases, but the details of the basis and inductive steps differ quite a lot from one to another. 
Proving a Divisibility Property 
Use mathematical induction to prove that for each integer n = 0, 2" — 1 is divisible by 3. 


Solution As in the previous proofs by mathematical induction, you need to identify the 
property P(n). In this example, P(n) is the sentence 


27" — 1 is divisible by 3. < the property (P(n)) 


By substitution, the statement for the basis step, P(O), is 


2? 9 — 1 is divisible by 3. < basis (P(0)) 


The supposition for the inductive step, P(x), is 


ak — | is divisible by 3, <— inductive hypothesis (P(k)) 


and the conclusion to be shown, P(k + 1), is 


2°**1) _ 1 is divisible by 3. < to show (P(k+ 1) 
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Recall that an integer m is divisible by 3 if, and only if, m = 3r for some integer r. Now 
the statement P(0) is true because 27° — 1 = 2°— 1 = 1— 1 = 0, which is divisible by 3 
because 0 = 3-0. 

To prove the inductive step, you suppose that k is any integer greater than or equal to 0 
such that P(x) is true. In other words, you suppose that 2°% — 1 is divisible by 3. You must 
then prove the truth of P(k + 1). Thus, you must show that 2°k*+)) _ 1 is divisible by 3. Now 


2k+D _ y= 92kt2_ 


= grt 2 —1 _ by the laws of exponents. 
=2*.4-]1. 
Your aim is to show that 2™%-4— 1 is divisible by 3, but why should that be so? Both 


27*.4—1 and 2% -— 1 have a lot in common, and, by the inductive hypothesis, 2 = 1 is 
divisible by 3. Observe what happens if you subtract 27* — 1 from 2*-4 — 1: 


7*.4— 1 —(27*—1) = 2*3, 
————— i—i — y 


ji t t 
divisible by 3? divisible by 3 divisible by 3 
Adding 2% — 1 to both sides gives 


2.4 — 1 = 27*.3 + (27*— 1), 


T t t 
divisible by 3? divisible by 3 divisible by 3 


Both terms of the sum on the right-hand side of this equation are divisible by 3; hence 
the sum is divisible by 3. (See exercise 15 of Section 4.3.) Therefore, the left-hand side of 
the equation is also divisible by 3, which is what was to be shown. 

This discussion is summarized as follows: 


Proposition 5.3.2 


For each integer n = 0, 2*" — 1 is divisible by 3. 


Proof (by mathematical induction): Let the property P(n) be the sentence “2” — 1 
is divisible by 3.” 


27" — 1 is divisible by 3. < Pin) 


Show that P(O) is true: 
To establish P(0), we must show that 
Note To write P(0), just 


copy P(n) and replace 270 — 1 is divisible by 3. — P(0) 
each n by 0. 


But 


79_1=979-1=1-1=0, 


and 0) is divisible by 3 because 0 = 3-0. Hence P(0) is true. 


Show that for any integer k = 0, if P(k) is true then P(k + 1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 0. That is:] 


Note ene Let k be any integer with k = 0, and suppose that 


copy P(n) and replace 
each n by k. 


a — | is divisible by 3. <— P(k) inductive hypothesis 
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By definition of divisibility, this means that 
2*-1=3r for some integer r. 


. [We must show that P(k + 1) is true. That is:] We must show that 
Note To write P(k+ 1), 


es Teplice 2°K+)_ 1 is divisible by 3. — P(k +1) 
eachn by k+ 1. 
Now 


k+) 1 = 2%+2_] 


= J : 7 -1 by the laws of exponents 
=2*.4-—1 

=2%*3+1)-1 

= 2%*.3+ oO —1) by the laws of algebra 
= on ower a by inductive hypothesis 
= 30" +r) by factoring out the 3. 


But 2% + ris an integer because it is a sum of products of integers, and so, by defini- 
tion of divisibility, 2*“* ! — 1 is divisible by 3 [as was to be shown]. 

[Since we have proved the basis step and the inductive step, we conclude that the propo- 
sition is true.] 


Another way to prove the inductive step for a divisibility property is illustrated in the 
answer for exercise 11 at the end of this section. It is shown in Appendix B. 


CESE Proving an Inequality 


Use mathematical induction to prove that for each integer n = 3, 
2"7+1<2" 


Solution In this example the property P(n) is the inequality 


2nt+1 <2". < the property (P(n)) 


By substitution, the statement for the basis step, P(3), is 


2-3+1< 2. <— basis (P(3)) 


The supposition for the inductive step, P(k), is 


2k+1< zk < inductive hypothesis (P(k)) 


and the conclusion to be shown is 


Wk+IY+1<2**1) cto show (PTD) 


To prove the basis step, observe that the statement P(3) is true because 2:3 + | = 7, 2 = 8, 
and 7 < 8. 

For the inductive step you assume the inductive hypothesis that for a particular but 
arbitrarily chosen integer k = 3, 2k+1< 2*, and then you must show that 2k+ 1 < ee 
With inequality proofs you can often apply the inductive hypothesis at an early stage. In 
this case, you can substitute from the induction hypothesis to obtain 


Ak+1)+1 = 2k+1)+2 < 2+2. 
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At this point you need to think about your goal. You want the right-hand side of the in- 
equality to be 2**! | Since you already know that 2(k+1)+1< 2*+2, an easy way to 
complete your job would be to show that 


2442 < oF 
But this inequality is true because 2 < 2* for k = 3, and thus 
2+2 <2 4+2 = 22 = ott 


Note Properties of order by algebra and a law of exponents. Transitivity of order then implies that 2(k + 1) + 1 < g, 
are listed in Appendix A. This discussion is summarized in the following formal proof. 


Proposition 5.3.3 


For every integer n = 3, 2n +1 < 2". 


Proof (by mathematical induction): 
Let the property P(n) be the inequality 


2n+1 <2”. <— P(n) 


Show that P(3) is true: 

To establish P(3), we must show that 
Note For P(3), copy P(n) 2:3+1<2. P6) 
and replace each n by 3. 

Now 


2:-3+1=7 and 2? =8 and 7<8. 
Hence P(3) is true. 


Show that for every integer k = 3, if P(k) is true then P(k + 1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 3. That is:] 


Suppose that k is any integer with k = 3 such that 
Note For P(x), copy P(n) 


and replace each n by k. +1 < 2k. < Pik) n 
inductive hypothesis 


[We must show that P(k + 1) is true. That is:] We must show that 


Note For P(k + 1), copy Wkt1)t+1< ake) < P(k+1) 
P(n) and replace each n 
by k+1. Now 
2(k+1)+1=2k+1+4+2 by algebra 
<2F+2 because, by the inductive 
hypothesis, 2k+1< 3 
xak because 2 < 2* since k = 3 
= 2: 24 by algebra 
= gel by the laws of exponents. 


Thus by transitivity of order 2(k + 1) +1 < 2%+*! [as was to be shown]. 
[Since we have proved the basis step and the inductive step, we conclude that the 
proposition is true.] 
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The next example demonstrates how to use mathematical induction to show that the 
terms of a sequence satisfy a certain explicit formula. 


CAE Proving a Property of a Sequence 


Define a sequence qj, a2, a3,...as follows:* 
a, = 2 


a, = 5a,-, for every integer k = 2. 


a. Write the first four terms of the sequence. 


b. It is claimed that for each integer n = 0, the nth term of the sequence has the same 
value as that given by the formula 2- 5""! In other words, the claim is that the terms 
of the sequence satisfy the equation a, = 2-5""'. Prove that this is true. 


Solution 
a. a, =2 
ay = 5q; = 5a, = 5:2 = 10 
az = 5az3-1 = 5a, = 5-10 = 50 
a4 = 5a4—-1 = 5a; = 5:50 = 250 
b. To use mathematical induction to show that every term of the sequence satisfies the 
equation, begin by showing that the first term of the sequence satisfies the equation. 


Then suppose that an arbitrarily chosen term a, satisfies the equation and prove that 
the next term a; + also satisfies the equation. 


Proof (by mathematical induction): 
Let a), a2, a3, ... be the sequence defined by specifying that a, = 2 and a, = 5a,_, for 


every integer k = 2, and let the property P(n) be the equation 
a,=2:5"!' a Pn) 
We will use mathematical induction to prove that for every integer n = 1, P(n) is true. 
Show that P(A) is true: 
To establish P(1), we must show that 


Note For P(1), copy P(n) aj= 2.5171 Lpi 


and replace each n by 1. 
Now the left-hand side of P(1) is 


a =2 by definition of a], a2, a3,..-, 
and the right-hand side of P(1) is 
2:517! = 2:50 = 2:1 =2. 
Thus the two sides of P(1) are equal to the same quantity, and hence P(1) is true. 


Show that for each integer k = 1, if P(k) is true then P(k + 1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 1. That is:] Let k 
be any integer with k = 1, and suppose that 


Note For P(k), copy P(n) a= 2.5471 <— P(k) 
and replace each n by k. inductive hypothesis 
[We must show that P(k + 1) is true. That is:] We must show that 
Qs = 2.504 D-1 


*This is another example of a recursive definition. The general subject of recursion is discussed in Section 5.6. 
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or, equivalently, 


Note For P(k +1), copy A+, = 2:54. —P(k+1) 
P(n) and replace each n 
by k+1. But the left-hand side of P(k + 1) is 
Ag+, = Sak+1)-1 by definition of a), a, a3,... 
= Sag since (k+1)—-1 =k 


=5:-(2: 5% D) by inductive hypothesis 
= 2-(5: = 4 by regrouping 
= 2; 5k by the laws of exponents, 


which is the right-hand side of P(k + 1) /as was to be shown]. 


[Since we have proved the basis step and the inductive step, we conclude that the formula 
holds for all terms of the sequence.] a 


A Problem with Trominoes 


The word polyomino, a generalization of domino, was introduced by Solomon Golomb in 
1954 when he was a 22-year-old university student. Subsequently, he and others proved 
many interesting properties about them, and they became the basis for the popular com- 
puter game Tetris. A particular type of polyomino, called a tromino, is made up of three 
attached squares, which can be of two types: 


straight and L-shaped 


Call a checkerboard that is formed using m squares on a side an m X m (“m by m”) check- 
Ra erboard. Observe that if one square is removed from a 4 X 4 checkerboard, the remaining 
squares can be completely covered by L-shaped trominoes. For instance, a covering for 
one such board is illustrated in the figure to the left. 
In his first article about polyominoes, Golomb included a proof of the following theo- 
rem. It is a beautiful example of an argument by mathematical induction. 


Theorem 5.3.4 Covering a Board with Trominoes 


For any integer n = 1, if one square is removed from a 2” X 2” checkerboard, the 
remaining squares can be completely covered by L-shaped trominoes. 


The main insight leading to a proof of this theorem is the observation that because 
24+! = 2.2" when a 2%+! x 2**! board is split in half both vertically and horizon- 
tally, each half side will have length 2* and so each resulting quadrant will be a 
2* x 2* checkerboard. 


Note Even when P(n) is a Proof (by mathematical induction): 

long sentence, P(1), P(k), Let the property P(n) be the sentence 

and P(k + 1) are each 

obtained by copying P(n) If any square is removed from a 2” X 2” checkerboard, then 

and replacing each n by 1, the remaining squares can be completely covered by L-shaped <— P(n) 
k, k + 1, respectively. trominoes. 
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TEST YOURSELF 
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Show that P(A) is true: 

A 2! x x 2! checkerboard just consists of four squares. If one square is removed, 
the remaining squares form an L, which can be covered by a single L-shaped tro- 
mino, as illustrated in the figure to the left. Hence P(1) is true. 


Show that for every integer k = 1, if P(k) is true then P(k + 1) is also true: 


[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 3. That is:] 
Let k be any integer such that k = 1, and suppose that 


If any square is removed from a 2* x 2* checkerboard, then 
the remaining squares can be completely covered by L-shaped <— P(k) 
trominoes. 


P(k) is the inductive hypothesis. 
[We must show that P(k + 1) is true. That is:] We must show that 


If any square is removed from a 2k+1 x 24+! checkerboard, 
then the remaining squares can be completely covered by <— P(k+1) 
L-shaped trominoes. 


Consider a 2'*! x 2**! checkerboard with one square removed. Divide it into four 
equal quadrants: Each will consist of a 2* x 2* checkerboard. In one of the quad- 
rants, one square will have been removed, and so, by inductive hypothesis, all the re- 
maining squares in this quadrant can be completely covered by L-shaped trominoes. 

The other three quadrants meet at the center of the checkerboard, and the center 
of the checkerboard serves as a corner of a square from each of those quadrants. An 
L-shaped tromino can, therefore, be placed on those three central squares. This situ- 
ation is illustrated in the figure to the left. 

By inductive hypothesis, the remaining squares in each of the three quadrants can 
be completely covered by L-shaped trominoes. Thus every square in the eee a aie 
checkerboard except the one that was removed can be completely covered by 
L-shaped trominoes /as was to be shown]. 


1. Mathematical induction differs from the kind of 2. Mathematical induction can be used to ____ 
induction used in the natural sciences because it is conjectures that have been made using inductive 
actually a form of reasoning. reasoning. 

EXERCISE SET 5.3 

1. Use mathematical induction (and the proof of 3. Stamps are sold in packages containing either 
proposition 5.3.1 as a model) to show that any 5 stamps or 8 stamps. 
amount of money of at least 14¢ can be made up a. Show that a person can obtain 5, 8, 10, 13, 15, 


using 3¢ and 8¢ coins. 


2. Use mathematical induction to show that any post- 
age of at least 12¢ can be obtained using 3¢ and 7¢ 


stamps. 


16, 20, 21, 24, or 25 stamps by buying a collec- 
tion of 5-stamp packages and 8-stamp packages. 
b. Use mathematical induction to show that any 
quantity of at least 28 stamps can be obtained 
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4. 


5. 


6. 


7. 
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by buying a collection of 5-stamp packages 
and 8-stamp packages. 


For each positive integer n, let P(n) be the sentence 
that describes the following divisibility property: 


5" — 1 is divisible by 4. 


Write P(0). Is P(O) true? 

Write P(k). 

Write P(k + 1). 

In a proof by mathematical induction that this di- 
visibility property holds for every integer n = 0, 
what must be shown in the inductive step? 


ao oo» 


For each positive integer n, let P(n) be the inequality 
2"<(n+I1)!. 


Write P(2). Is P(2) true? 

Write P(k). 

Write P(k + 1). 

In a proof by mathematical induction that this 
inequality holds for every integer n = 2, what 
must be shown in the inductive step? 


angy 


For each positive integer n, let P(n) be the sentence 


Any checkerboard with dimensions 2 X 3n can 
be completely covered with L-shaped trominoes. 


Write P(1). Is P(1) true? 

Write P(k). 

Write P(k + 1). 

In a proof by mathematical induction that P(n) 
is true for each integer n = 1, what must be 
shown in the inductive step? 


angop 


For each positive integer n, let P(n) be the sentence 


In any round-robin tournament involving n 
teams, the teams can be labeled 7}, D, 73,..., Ths 
so that 7; beats 7;,, for every i = 1, 2,...,n 


Write P(2). Is P(2) true? 

Write P(k). 

Write P(k + 1). 

In a proof by mathematical induction that P(n) 
is true for each integer n = 2, what must be 
shown in the inductive step? 


angy 


Prove each statement in 8-23 by mathematical induction. 


8. 


5” — 1 is divisible by 4, for every integer n = 0. 


9. 7" — 1 is divisible by 6, for each integer n = 0. 


10. 


n? — 7n + 3 is divisible by 3, for each integer n = 0. 
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22. 


23. 


24. 


25; 


26. 


27. 


28. 


. For any integer n = 0, 7” — 


. 3™” — 1 is divisible by 8, for each integer n = 0. 


2” is divisible by 5. 


. For any integer n = 0, x” — y” is divisible by x — y, 


where x and y are any integers with x # y. 


. n? — n is divisible by 6, for each integer n = 0. 

. n(n? +5) is divisible by 6, for each integer n = 0. 
. 2” <(n+1)!, for every integer n = 2. 

. 1+3n = 4", for every integer n = 0. 

. 5"+9 < 6”, for each integer n = 2. 

. n? <2", for every integer n = 5. 

. 2” < (n+ 2)!, for each integer n = 0. 


ay 


vot , for every integer 


AtA Va 


1 +nx S (1 +x)”, for every real number x > —1 
and every integer n = 2. 


a. n? > 2n+1, for each integer n = 2. 
b. n! >n’, for each integer n = 4. 


A sequence dj, a2, a3, . . . is defined by letting 
a, = 3 and a, = Tagı for each integer k = 2. 
Show that a, = 3-7" ' for every integer n = 1. 


A sequence bo, bi, b2,... is defined by letting 
bo = 5 and by = 4+ by_, for each integer k = 1. 
Show that b, > 4n for every integer n = 0. 


A sequence Co, C1, C2,... is defined by letting 
Co = 3 and c = (c) for every integer k = 1. 
Show that c, = 3°" for each integer n = 0. 


A sequence d}, do as a 
dı = 2 and d; = 


that for every integer n = 1, d, = 


is defined by letting 


aot | for each integer k = 2. Show 


2 
ni: 


Prove that for every integer n = 1, 
1 I3 bob Qn = 1) 
3 (2nt+1)(2n+3)+-+:+(2n+(2n-1)) 


Exercises 29 and 30 use the definition of string and string 
length from page 13 in Section 1.4. Recursive definitions 
for these terms are given in Section 5.9. 


29. 


A set L consists of strings obtained by juxtaposing 
one or more of abb, bab, and bba. Use mathemati- 
cal induction to prove that for every integer n = 1, 
if a string s in L has length 3n, then s contains an 
even number of b’s. 


30 


31. 


H 32. 


33. 


H 34. 


35. 


Hb. 


. A set S consists of strings obtained by juxtapos- 
ing one or more copies of 1110 and 0111. 
Use mathematical induction to prove that for ev- 
ery integer n = 1, if a string s in S has length 4n, 
then the number of l’s in s is a multiple of 3. 


Use mathematical induction to give an alternative 
proof for the statement proved in Example 4.9.9: 
For any positive integer n, a complete graph on n 


1 
vertices has a edges. Hint: Let P(n) be the 


sentence, “the number of edges in a complete graph 
nn — 


” 


on n vertices is 


Some 5 X 5 checkerboards with one square 
removed can be completely covered by L-shaped 
trominoes, whereas other 5 X 5 checkerboards 
cannot. Find examples of both kinds of checker- 
boards. Justify your answers. 


Consider a 4 X 6 checkerboard. Draw a covering 
of the board by L-shaped trominoes. 


a. Use mathematical induction to prove that for 
each integer n = 1, any checkerboard with 
dimensions 2 X 3n can be completely covered 
with L-shaped trominoes. 

b. Letn be any integer greater than or equal 
to 1. Use the result of part (a) to prove by 
mathematical induction that for every inte- 
ger m, any checkerboard with dimensions 
2m X 3n can be completely covered with 
L-shaped trominoes. 


Let m and n be any integers that are greater than 
or equal to 1. 

a. Prove that a necessary condition for an m X n 
checkerboard to be completely coverable by 
L-shaped trominoes is that mn be divisible by 3. 
Prove that having mn be divisible by 3 is not 

a sufficient condition for an m X n checker- 
board to be completely coverable by L-shaped 
trominoes. 


36. In a round-robin tournament each team plays every 


other team exactly once with ties not allowed. If 
the teams are labeled 7}, T>, . .. , T„, then the out- 
come of such a tournament can be represented by a 
directed graph, in which the teams are represented 
as dots and an arrow is drawn from one dot to an- 
other if, and only if, the following team represented 
by the first dot beats the team represented by the 
second dot. For example, the following directed 
graph shows one outcome of a round-robin tourna- 
ment involving five teams, A, B, C, D, and E. 
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C 


VASYA 


D 


Use mathematical induction to show that in 
any round-robin tournament involving n teams, 
where n = 2, it is possible to label the teams 
Ti; T>,..., T, so that T; beats 7;,, for all i = 1, 
2,..., n— 1. (For instance, one such labeling in 
the example above is 7, = A, T) = B, T3 = C, 
T4 = E, T; = D.) (Hint: Given k + 1 teams, 
pick one—say T’—and apply the inductive 
hypothesis to the remaining teams to obtain an 
ordering 7,, T>,..., Tp. Consider three cases: 
T' beats T,, T’ loses to the first m teams (where 
1 =m =k-—1)and beats the (m+ 1)st team, 
and T’ loses to all the other teams.) 


H* 37. On the outside rim of a circular disk the integers 


H 38. 


39. 


from 1 through 30 are painted in random order. 
Show that no matter what this order is, there must be 
three successive integers whose sum is at least 45. 


Suppose that n a’s and n b’s are distributed around 
the outside of a circle. Use mathematical induction 
to prove that for any integer n = 1, given any such 
arrangement, it is possible to find a starting point 
so that if you travel around the circle in a clock- 
wise direction, the number of a’s you pass is never 
less than the number of b’s you have passed. For 
example, in the diagram shown below, you could 
start at the a with an asterisk. 


For a polygon to be convex means that given any 
two points on or inside the polygon, the line join- 
ing the points lies entirely inside the polygon. Use 
mathematical induction to prove that for every 
integer n = 3, the angles of any n-sided convex 
polygon add up to 180(n — 2) degrees. 


40. a. Prove that in an 8 X 8 checkerboard with alter- 


nating black and white squares, if the squares 
in the top right and bottom left corners are 
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H 41. 


42. 


43. 
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removed the remaining board cannot be 
covered with dominoes. (Hint: Mathematical 
induction is not needed for this proof.) 

b. Use mathematical induction to prove that for 
each positive integer n, if a 2n X 2n checker- 
board with alternating black and white squares 
has one white square and one black square 
removed anywhere on the board, the remaining 
squares can be covered with dominoes. 


A group of people are positioned so that the dis- 
tance between any two people is different from the 
distance between any other two people. Suppose 
that the group contains an odd number of people 
and each person sends a message to their nearest 
neighbor. Use mathematical induction to prove 
that at least one person does not receive a mes- 
sage from anyone. [This exercise is inspired by 

the article “Odd Pie Fights” by L. Carmony, The 
Mathematics Teacher, 72(1), 1979, 61—64.] 


Show that for any even integer n, it is possible to 
find a group of n people who are all positioned so 
that the distance between any two people is differ- 
ent from the distance between any other two people, 
so that each person sends a message to their nearest 
neighbor, and so that every person in the group re- 
ceives a message from another person in the group. 


Define a game as follows: You begin with an urn 
that contains a mixture of white and black balls, 
and during the game you have access to as many 
additional white and black balls as you might 
need. In each move you remove two balls from 
the urn without looking at their colors. If the balls 
are the same color, you put in one black ball. If 
the balls are different colors, you put the white 
ball back into the urn and keep the black ball out. 
Because each move reduces the number of balls 
in the urn by one, the game will end with a single 
ball in the urn. If you know how many white balls 
and how many black balls are initially in the urn, 
can you predict the color of the ball at the end of 
the game? [This exercise is based on one described 
in “Why correctness must be a mathematical con- 
cern” by E. W. Dijkstra, www.cs.utexas.edu/users 
/EWD/transcriptions/EW D07xx/EWD720.htm1 ] 
a. Map out all possibilities for playing the game 
starting with two balls in the urn, then three 
balls, and then four balls. For each case keep 
track of the number of white and black balls 
you start with and the color of the ball at the 
end of the game. 
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H b. Does the number of white balls seem to be pre- 


44. 


dictive? Does the number of black balls seem to 
be predictive? Make a conjecture about the color 
of the ball at the end of the game given the num- 
bers of white and black balls at the beginning. 

c. Use mathematical induction to prove the con- 
jecture you made in part (b). 


Let P(n) be the following sentence: Given any 
graph G with n vertices satisfying the condition 
that every vertex of G has degree at most M, then 
the vertices of G can be colored with at most M+ 1 
colors in such a way that no two adjacent vertices 
have the same color. Use mathematical induction to 
prove this statement is true for every integer n = 1. 


In order for a proof by mathematical induction to be valid, 
the basis statement must be true for n = a and the argu- 
ment of the inductive step must be correct for every inte- 
ger k = a. In 45 and 46 find the mistakes in the “proofs” 
by mathematical induction. 


45. 


H 46. 


“Theorem:” For any integer n = 1, all the num- 
bers in a set of n numbers are equal to each other. 


“Proof (by mathematical induction): It is obvi- 
ously true that all the numbers in a set consisting 
of just one number are equal to each other, so 
the basis step is true. For the inductive step, let 


A = {q}, do,..., Ay, Ay4 1} be any set of k + 1 num- 
bers. Form two subsets each of size k: 

B = {a, a, a3,..., ap} and 

C = {a;, A3, 4s... ak+1}- 


(B consists of all the numbers in A except ag+1, 
and C consists of all the numbers in A except a2.) 
By inductive hypothesis, all the numbers in B 
equal a, and all the numbers in C equal a, (since 
both sets have only k numbers). But every number 
in A is in B or C, so all the numbers in A equal ay; 
hence all are equal to each other.” 


“Theorem:” For every integer n = 1, 3” — 2 is even. 
“Proof (by mathematical induction): Suppose 
the theorem is true for an integer k, where k = 1. 
That is, suppose that 3* — 2 is even. We must show 
that 3‘*! — 2 is even. Observe that 
3ktl_9 = 3*.3-2 = 34(1+2)-2 

= (3-2) + 3*-2, 
Now 3* —2 is even by inductive hypothesis and 
3*-2 is even by inspection. Hence the sum of 
the two quantities is even (by Theorem 4.1.1). It 


follows that 3‘*! — 2 is even, which is what we 
needed to show.” 
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ANSWERS FOR TEST YOURSELF 


1. deductive 2. prove 


EZ] Strong Mathematical Induction 
and the Well-Ordering Principle 
for the Integers 


Mathematics takes us still further from what is human into the region of absolute 


necessity, to which not only the actual world, but every possible world, must conform. 
—Bertrand Russell, 1902 


Strong mathematical induction is similar to ordinary mathematical induction in that it is 
a technique for establishing the truth of a sequence of statements about integers. Also, 
a proof by strong mathematical induction consists of a basis step and an inductive step. 
However, the basis step may contain proofs for several initial values, and in the inductive 
step the truth of the predicate P(n) is assumed not just for one value of n but for all values 
through k, and then the truth of P(k + 1) is proved. 


Principle of Strong Mathematical Induction 


Let P(n) be a property that is defined for integers n, and let a and b be fixed integers 
with a = b. Suppose the following two statements are true: 


1. P@, P(a+1),..., and P(b) are all true. (basis step) 


2. For every integer k = b, if P(i) is true for each integer i from a through k, then 
P(k + 1) is true. (inductive step) 


Then the statement 
for every integer n = a, P(n) 
is true. (The supposition that P(i) is true for each integer i from a through k is called 


the inductive hypothesis. Another way to state the inductive hypothesis is to say 
that P(@), P(a+1),..., P(k) are all true.) 


Any statement that can be proved with ordinary mathematical induction can be proved 
with strong mathematical induction. The reason is that given any integer k = b, if the truth 
of P(k) alone implies the truth of P(k + 1), then certainly the truth of P(a), P(a + 1),..., and 
P(k) implies the truth of P(k+ 1). It is also the case that any statement that can be proved 
with strong mathematical induction can be proved with ordinary mathematical induction. A 
proof is sketched in exercise 27 at the end of this section. 

Strictly speaking, the principle of strong mathematical induction can be written without 
a basis step if the inductive step is changed to “For every integer k = a — 1, if P(d) is true 
for each integer i from a through k, then P(k + 1) is true.” The reason for this is that the 
statement “P(i) is true for each integer i from a through k” is vacuously true for k = a— 1. 
Hence, if the implication in the inductive step is true, then the conclusion P(a) must also 
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be true,* which proves the basis step. However, in many cases the proof of the implication 
for k > b does not work for a = k £ b. So it is a good idea to get into the habit of thinking 
separately about the cases where a = k S b by explicitly including a basis step. 

The principle of strong mathematical induction is known under a variety of different 
names including the second principle of induction, the second principle of finite induc- 
tion, and the principle of complete induction. 


Applying Strong Mathematical Induction 


The divisibility-by-a-prime theorem states that any integer greater than | is divisible by a 
prime number. We prove this theorem using strong mathematical induction. 


Divisibility by a Prime 


Prove Theorem 4.4.4: Any integer greater than | is divisible by a prime number. 


Solution The idea for the inductive step is this: If you are given an integer greater than 1 
that is not itself prime, then it is a product of two smaller positive integers, each of which 
is greater than 1. By inductive hypothesis, you are assuming that each of these smaller 
integers is divisible by a prime number, and so, by transitivity of divisibility, those prime 
numbers also divide the integer you started with. 


Proof (by strong mathematical induction): 
Let the property P(n) be the sentence 
n is divisible by a prime number. < P(n) 


Show that P(2) is true: 
To establish P(2), we must show that 
2 is divisible by a prime number. <— P(2) 

But this is true because 2 is divisible by 2 and 2 is a prime number. 
Show that for every integer k = 2, if P) is true for each integer from 2 through 
k, then P(k + 1) is also true: 
Let k be any integer with k = 2 and suppose that 

i is divisible by a prime number for each integer 

i from 2 through k. < inductive hypothesis 


We must show that 

k+ 1 is divisible by a prime number. <— P(k+1) 
Case 1 (k + 1 is prime): In this case k + 1 is divisible by a prime number, namely, itself. 
Case 2 (k +1 is not prime): In this case k + 1 = ab where a and b are integers with 
1<a<k+1land1<b<k-+1. Thus, in particular, 2 = a =k, and so by induc- 
tive hypothesis, a is divisible by a prime number p. In addition because k + 1 = ab, 


we have that k+ 1 is divisible by a. Hence, since k+ 1 is divisible by a and a is 
divisible by p, by transitivity of divisibility, k + 1 is divisible by the prime number p. 


*If you have proved that a certain if-then statement is true and if you also know that the hypothesis is true, 
then the conclusion must be true. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


5.4 STRONG MATHEMATICAL INDUCTION AND THE WELL-ORDERING PRINCIPLE FOR THE INTEGERS 303 


Therefore, regardless of whether k + 1 is prime or not, it is divisible by a prime 
number /as was to be shown]. 
[Since we have proved both the basis and the inductive step of the strong mathematical 


induction, we conclude that the given statement is true.] 


Both ordinary and strong mathematical induction can be used to show that the terms 
of certain sequences satisfy certain properties. The next example shows how this is done 
using strong induction. 


Proving a Property of a Sequence with Strong Induction 


Define a sequence So, S1, 52,... as follows: 


So =0, 5, =4, 5, = 6a,-;—Sa,_-, for every integer k = 2. 
a. Find the first four terms of this sequence. 


b. It is claimed that for each integer n = 0, the nth term of the sequence has the same 
value as that given by the formula 5" — 1. In other words, the claim is that all the terms 
of the sequence satisfy the equation s, = 5" — 1. Prove that this is true. 


Solution 
a. 59 = 0, 5, = 4, s= 65; —Ss9 = 6°4—5-0 = 24, 
53 = 65, — 5s; = 6:24—5-4 = 144 — 20 = 124 


b. To use strong mathematical induction to show that every term of the sequence satisfies 
the equation, the basis step must show that the first two terms satisfy it. This is necessary 
because, according to the definition of the sequence, computing values of later terms 
requires knowing the values of the two previous terms. So if the basis step only shows 
that the first term satisfies the equation, it would not be possible to use the inductive step 
to deduce that the second term satisfies the equation. In the inductive step you suppose 
that for an arbitrarily chosen integer k = 1, all the terms of the sequence from sọ through 
Sx Satisfy the given equation and you then deduce that s,4, must also satisfy the equation. 


Proof: 


Let So, Si, 52,... be the sequence defined by specifying that sy = 0, sı = 4, and 
Sx = Oay—1 — 5az-2 for every integer k = 2, and let the property P(n) be the formula 


S, = 5"-1. < P(n) 


We will use strong mathematical induction to prove that for every integer n = 0, 
P(n) is true. 


Show that P(0) and P(A) are true: 
To establish P(O) and P(1), we must show that 


so = 5°— 1 and s; =5!—1. <P(0) and P(1) 


But, by definition of so, Si, S2,..., we have that sy =O and s, = 4. Since 
5°-1=1-1=0 and 5!—1 =5-—1 = 4, the values of sg and s, agree with the 
values given by the formula. 


(continued on page 304) 
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Show that for every integer k = 1, if P@ is true for each integer i from 0 through k, 
then P(k +1) is also true: 
Let k be any integer with k = 1 and suppose that 


S; = 5'— 1 for each integer i with 0 < i S k. < inductive hypothesis 
We must show that 
Seep = 51-1. P+) 


But since k = 1, we have that k+ 1 = 2, and so 


Sk+1 = 65, — 55,1 by definition of Sọ, 51, 59,... 


= 6(5* 1) 5(5* oa 1) by definition hypothesis 


=6: 5k =6= a +5 by multiplying out and applying a law of exponents 
= (6-— ia =] by factoring out 6 and arithmetic 

=5- 5k =i by arithmetic 

— gkt+l _ h 

=5 1 by applying a law of exponents, 


[as was to be shown]. 
[Since we have proved both the basis and the inductive step of the strong mathematical 
induction, we conclude that the given statement is true.] 


A Sequence That Involves the Floor Function 
Define a sequence qj, a>, a3, . . . as follows: 
a, = 0, 
ay = 2, 
ak = 34 x/2,+ 2 for each integer k = 3. 
a. Find the first seven terms of the sequence. 


b. Prove that a, is even for every integer n = 1. 


Solution 
a ay = 0 
ay = 2 


az = 343/2)+2 = 3a, +2=3:0+2=2 
a4 = 34/2) + 2 = 3a, +2 = 3:2+2=8 
ds = 35/2) + 2 = 34, +2 =3:2+2=8 
aş = 34 6/2) + 2 = 3a; +2 = 3:-2+2 = 8 
a7 = 37/2, t 2 = 3a; +2 = 3:2+2 = 8 


b. Let the property P(n) be the sentence “a, is even.” We use strong mathematical induc- 
tion to show that the property holds for every integer n = 1. 


Show that P(1) and P(2) are true: The property is true for n = 1 and n = 2 because a, = 0 
and a, = 2 and both 0 and 2 are even integers. 
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Show that for every integer k = 1, if P@) is true for each integer i from 1 through k, then 
P(k +1) is also true: Let k be any integer with k = 1 and suppose that 


a; is even for each integer i with 1 S i S k. < inductive hypothesis 
[We must show that a, is even.] By definition of aj, a2, a3,... 
ak = 344/2)+2 for every integer k = 3. <— P(k+1) 


Now ax/2) is even by the inductive hypothesis /because k = 1 and so 1 = |k/2| = k]. Thus 
Bagaj 18s even [because it is a product of an odd and an even integer], and hence 3ax/2; + 2 is 
even [because a sum of two even integers is even]. Consequently, ag, which equals 3a,/2) + 2, 
is even fas was to be shown]. 

[Since we have proved the basis step and the inductive step of the strong mathematical 
induction, we conclude that the given statement is true.] E 


Another use of strong induction concerns the computation of products. A product of 
four numbers may be computed in a variety of different ways as indicated by the placement 
of parentheses. For instance, 


(Œx1x2)x3)x4 means multiply x, and x2, multiply the result by x3, 
and then multiply that number by x4, 


and 


(x1x2)(x3x4) Means multiply xı and x2, multiply x3 and x4, 
and then take the product of the two. 


Note that in both examples above, although the factors are multiplied in a different order, 
the number of multiplications—three—is the same. Strong mathematical induction is used 
to prove a generalization of this fact. 


Note Like many Convention 

definitions, this may . . : 

look strange but it makes Let us agree to say that a single number x, is a product with one factor and can be 
things work out nicely for computed with zero multiplications. 


extreme cases. 


The Number of Multiplications Needed to Multiply n Numbers 


Prove that for any integer n = 1, if x), x2,..., x, are n numbers, then no matter how the 
parentheses are inserted into their product, the number of multiplications used to compute 
the product is n — 1. 


Solution The truth of the basis step follows immediately from the convention about a 
product with one factor. The inductive step is based on the fact that when several numbers 
are multiplied together, each step of the process involves multiplying two individual quan- 
tities. For instance, the final step for computing ((x1x2)x3)(x4x5) is to multiply ()x2)x3 and 
xX4X5. In general, when k+ 1 numbers are multiplied, the two quantities in the final step 
each consist of fewer than k + 1 factors. This is what makes it possible to use the inductive 
hypothesis. 
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Proof (by strong mathematical induction): 


Let the property P(n) be the sentence 


If x1, X2,..., X, are n numbers, then no matter 

how parentheses are inserted into their product, 

the number of multiplications used to compute < P(n) 
the product is n— 1. 


Show that P(A) is true: 
To establish P(1), we must show that 


The number of multiplications needed to compute 
the product of x, is 1 — 1. < P(1) 


This is true because, by convention, x; is a product that can be computed with 0 
multiplications, and 0 = 1 — 1. 


Show that for every integer k = 1, if P(@i) is true for each integer i from 1 through 
k, then P(k + 1) is also true: 
Let k be any integer with k = | and suppose that 


For each integer i from 1 through k, if x), x2, . . . , x; are 

numbers, then no matter how parentheses are inserted 

into their product, the number of multiplications used to <— inductive hypothesis 
compute the product is i— 1. 


We must show that 


If x1, X2,- . . , X¢4, are k + 1 numbers, then no matter 

how parentheses are inserted into their product, the 

number of multiplications used to compute the product = <—P(k+1) 
is (k+1)—-1 =k. 


Consider a product of k + 1 factors: x1, x2..., Xz+1. When parentheses are inserted 
in order to compute the product, some multiplication is the final one and each of 
the two factors making up the final multiplication is a product of fewer than k + 1 
factors. Let L be the product of the left-hand factors and R be the product of the 
right-hand factors, and suppose that L is composed of / factors and R is composed of 
r factors. Then l +r = k+ 1, the total number of factors in the product, and 


lslsk and lsrsk. 


By inductive hypothesis, evaluating L takes /—1 multiplications and evaluating R 
takes r — 1 multiplications. Because one final multiplication is needed to evaluate L- R, 
the number of multiplications needed to evaluate the product of all k + 1 factors is 


d-)+(r-1I+1=(4+AN-1=(+)-1=k 


[as was to be shown]. 


[Since we have proved the basis step and the inductive step of the strong mathematical 
induction, we conclude that the given statement is true.] 


Strong mathematical induction makes possible a proof of the fact used frequently in com- 
puter science that every positive integer n has a unique binary integer representation. The 
proof looks complicated because of all the notation needed to write down the various steps. 
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But the idea of the proof is simple. It is that if integers smaller than n have unique repre- 
sentations as sums of powers of 2, then the unique representation for n as a sum of powers 
of 2 can be found by taking the representation for n/2 (or for (n — 1)/2 if n is odd) and 
multiplying it by 2. 


Theorem 5.4.1 Existence and Uniqueness of Binary Integer Representations 


Given any positive integer n, n has a unique representation in the form 


n=c,2"+¢,-4°2" | 4+++++09:2? +¢,:2 +, 


where r is a nonnegative integer, c, = 1, and C 1 or O for each j = 0, 1, 2,..., 
P= Il, 


Proof: 
We give separate proofs by strong mathematical induction to show first the existence 
and second the uniqueness of the binary representation. 


Existence (proof by strong mathematical induction): Let the property P(n) be the 
equation 


n =c p +e 2T] Fey 2? +¢1:2 Co, <— P(n) 


where r is a nonnegative integer, c, = 1, and c; = 1 or O for each j = 0, 1, 2,..., 


p=: 

Show that P(1) is true: 

Let r = 0 and cy = 1. Then 1 = c,-2", and so n = 1 can be written in the required 
form. 


Show that for every integer k = 1, if P@) is true for each integer i from 1 through 
k, then P(k +1) is also true: 
Let k be an integer with k = 1. Suppose that for each integer i from | through k, 


i= er te Qh beet Co" 2? + Cr 2 Fi; < inductive hypothesis 


where r is a nonnegative integer, c, = 1, and cj = 1 or O for each j = 0, 1, 2,..., 
r— 1. We must show that k + 1 can be written as a sum of powers of 2 in the required 
form. 


Case 1 (k +1 is even): In this case (k + 1)/2 is an integer, and by inductive hypoth- 
esis, since 1 < (k + 1)/2 < k, then 

k+1 r p=] 2 

“a C2 Fed Reb cg 2° GF 2+ Co; 
where r is a nonnegative integer, c, = 1, and cj = 1 or O for each j = 0, 1, 2,..., 
r—1. Multiplying both sides of the equation by 2 gives 

k+l =cette te toy Hc + c9°2, 

which is a sum of powers of 2 of the required form. 


Case 2 (k +1 is odd): In this case k/2 is an integer, and by inductive hypothesis, 
since 1 = k/2 = k, then 


kK r cae et 2 
ae Fea > Reig 26 ie 258 Cp 


(continued on page 308) 
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where r is a nonnegative integer, c, = 1, and c= 1 or O for each j = 0, 1, 2,..., 
r—1. Multiplying both sides of the equation by 2 and adding 1 gives 


ktleep 2 te, yp 2 te tey Pte 2 +92 +1, 
which is also a sum of powers of 2 of the required form. 


The preceding arguments show that regardless of whether k + 1 is even or odd, k + 1 
has a representation of the required form. /Or, in other words, P(k + 1) is true as was 
to be shown.] 


[Since we have proved the basis step and the inductive step of the strong mathematical 
induction, the existence half of the theorem is true.] 


Uniqueness: To prove uniqueness, suppose that there is an integer n with two dif- 
ferent representations as a sum of nonnegative integer powers of 2. Equating the two 
representations and canceling all identical terms gives 

Wt ey +e +e 2teg = +d i+ s+dy2+dy 541 


where r and s are nonnegative integers and each c; and each d; equal 0 or 1. With- 
out loss of generality, we may assume that r < s. Now by the formula for the sum 
of a geometric sequence (Theorem 5.2.2) and because r < s (which implies that 
r+1Ss), 
Vte 14+ +e 2+eg SW HQ 1+ + 4+241=2"'-1 
<7. 
Thus 
Wey lH tea 2teg <M +d, 2T Hd do, 


which contradicts equation (5.4.1). Hence the supposition is false, so any integer n 
has only one representation as a sum of nonnegative integer powers of 2. 


The Well-Ordering Principle for the Integers 


The well-ordering principle for the integers looks very different from both the ordinary and 
the strong principles of mathematical induction, but it can be shown that all three principles 
are equivalent. In other words, if any one of the three is true, then so are both of the others. 


Well-Ordering Principle for the Integers 


Let S be a set of integers containing one or more integers all of which are greater 
than some fixed integer. Then S has a least element. 


When the context makes the reference clear, we will write simply “the well-ordering 
principle” rather than “the well-ordering principle for the integers.” 


Finding Least Elements 


In each case, if the set has a least element, state what it is. If not, explain why the well- 
ordering principle is not violated. 
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a. The set of all positive real numbers 
b. The set of all nonnegative integers n such that n? <n 


c. The set of all nonnegative integers of the form 46 — 7k, where k is an integer 


Solution 

a. There is no least positive real number. For if x is any positive real number, then x/2 is 
a positive real number that is less than x. No violation of the well-ordering principle 
occurs because the well-ordering principle refers only to sets of integers, and this set 
is not a set of integers. 


b. There is no least nonnegative integer n such that n? < n because there is no nonnega- 
tive integer that satisfies this inequality. The well-ordering principle is not violated be- 
cause the well-ordering principle refers only to sets that contain at least one element. 


c. The following table shows values of 46 — 7k for various values of k. 


k 0 1 2 3 4 5 6 7 tee =—1 =2 =3 
46 — 7k 46 39 32 25 18 11 4 =3 ess 53 60 67 


The table suggests, and you can easily confirm, that 46 —7k < 0 for k= 7 and that 
46 — 7k = 46 for k = 0. Therefore, from the other values in the table it is clear that 4 is the 
least nonnegative integer of the form 46 — 7k. This corresponds to k = 6. E 


Another way to look at the analysis of Example 5.4.5(c) is to observe that subtracting six 
T’s from 46 leaves 4 left over and this is the least nonnegative integer obtained by repeated 
subtraction of 7’s from 46. In other words, 6 is the quotient and 4 is the remainder for the 
division of 46 by 7. More generally, in the division of any integer n by any positive integer d, 
the remainder r is the least nonnegative integer of the form n — dk. This is the heart of the fol- 
lowing proof of the existence part of the quotient-remainder theorem (the part that guarantees 
the existence of a quotient and a remainder of the division of an integer by a positive integer). 
For a proof of the uniqueness of the quotient and remainder, see exercise 21 of Section 4.8. 


Quotient-Remainder Theorem (Existence Part) 


Given any integer n and any positive integer d, there exist integers q and r such that 


n=dqtr and 0=r<d. 


Proof: Let S be the set of all nonnegative integers of the form 


n—dk, 
where k is an integer. This set has at least one element. [For if n is nonnegative, then 
n—-O-d=n=0, 


and son—O-d is in S. And if n is negative, then 
n—nd=n(1—d)=0, 
Í 


<0 = 0 since d is a positive integer 


(continued on page 310) 
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and so n—nd is in S.] It follows by the well-ordering principle for the integers that S 
contains a least element r. Then, for some specific integer value of k, say q, 


n—dq =r 
[because every integer in S can be written in this form]. Adding dq to both sides gives 
n = dq+r. 


Furthermore, r < d. [For suppose r = d. Then 


n—d(q+1)=n-—dq-—d=r—-d=0, 


and so n— d(q + 1) would be a nonnegative integer in S that would be smaller than r. But 
r is the smallest integer in S. This contradiction shows that the supposition r = d must 
be false.] The preceding arguments prove that there exist integers r and q for which 


n=dq+r and O=sr<d 


[as was to be shown]. 


Another consequence of the well-ordering principle is the fact that any strictly decreas- 
ing sequence of nonnegative integers is finite. That is, if r4, r2, 73, ...iS a sequence of non- 
negative integers satisfying 


T> Tipi 


for every i = 1, then r}, ro, r3, ...is a finite sequence. [For by the well-ordering principle 
such a sequence has a least element, say r}. It follows that r, is the final term of the sequence 
because if there were an additional term r4, then since the sequence is strictly decreasing, 
Tk+1 < 1% which would be a contradiction.] This fact is frequently used in computer science 
to prove that algorithms terminate after a finite number of steps. 


TEST YOURSELF 


1. Ina proof by strong mathematical induction the 
basis step may require checking a property P(n) 
for more value of n. 


2. Suppose that in the basis step for a proof by strong 
mathematical induction the property P(n) was 
checked for every integer n from a through b. 
Then in the inductive step one assumes that for 


EXERCISE SET 5.4 


any integer k = b, the property P(n) is true for 
all values of i from through and one 
shows that is true. 


. According to the well-ordering principle for the 


integers, if a set S of integers contains at least 
______and if there is some integer that is less than 
or equal to every , then 


1. Suppose ay, do, a3,.. 
follows: 


. is a sequence defined as 


4&4 = 1, ag = 3; 
ak = aķ-2 +2a,—, for each integer k = 3. 


Prove that a, is odd for every integer n = 1. 


. Suppose by, bo, b3,.. 


. 1s a sequence defined as fol- 
lows: 
bi =4, by = 12, 
by = by-2 + bg-ı1 for each integer k = 3. 


Prove that b, is divisible by 4 for every integer n = 1. 
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. Suppose that co, C1, Co,... is a sequence defined 


as follows: 
Co 2, Cy 2; C2 6, 
Ck = 3c,_-3. for every integer k = 3. 


Prove that c, is even for each integer n = 0. 


. Suppose that d), dz, d3,... is a sequence defined 


as follows: 
9 10 
dy =—, dy, 
1 10 P 
d, = dy-\"dy-2 for every integer k = 3. 


Prove that 0 < d, = 1 for each integer n = 1. 


. Suppose that éo, €j, é2,... 
as follows: 


eo = 12, e; = 29, 


is a sequence defined 


ex = Sez- — 6ez—2 for each integer k = 2. 


Prove that e, = 5-3" + 7-2" for every integer 
n=0. 


. Suppose that fo, fis fo, ..- 
follows: 

fo =5, fi = 16, 

Jk = Th-1 — 10fk-2 


Prove that f, = 3-2” + 2-5" for each integer 
n=20. 


is a sequence defined as 


for every integer k = 2. 


. Suppose that g1, go, g3,... is a sequence defined 


as follows: 

i= 3, 62> 5, 

8k = 38k-1ı —28,-2 for each integer k = 3. 
Prove that g, = 2” + 1 for every integer n = 1. 


. Suppose that ho, Ay, ho,... 
as follows: 


b= h =2, h =3, 
hy = hg- + Iya + hg-3 


is a sequence defined 


for each integer k = 3. 


a. Prove that h, = 3” for every integer n = 0. 
b. Suppose that s is any real number such 
that s? = s? + s + 1. (This implies that 
2 > s > 1.83.) Prove that h,, = s” for every 
integer n = 2. 


. Define a sequence a}, a>, a3,... as follows: 
a, = 1, a = 3, and a = ay_, + ay_» for every 


Tl. 


14. 


15. 
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integer k = 3. (This sequence is known as the 
Lucas sequence.) Use strong mathematical induc- 
tion to prove that a, = \z) for every integer n = 1. 


. The introductory example solved with ordinary 


mathematical induction in Section 5.3 can also be 
solved using strong mathematical induction. Let P(n) 
be “any n¢ can be obtained using a combination of 
3¢ and 5¢ coins.” Use strong mathematical induction 
to prove that P(n) is true for every integer n = 8. 


You begin solving a jigsaw puzzle by finding two 
pieces that match and fitting them together. Every 
subsequent step of the solution consists of fitting 
together two blocks, each of which is made up 

of one or more pieces that have previously been 
assembled. Use strong mathematical induction 

to prove that for every integer n = 1, the number 
of steps required to put together all n pieces of a 
jigsaw puzzle is n— 1. 


. The sides of a circular track contain a sequence 


of n cans of gasoline. For each integer n = 1, the 
total amount in the cans is sufficient to enable a 
certain car to make one complete circuit of the 
track. In addition, all the gasoline could fit into 
the car’s gas tank at one time. Use mathematical 
induction to prove that it is possible to find an 
initial location for placing the car so that it will 
be able to traverse the entire track by using the 
various amounts of gasoline in the cans that it 
encounters along the way. 


. Use strong mathematical induction to prove the 


existence part of the unique factorization of 
integers theorem (Theorem 4.4.5). In other words, 
prove that every integer greater than | is either a 
prime number or a product of prime numbers. 


Any product of two or more integers is a result 
of successive multiplications of two integers at a 
time. For instance, here are a few of the ways in 
which aja2a3a4 might be computed: (a,a2)(a3a4) 
or ((a;a2)a3)a4) or a,((a2a3)a4). Use strong math- 
ematical induction to prove that any product of 
two or more odd integers is odd. 


Define the “sum” of one integer to be that integer, 
and use strong mathematical induction to prove 
that for every integer n = 1, any sum of n even 
integers is even. 


. Use strong mathematical induction to prove that 


for every integer n = 2, if n is even, then any sum 
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17. 


18. 


19. 


20. 


21. 


22. 


23. 


of n odd integers is even, and if n is odd, then any 
sum of n odd integers is odd. 


Compute 4', 4’, 4°, 44, 4°, 4°, 4”, and 4°. Make a 
conjecture about the units digit of 4” where n is a 
positive integer. Use strong mathematical induc- 
tion to prove your conjecture. 


Compute 9°, 9! 9? 9° 9$, and 9°. Make a conjec- 
ture about the units digit of 9” where n is a posi- 
tive integer. Use strong mathematical induction to 
prove your conjecture. 


Suppose that aj, a2, a3, . . . is a sequence defined as 
follows: 


a, = 1 ay = 2-a x2, for every integer k = 2. 
Prove that a„ = n for each integer n = 1. 


Suppose that by, bo, b3,...1s a sequence defined as 
follows: 


bı = 0, by = 3 dy =5-Dxj2, +6 for every integer 
k 23. 


Prove that b,, is divisible by 3 for each integer 
n=l 


Suppose that c4, C2, C3, . . 
follows: 


.is a sequence defined as 


co. = 1, cy 1 Ck = Ok/2 T OK/2] for every 
integer k = 2. 


Prove that c, = n for each integer n = 1. 


One version of the game NIM starts with two piles 
of objects such as coins, stones, or matchsticks. 

In each turn a player is required to remove from 
one to three objects from one of the piles. The 

two players take turns doing this until both piles 
are empty. The loser is the first player who can’t 
make a move. Use strong mathematical induction 
to show that if both piles contain the same number 
of objects at the start of the game, the player who 
goes second can always win. 


Define a game G as follows: Begin with a pile of n 
stones and 0 points. In the first move split the pile into 
two possibly unequal sub-piles, multiply the number 
of stones in one sub-pile times the number of stones in 
the other sub-pile, and add the product to your score. 
In the second move, split each of the newly created 
piles into a pair of possibly unequal sub-piles, mul- 
tiply the number of stones in each sub-pile times the 
number of stones in the paired sub-pile, and add the 
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new products to your score. Continue by successively 

splitting each newly created pile of stones that has at 

least two stones into a pair of sub-piles, multiplying 
the number of stones in each sub-pile times the num- 
ber of stones in the paired sub-pile, and adding the 
new products to your score. The game G ends when 
no pile contains more than one stone. 

a. Play G starting with 10 stones and using the fol- 
lowing initial moves. In move | split the pile of 
10 stones into two sub-piles with 3 and 7 stones 
respectively, compute 3-7 = 21, and find that 
your score is 21. In move 2 split the pile of 
3 stones into two sub-piles, with 1 and 2 stones 
respectively, and split the pile of 7 stones into 
two sub-piles, with 4 and 3 stones respectively, 
compute 1:2 = 2 and 4-3 = 12, and find that 
your score is 21 +2 + 12 = 35. In move 3 split 
the pile of 4 stones into two sub-piles, each 
with 2 stones, and split the pile of 3 stones into 
two sub-piles, with 1 and 2 stones respectively, 
and find your new score. Continue splitting 
piles and computing your score until no pile 
has more than one stone. Show your final score 
along with a record of the numbers of stones in 
the piles you created with your moves. 

b. Play G again starting with 10 stones, but use a 
different initial move from the one in part (a). 
Show your final score along with a record of 
the numbers of stones in the piles you created 
with your moves. 

c. Show that you can use strong mathematical 
induction to prove that for every integer n = 1, 
given the set-up of game G, no matter how you 
split the piles in the various moves, your final 
score is (n(n — 1))/2. The basis step may look 
a little strange because a pile consisting of one 
stone cannot be spilt into any sub-piles. Anoth- 
er way to say this is that it can only be split into 
zero piles, and that gives an answer that agrees 
with the general formula for the final score. 


24. Imagine a situation in which eight people, num- 


bered consecutively 1—8, are arranged in a circle. 
Starting from person #1, every second person in the 
circle is eliminated. The elimination process con- 
tinues until only one person remains. In the first 
round the people numbered 2, 4, 6, and 8 are elimi- 
nated, in the second round the people numbered 3 
and 7 are eliminated, and in the third round person 
#5 is eliminated, so after the third round only per- 
son #1 remains, as shown on the next page. 
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After the Ist Round After the 2nd Round After the 3rd Round 


Initial State 


ii il 1 


a. Given a set of sixteen people arranged in a 
circle and numbered, consecutively 1—16, list 
the numbers of the people who are eliminated 
in each round if every second person is elimi- 
nated and the elimination process continues 
until only one person remains. Assume that the 
starting point is person #1. 

b. Use ordinary mathematical induction to prove 
that for every integer n = 1, given any set of 
2” people arranged in a circle and numbered 
consecutively | through 2”, if one starts from 
person #1 and goes repeatedly around the 
circle successively eliminating every second 
person, eventually only person #1 will remain. 

c. Use the result of part (b) to prove that for 
any nonnegative integers n and m with 
2" = 2"+m<2""! ifr = 2" +m, then given 
any set of r people arranged in a circle and num- 
bered consecutively 1 through y, if one starts from 
person #1 and goes repeatedly around the circle 
successively eliminating every second person, 
eventually only person #(2m + 1) will remain. 


25. Find the mistake in the following “proof” that 


purports to show that every nonnegative integer 
power of every nonzero real number is 1. 


“Proof: Let r be any nonzero real number and let 
the property P(n) be the equation r” = 1. 


Show that P(O) is true: P(O) is true because r° = 1 
by definition of zeroth power. 

Show that for every integer k = 0, if P@ is 

true for each integer i from 0 through k, then 
P(k + 1) is also true: Let k be any integer with 

k = 0 and suppose that 7’ = 1 for each integer i 
from 0 through k. This is the inductive hypothesis. 
We must show that “*! = 1. Now 


perl = ktk-k-D because k + k — (k — 1) = 
k+k-k+1=k+1 

rf- rf 

=a by the laws of exponents 
r 
1-1 

= oa by inductive hypothesis 

= 1. 

ktl 
Thus r° = 1 [as was to be shown]. 


26. 


27. 


28. 


H 29. 


30. 


31. 


32. 


33. 
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[Since we have proved the basis step and the inductive 
step of the strong mathematical induction, we conclude 
that the given statement is true. ]” 


Use the well-ordering principle for the integers to 
prove Theorem 4.4.4: Every integer greater than 1 
is divisible by a prime number. 


Use the well-ordering principle for the integers to 
prove the existence part of the unique factoriza- 
tion of integers theorem. In other words, prove 
that every integer greater than | is either prime or 
a product of prime numbers. 


a. The Archimedean property for the rational 
numbers states that for every rational number 
r, there is an integer n such that n > r. Prove 
this property. 

b. Prove that given any rational number r, the 
number —r is also rational. 

c. Use the results of parts (a) and (b) to prove that 
given any rational number r, there is an integer 
m such that m < r. 


Use the results of exercise 28 and the well-ordering 
principle for the integers to show that given any 
rational number r, there is an integer m such that 
ms=r<mtl. 


Use the well-ordering principle to prove that 

given any integer n = 1, there exists an odd 

integer m and a nonnegative integer k such that 
k 

n= 2°-m. 


Give examples to illustrate the proof of 
Theorem 5.4.1. 


Suppose P(n) is a property such that 

1. PO), PC), P(2) are all true, 

2. for each integer k = 0, if P(x) is true, then 
P(3k) is true. Must it follow that P(n) is true for 
every integer n = 0? If yes, explain why; if no, 
give a counterexample. 


Prove that if a statement can be proved by strong 

mathematical induction, then it can be proved by 

ordinary mathematical induction. To do this, let 

P(n) be a property that is defined for each integer 

n, and suppose the following two statements are 

true: 

1. P(a), P(a+ 1), Pla+2),..., Pd). 

2. For any integer k = b, if P(i) is true for each 
integer i from a through k, then P(k + 1) is true. 


The principle of strong mathematical induction 
would allow us to conclude immediately that P(n) 
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is true for every integer n = a. Can we reach the where c, = 1 or 2 and c; = 0, 1, or 2 for each 
same conclusion using the principle of ordinary integer i = 0, 1, 2,..., r—1. Sketch a proof of this 
mathematical induction? Yes! To see this, let O(n) fact. 

be the property 


H* 35. Use mathematical induction to prove the existence 
P(j) is true for each integer j witha S j =n. part of the quotient-remainder theorem. In other 
words, use mathematical induction to prove that 
given any integer and any positive integer d, there 
exist integers q and r such that n = dq + r and 


Then use ordinary mathematical induction to 
show that Q(n) is true for every integer n = b. 
That is, prove: 


arag. 
1. Qb) is true. MEN 
2. For each integer k = b, if Q(k) is true then H* 36. Prove that if a statement can be proved by ordi- 
Q(k + 1) is true. nary mathematical induction, then it can be proved 


H 34. Itis a fact that every integer n = 1 can be written Dy the well-ondeting principle: 


in the form H 37. Use the principle of ordinary mathematical induc- 
: = tion to prove the well-ordering principle for the 
cp 3l + e13 TlH e3 c3 + Co, ; P E Prep 
integers. 


ANSWERS FOR TEST YOURSELF 


1. than one 2.a; k; P(k+1) 3. one integer; integer in S; S contains a least element 


E Application: Correctness of Algorithms 


[P]Jrogramming reliably—must be an activity of an undeniably mathematical nature .... 
You see, mathematics is about thinking, and doing mathematics is always trying to think 
as well as possible. —Edsger W. Dijkstra (1981) 


What does it mean for a computer program to be correct? Each program is designed 
to do a specific task—calculate the mean or median of a set of numbers, compute the 
size of the paychecks for a company payroll, rearrange names in alphabetical order, 
and so forth. We will say that a program is correct if it produces the output specified in 
its accompanying documentation for each set of input data of the type specified in the 
documentation.* 

Most computer programmers write their programs using a combination of logical anal- 
ysis and trial and error. In order to get a program to run at all, the programmer must first 
fix all syntax errors (such as writing ik instead of if, failing to declare a variable, or using 
a restricted keyword for a variable name). When the syntax errors have been removed, 
however, the program may still contain logical errors that prevent it from producing cor- 
Edsger W. Dijkstra rect output. Frequently, programs are tested using sets of sample data for which the cor- 
(1930-2002) rect output is known in advance. And often the sample data are deliberately chosen to test 
the correctness of the program under extreme circumstances. But for most programs the 
number of possible sets of input data is either infinite or unmanageably large, and so no 
amount of program testing can give perfect confidence that the program will be correct for 
all possible sets of legal input data. 


*Consumers of computer programs want an even more stringent definition of correctness. If a user puts in data 
of the wrong type, the user wants a decent error message, not a system crash. 
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Since 1967, with the publication of a paper by Robert W. Floyd,* considerable effort 
has gone into developing methods for proving programs correct at the time they are com- 
posed. One of the pioneers in this effort, Edsger W. Dijkstra, asserted that “we now take 
the position that it is not only the programmer’s task to produce a correct program but 
also to demonstrate its correctness in a convincing manner.” Another leader in the field, 
David Gries, went so far as to say that “a program and its proof should be developed hand- 
in-hand, with the proof usually leading the way.’** In this section we give an overview of 
the general format of correctness proofs and the details of one crucial technique, the Joop 
invariant procedure, and we switch from using the term program to using the more general 
term algorithm. 


Courtesy of Christiane Floyd 


Robert W. Floyd Assertions 


1936-2002 r z p : ia, See : 
f : Consider an algorithm that is designed to produce a certain final state from a certain 


initial state. Both the initial and final states can be expressed as predicates involv- 
ing the input and output variables. Often the predicate describing the initial state is 
called the pre-condition for the algorithm, and the predicate describing the final state 
is called the post-condition for the algorithm. 


Genscan Algorithm Pre-Conditions and Post-Conditions 


Here are pre- and post-conditions for some typical algorithms. 

a. Algorithm to compute a product of nonnegative integers 
Pre-condition: The input variables m and n are nonnegative integers. 
Post-condition: The output variable p equals mn. 


b. Algorithm to find quotient and remainder of the division of one positive integer by 
another 


Pre-condition: The input variables a and b are positive integers. 
Post-condition: The output variables q and r are integers such that a = bq +r and 
Osr<b. 


c. Algorithm to sort a one-dimensional array of real numbers 


Pre-condition: The input variable A[1], A[2], . . . , A[m] is a one-dimensional array of 
real numbers. 


Post-condition: The output variable B[1], B[2],..., B[n] is a one-dimensional array 
of real numbers with same elements as A[1], A[2],..., A[n] but with 
the property that B[i] = B[j] whenever i < j. E 


A proof of algorithm correctness consists of showing that if the pre-condition for the 
algorithm is true for a collection of values for the input variables and if the statements of 
the algorithms are executed, then the post-condition is also true. 


*R. W. Floyd, “Assigning meanings to programs,” Proc. Symp. Appl. Math., Amer. Math. Soc. 19 (1967), 
19-32. 


Edsger Dijkstra in O. J. Dahl, E. W. Dijkstra, and C. A. R. Hoare, Structured Programming (London: Aca- 
demic Press, 1972), p. 5. 


**David Gries, The Science of Programming (New York: Springer-Verlag, 1981), p. 164. 
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The steps of an algorithm are divided into sections with assertions about the current 
state of algorithm variables inserted at strategically chosen points: 


[Assertion 1: pre-condition for the algorithm] 
{Algorithm statements} 
[Assertion 2] 


{Algorithm statements} 


[Assertion k — 1] 
{Algorithm statements} 


[Assertion k: post-condition for the algorithm] 


Successive pairs of assertions are then treated as pre- and post-conditions for the al- 
gorithm statements between them. For each i = 1, 2,...,k—1, one proves that if Asser- 
tion 7 is true and all the algorithm statements between Assertion i and Assertion (i+ 1) 
are executed, then Assertion (i+ 1) is true. Once all these individual proofs have been 
completed, one knows that Assertion k is true. And since Assertion | is the same as the 
pre-condition for the algorithm and Assertion k is the same as the post-condition for the 
algorithm, one concludes that the entire algorithm is correct with respect to its pre- and 
post-conditions. 


Loop Invariants 


The method of loop invariants is used to prove correctness of a loop with respect to certain 
pre- and post-conditions. It is based on the principle of mathematical induction. Suppose 
that an algorithm contains a while loop and that entry to this loop is restricted by a con- 
dition G, called the guard. Suppose also that assertions describing the current states of 
algorithm variables have been placed immediately preceding and immediately following 
the loop. The assertion just preceding the loop is called the pre-condition for the loop 
and the one just following is called the post-condition for the loop. The annotated loop 
has the following appearance: 


[Pre-condition for the loop] 
while (G) 


[Statements in the body of the loop. 
None contain branching statements 
that lead outside the loop.] 


end while 


[Post-condition for the loop] 


Definition 


A loop is defined as correct with respect to its pre- and post-conditions if, and 
only if, whenever the algorithm variables satisfy the pre-condition for the loop and 
the loop terminates after a finite number of steps, the algorithm variables satisfy the 
post-condition for the loop. 


Establishing the correctness of a loop uses the concept of loop invariant. A loop invari- 
ant is a predicate with domain a set of integers, which satisfies the following condition: For 
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each iteration of the loop, if the predicate is true before the iteration, then it is true after the 
iteration. Furthermore, if the predicate satisfies the following two additional conditions, 
the loop will be correct with respect to its pre- and post-conditions: 


1. The predicate is true before the first iteration of the loop. 


2. If the loop terminates after a finite number of iterations, the truth of the loop invariant 
ensures the truth of the post-condition for the loop. 


The following theorem, called the loop invariant theorem, formalizes these ideas. It was 
first developed by C. A. R. Hoare in 1969. 


R. Stonehouse/Camera Press/Redux 


C. A. R. Hoare $ 
(born 1934) Theorem 5.5.1 Loop Invariant Theorem 


Let a while loop with guard G be given, together with pre- and post-conditions that 
are predicates in the algorithm variables. Also let a predicate J(n), called the loop 
invariant, be given. If the following four properties are true, then the loop is correct 
with respect to its pre- and post-conditions. 


I. Basis Property: The pre-condition for the loop implies that /(0) is true before 
the first iteration of the loop. 


II. Inductive Property: For every integer k = 0, if the guard G and the loop in- 
variant J(k) are both true before an iteration of the loop, then /(k + 1) is true after 
an iteration of the loop. 

III. Eventual Falsity of Guard: After a finite number of iterations of the loop, the 
guard G becomes false. 

IV. Correctness of the Post-Condition: If N is the least number of iterations after 
which G is false and J(N) is true, then the values of the algorithm variables will 
be as specified in the post-condition of the loop. 


Proof: The loop invariant theorem follows easily from the principle of mathematical 
induction. Assume that /(n) is a predicate that satisfies properties I-IV of the loop 
invariant theorem. [We will prove that the loop is correct with respect to its pre- and 
post-conditions.] Properties I and II are the basis and inductive steps needed to prove 
the truth of the following statement: 


For every integer n = 0, if the while loop 


. . ; 5S 
iterates n times, then /(n) is true. 


Thus, by the principle of mathematical induction, since both I and II are true, state- 
ment (5.5.1) is also true. 

Property III says that the guard G eventually becomes false. At that point 
the loop will have been iterated some number, say N, of times. Since /(7) is true 
after the nth iteration for every n = 0, then J(n) is true after the Nth iteration. 
That is, after the Nth iteration the guard is false and J(N) is true. But this is the 
hypothesis of property IV, which is an if-then statement. Since statement IV is 
true (by assumption) and its hypothesis is true (by the argument just given), it 
follows (by modus ponens) that its conclusion is also true. That is, the values of 
all algorithm variables after execution of the loop are as specified in the post- 
condition for the loop. 
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Developing a good loop invariant is a tricky process. Although learning how to do it is 
beyond the scope of this book, it is worth pursuing in a more advanced course. 

Another tricky aspect of handling correctness proofs arises from the fact that execution 
of an algorithm is a dynamic process—it takes place in time. As execution progresses, the 
values of variables keep changing, yet often their names stay the same. In the following 
discussion, when we need to make a distinction between the values of a variable just before 
execution of an algorithm statement and just after execution of the statement, we will at- 
tach the subscripts old and new to the variable name. 


CAUSE Correctness of a Loop to Compute a Product 


The following loop is designed to compute the product mx for a nonnegative integer 
m and a real number x, without using a built-in multiplication operation. Prior to the 
loop, variables i and product have been introduced and given initial values i = 0 and 
product = 0. 


[Pre-condition: m is a nonnegative integer, 
x is a real number, i = 0, and product = 0.] 


while (i # m) 
1. product := product + x 
2.i:=i+1 

end while 


[Post-condition: product = mx] 


Let the loop invariant be 


(n):i=n and product = nx 


The guard condition G of the while loop is 


G:i#m 


Use the loop invariant theorem to prove that the while loop is correct with respect to the 
given pre- and post-conditions. 


Solution 


I. Basis Property: /I(0) is true before the first iteration of the loop.] 
IO) is “i = 0 and product = 0-x,” which is true before the first iteration of the 
loop because 0-x = 0. 


II. Inductive Property: /If G A^ K(k) is true before a loop iteration (where k = 0), then 
I(k + 1) is true after the loop iteration.] 
Suppose k is a nonnegative integer such that G ^ J(k) is true before an iteration 
of the loop. Then as execution reaches the top of the loop, i # m, product = kx, and 
i = k. Since i # m, the guard is passed and statement | is executed. Before execution 
of statement 1, 


producta = kx. 
Thus execution of statement 1 has the following effect: 


product,.y = productyjy +x = kxt+x = (k+ 1)x. 
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Similarly, before statement 2 is executed, 
laid = k, 
so after execution of statement 2, 


i =i tl =k+1. 


new 


Hence after the loop iteration, the statement (k+ 1), namely, (i = k+ 1 and 
product = (k + 1)x), is true. This is what we needed to show. 


II. Eventual Falsity of Guard: [After a finite number of iterations of the loop, G 
becomes false.] 
The guard G is the condition i # m, and m is a nonnegative integer. By I and II, it 
is known that 


for every integer n = 0, if the loop is iterated 
n times, then i = n and product = nx. 


So after m iterations of the loop, i = m. Thus G becomes false after m iterations of 
the loop. 


IV. Correctness of the Post-Condition: /If N is the least number of iterations after which 
G is false and I(N) is true, then the value of the algorithm variables will be as specified in 
the post-condition of the loop.] 

According to the post-condition, the value of product after execution of the loop 
should be mx. But if G becomes false after N iterations, i = m. And if [(N) is true, 
i = N and product = Nx. Since both conditions (G false and J(N) true) are satisfied, 
m = i = N and product = mx as required. E 


In the remainder of this section, we present proofs of the correctness of the crucial loops 
in the division algorithm and the Euclidean algorithm. (These algorithms were given in 
Section 4.10.) 


Correctness of the Division Algorithm 


The division algorithm is supposed to take a nonnegative integer a and a positive integer d 
and compute nonnegative integers q and r such that a = dq +r and0 =r < d. Initially, the 
variables r and q are introduced and given the values r = a and q = 0. The crucial loop, 
annotated with pre- and post-conditions, is the following: 


[Pre-condition: a is a nonnegative integer 
and d is a positive integer, r = a, and q = 0.] 


while (r = d) 


l.r:=r-d 
2.q:=q+1 
end while 


[Post-condition: q and r are nonnegative integers 
with the property thata = qd+r and0 Sr < d.] 


Proof: 
To prove the correctness of the loop, let the loop invariant be 


In:r=a-nd=0 and n=q. 
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The guard of the while loop is 


G:r=d. 


I. Basis Property: /1(0) is true before the first iteration of the loop.] 
IQ) is “r = a—0-d = 0 and q = 0.” But by the pre-condition, r = a, a = 0, and 
q = 0. So since a = a—0-d, then r = a—0-d and (0) is true before the first iteration 
of the loop. 


II. Inductive Property: /If G ^ K(k) is true before an iteration of the loop (where k = 0), 
then I(k + 1) is true after iteration of the loop.] 
Suppose k is a nonnegative integer such that G ^ J(k) is true before an iteration 
of the loop. Since G is true, r = d and the loop is entered. Also since /(k) is true, 
r= a-—kd=Oandk = q. Hence, before execution of statements 1 and 2, 


Fold =d and Foda — 47 kd and dold = k. 


When statements 1 and 2 are executed, then 


Fhew — Fold d= (a kd) d=a (k+ 1)d S2 


and 
Anew — old PESKE 532 


In addition, since r qa = d before execution of statements 1 and 2, after execution of 
these statements, 


Trew = tair d z d=d = 0, 5.54 


Putting equations (5.5.2), (5.5.3), and (5.5.4) together shows that after iteration of the 
loop, 


rew Z0 and pew =a-—(kK+1)d and dry =kKt+1. 


Hence /(k + 1) is true. 


II. Eventual Falsity of the Guard: [After a finite number of iterations of the loop, G be- 
comes false.] 

The guard G is the condition r = d. Each iteration of the loop reduces the value 
of r by d and yet leaves r nonnegative. Thus the values of r form a decreasing se- 
quence of nonnegative integers, and so (by the well-ordering principle) there must be 
a smallest such r, say “pin. Then rin < d. [If fmin were greater than d, the loop would 
iterate another time, and a new value of r equal to rin — d would be obtained. But this 
new value would be smaller than rpin, which would contradict the fact that fmin is the 
smallest remainder obtained by repeated iteration of the loop.] Hence as soon as the 
value r = Fmin iS Computed, the value of r becomes less than d, and so the guard G 
is false. 


IV. Correctness of the Post-Condition: [Jf N is the least number of iterations after which 
G is false and I(N) is true, then the values of the algorithm variables will be as specified 
in the post-condition of the loop.] 

Suppose that for some nonnegative integer N, G is false and J(N) is true. Then 
r<d,r=a-—Nd,r= 0, and q = N. Since q = N, substitution gives 


r=a-—qd, 
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and adding gd to both sides produces 
a=qd+r. 

Combining the two inequalities involving r gives 
Osr<d. 


Because these are the values of g and r specified in the post-condition, the proof is 
complete. E 


Correctness of the Euclidean Theorem 


The Euclidean algorithm is supposed to take integers A and B with A > B = 0 and com- 
pute their greatest common divisor. Just before the crucial loop, variables a, b, and r have 
been introduced with a = A, b = B, and r = B. The crucial loop, annotated with pre- and 
post-conditions, is the following: 


[Pre-condition: A and B are integers 
with A > B = 0,a = A,b = B,r = B.] 
while (b + 0) 


1l.r:= a mod b 


2.a:=b 
3.b:=r 
end while 


[Post-condition: a = gcd(A, B).] 


Proof: 
To prove the correctness of the loop, let the invariant be 


In): gcd(a, b) = gcd(A, B) and OSb<a. 


The guard of the while loop is 


G:b #0. 


I. Basis Property: /1(0) is true before the first iteration of the loop.] 
1(0) is 


gcd(A, B) = gcd(a, b) and OSb<a. 
According to the pre-condition, 


a=A, b=B, r=B8B, and O0=B<A. 


Hence gcd(A, B) = gcd(a, b). Since 0 = B < A, b = B, and a = A then0=b<a. 
Hence /(0) is true. 


II. Inductive Property: [Jf G ^ K(k) is true before an iteration of the loop (where k = 0), 
then I(k + 1) is true after iteration of the loop.] 

Suppose k is a nonnegative integer such that G ^ K(k) is true before an iteration of 

the loop. [We must show that I(k + 1) is true after iteration of the loop.] Since G is true, 
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baa # O and the loop is entered. And since /(k) is true, immediately before statement 1 
is executed, 


gcd(doig, boi) = ged(A, B) and 0 S dog < dou. 5.5.5 
After execution of statement 1, 
Thew = Agia Mod Doig. 
Thus, by the quotient-remainder theorem, 
oid = Doi’ + new for some integer q 
and Few has the property that 
0 = few < Dota: 5.5.6 
By Lemma 4.10.2, 
gcd(oia, Dota) = BCA DoIa new). 
So by the equation of (5.5.5), 
gcd(Doia, fnew) = ged(A, B). SAI 
When statements 2 and 3 are executed, 
Anew = boa and Dnew = Fnew- 55:9 
Substituting equations (5.5.8) into equation (5.5.7) yields 
gcd(anew Onew) = gcd(A, B). 3.39 
And substituting the values from the equations in (5.5.8) into inequality (5.5.6) gives 
0S bien < Aien: 5.5.10 
Hence after the iteration of the loop, by equation (5.5.9) and inequality (5.5.10), 
gcd(a, b) = gcd(A, B) and O0=b<a, 
which is Z(k + 1). [This is what we needed to show.] 


II. Eventual Falsity of the Guard: [After a finite number of iterations of the loop, G be- 
comes false.] 

Each value of b obtained by repeated iteration of the loop is nonnegative and less 
than the previous value of b. Thus, by the well-ordering principle, there is a least value 
bmin. The fact is that bmin = 0. [Tf Dyin is not 0, then the guard is true, and so the loop 
is iterated another time. In this iteration a value of r is calculated that is less than the 
previous value of b, bmin. Then the value of b is changed to r, which is less than Dyin. 
This contradicts the fact that bin is the least value of b obtained by repeated iteration 
of the loop. Hence bmin = 0.) Since Dyin = O, the guard is false immediately following 
the loop iteration in which bnin is calculated. 


IV. Correctness of the Post-Condition: /If N is the least number of iterations after which 
G is false and I(N) is true, then the values of the algorithm variables will be as specified 
in the post-condition.] 

Suppose that for some nonnegative integer N, G is false and (N) is true. [We must 
show the truth of the post-condition: a = gcd(A, B).] Since G is false, b = 0, and since 
I(N) is true, 


gcd(a, b) = gcd(A, B). 5.5.11 
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Substituting b = 0 into equation (5.5.11) gives 


But by Lemma 4.10.1, 


gcd(a, 0) = gced(A, B). 


gcd(a, 0) = a. 


Hence a = gcd(A, B) [as was to be shown]. 


TEST YOURSELF 


1. A pre-condition for an algorithm is —— anda 
post-condition for an algorithm is 


2. A loop is defined as correct with respect to its 
pre- and post-conditions if, and only if, whenever 
the algorithm variables satisfy the pre-condition 
for the loop and the loop terminates after a finite 
number of steps, then 


3. For each iteration of a loop, if a loop invariant is 
true before iteration of the loop, then 


4. Given a while loop with guard G and a predicate 
I(n) if the following four properties are true, then 


the loop is correct with respect to its pre- and 
post-conditions: 


(a) The pre-condition for the loop implies that 

before the first iteration of the loop. 

(b) For every integer k = 0, if the guard G and the 
predicate /(k) are both true before an iteration 
of the loop, then _____. 

(c) After a finite number of iterations of the 
loop, —____.. 

(d) If N is the least number of iterations after 
which G is false and (N) is true, then the 
values of the algorithm variables will be as 
specified 


EXERCISE SET 5.5 


Exercises 1-5 contain a while loop and a predicate. In 
each case show that if the predicate is true before entry 
to the loop, then it is also true after exit from the loop. 


1. loop: while (m = 0 and m = 100) 


m:=mt+ 1 
n:=n-l1 
end while 


predicate: m +n = 100 
2. loop: while (m = 0 and m = 100) 


. loop: while (n = 0 and n = 100) 


n:=n+1 
end while 


predicate: 2” < (n + 2)! 


. loop: while (n = 3 and n < 100) 


n:=n+1 
end while 


predicate: 2n + 1 < 2” 


Exercises 6-9 each contain a while loop annotated with a 


m:=m+4 
ni=n-2 
end while 


predicate: m+n is odd 


3. loop: while (m = 0 and m = 100) 


m:=3-m 
n:=5-n 
end while 


predicate: m>n 


pre- and a post-condition and also a loop invariant. In each 
case, use the loop invariant theorem to prove the correct- 
ness of the loop with respect to the pre- and post-conditions. 


6. [Pre-condition: m is a nonnegative integer, x is a 
real number, i = 0, and exp = 1.] 
while (i # m) 
l. exp := exp’x 
2.i:=it1 
end while 
[Post-condition: exp = x” 
loop invariant: [(n) is “exp = x” and i = n.” 
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[Pre-condition: largest = A[\] andi = 1] 
while (i # m) 
Li:=i+l 
2. if A[i] > largest then largest := A[i] 
end while 
[Post-condition: largest = maximum value of 
A[1], A[2],..., Alm]/ 


loop invariant: I (n) is “largest = maximum value 
of A[1], A[2],..., Alm + 1] andi = n+ 1.” 


. [Pre-condition: sum = A[1] andi = 1] 


while (i # m) 


li:=itl 
2. sum: = sum + Ali] 
end while 


[Post-condition: sum = A[1] + A[2] +--+: + A[m]] 


loop invariant: J(n) is “i = n + 1 and sum = A[1] + 
A[2] +++: +A[n+ 1] 


. [Pre-condition: a = A and A is a positive integer] 


while (a > 0) 
a:=a-2 
end while 


[Post-condition: a = 0 if A is even anda = —1ifA 
is odd.] 
loop invariant: J(7) is “Both a and A are even in- 


tegers or both are odd integers and, in either case, 
a=-1.” 


Prove correctness of the while loop of Algorithm 
4.10.3 (in exercise 27 of Exercise Set 4.10) with 
respect to the following pre- and post-conditions: 


Pre-condition: A and B are positive integers, 


a = A, and b = B. 

One of a or b is zero and the 
other is nonzero. Whichever is 
nonzero equals gcd(A, B). 


Post-condition: 


Use the loop invariant 


I(n) “(1) a and b are nonnegative integers with 
gcd(a, b) = gcd(A, B), 
(2) at most one of a and b equals 0, 
3)O0Sat+tbsAt+B-n.” 


The following while loop implements a way to 
multiply two numbers that was developed by the 
ancient Egyptians. 
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[Pre-condition: A and B are positive integers, 
x =A, y = B, and product = 0.] 


while (y # 0) 
r:=ymod2 
ifr =0 
then do x := 2-x 
yi=ydiv2 
end do 
ifr = 1 
then do product := product + x 
yi=y-l 
end do 


end while 


[Post-condition: product = A-B] 

a. Make a trace table to show that the algorithm 
gives the correct answer for multiplying 
A = 13 times B = 18. 

b. Prove the correctness of this loop with respect 
to its pre- and post-conditions by using the 
loop invariant 


I(n): “xy + product = A-B.” 


The following sentence could be added to the loop 
invariant for the Euclidean algorithm: 


There exist integers u, v, s, and t such that 
a=uA+vB and b= sA+ tB. 532 


a. Show that this sentence is a loop invariant for 


while (b + 0) 


r:= a mod b 


a:=b 
b:=r 
end while 


b. Show that if initially a = A and b = B, then 
sentence (5.5.12) is true before the first itera- 
tion of the loop. 

c. Explain how the correctness proof for the Eu- 
clidean algorithm together with the results of (a) 
and (b) above allow you to conclude that given 
any integers A and B with A > B = 0, there ex- 
ist integers u and v so that gcd(A, B) = uA + vB. 

d. By actually calculating u, v, s, and t at each 
stage of execution of the Euclidean algorithm, 
find integers u and v so that gcd(330, 156) = 
330u + 156v. 
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ANSWERS FOR TEST YOURSELF 


1. a predicate that describes the initial state of the input the loop 3. it is true after iteration of the loop 4. (a) 1(0) 
variables for the algorithm; a predicate that describes the is true (b) /(k+1) is true after the iteration of the loop 
final state of the output variables for the algorithm (c) the guard G becomes false (d) in the post-condition of 
2. the algorithm variables satisfy the post-condition for the loop 


EJ Defining Sequences Recursively 


So, Nat’ralists observe, a Flea/Hath smaller Fleas that on him prey,/And these have 
smaller Fleas to bite ’em,/And so proceed ad infinitum. —Jonathan Swift, 1733 


A sequence can be defined in a variety of different ways. One informal way is to write the 
first few terms with the expectation that the general pattern will be obvious. We might say, 
for instance, “consider the sequence 3, 5, 7,....’ Unfortunately, misunderstandings can 
occur when this approach is used. The next term of the sequence could be 9 if we mean a 
sequence of odd integers, or it could be 11 if we mean the sequence of odd prime numbers. 

The second way to define a sequence is to give an explicit formula for its nth term. For 
example, a sequence dp, 41, ad... can be specified by writing 


(-1)”" 
n+1 


The advantage of defining a sequence by such an explicit formula is that each term of the 
sequence is uniquely determined and can be computed in a fixed, finite number of steps 
by substitution. 

The third way to define a sequence is to use recursion, as was done in Examples 5.3.3, 
5.4.2, and 5.4.3. This requires giving both an equation, called a recurrence relation, that 
defines each later term in the sequence by reference to earlier terms and also one or more 
initial values for the sequence. Sometimes it is very difficult or impossible to find an 
explicit formula for a sequence, but it is possible to define the sequence using recursion. 
Defining sequences recursively is similar to proving theorems by mathematical induction. 
The recurrence relation is like the inductive step and providing initial values is like proving 
the basis step. Indeed, the fact that sequences can be defined recursively is equivalent to 
the fact that mathematical induction works as a method of proof. 


a, = for every integer n = 0. 


Definition 


A recurrence relation for a sequence ap, a), d>,...1s a formula that relates each 
term a, to certain of its predecessors ay_), Gy_,.--, Qy_;, Where i is an integer with 
k—i= 0. If i is a fixed integer, the initial conditions for such a recurrence relation 
specify the values of do, ay, d>,..., aj}. If i depends on k, the initial conditions 
specify the values of do, 41, . . . , Am, Where m is an integer with m = 0. 


Computing Terms of a Recursively Defined Sequence 


Define a sequence Co, C1, C2, . . . recursively as follows: For every integer k = 2, 


(1) ck = Ck-1 + keg + 1 recurrence relation 
(2) co 


1 and c, =2 initial conditions 


Find Cy, C3, and C4. 
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Solution Co = Cy +2eg +1 by substituting k = 2 into (1) 
=2+2:1+1 since cı = 2 and cp = 1 by (2) 
(3) .c@2 =5 
C3 = Co +3c, +1 by substituting k = 3 into (1) 
=51+3:2+1 since cy = 5 by (3) and cı = 2 by (2) 
(4) ..c3 = 12 
C4 = €3+4ce,4+ 1 by substituting k = 4 into (1) 
=12+4:54+1 since cz = 12 by (4) and c2 = 5 by (3) 
(5) <é = 33 a 


A given recurrence relation may be expressed in several different ways. 


Seen Writing a Recurrence Relation in More Than One Way 


Note Think of the Let so, 51, 52,... be a sequence that satisfies the following recurrence relation: 
recurrence relation as ; 
sp = 3sp-1 — 1, where For every integer k = 1, sy = 3s}17 1. 


any positive integer ex- 
pression may be placed in 
the box 


Explain why the following statement is true: 


For every integer k = 0, S44) = 35,—1. 


Solution In informal language, the recurrence relation says that any term of the se- 
quence equals 3 times the previous term minus 1. Now for any integer k = 0, the term 
previous to 5,41 is s,. Thus for any integer k = 0, Sk+1 = 3s}— 1. E 


A sequence defined recursively need not start with a subscript of zero. Also, a given 
recurrence relation may be satisfied by many different sequences; the actual values of the 
sequence are determined by the initial conditions. 


SEWICEXA Sequences That Satisfy the Same Recurrence Relation 


Let a}, ao, a3, . . . and by, b2, b3, . . . satisfy the recurrence relation that the kth term equals 3 
times the (k — 1)st term for every integer k = 2: 


(1) 4kg = 3ay_} and by = 3b- 
But suppose that the initial conditions for the sequences are different: 
(2) q4 =2 and ġ=l. 


Find (a) a2, A3, A4 and (b) bo, b3, b4. 


Solution 
a. dy = 3a, =3:2=6 b. b, = 3b, = 3-1 =3 
az = 3a, = 3:6 = 18 b, = 3b, = 3:3 =9 
a4 = 3a; = 3-18 = 54 by = 3b, = 3:9 = 27 
Thus 


a), dp, a3, . . . begins 2, 6, 18, 54,... and 
by, bo, b3,... begins 1, 3, 9, 27,.... | 
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Showing That a Sequence Given by an Explicit Formula Satisfies 
a Certain Recurrence Relation 


The sequence of Catalan numbers, named after the Belgian mathematician Eugéne 
Catalan (1814-1894), arises in a remarkable variety of different contexts in discrete math- 
ematics. It can be defined as follows: For each integer n = 1, 


1 /2n 
C, = ; 
n+1\n 
a. Find C}, C2, and C3. 


b. Show that this sequence satisfies the recurrence relation C = tC, for every 


integer k = 2. 


Solution 


aal Veo. Ge |S ee: BH l= oes 
ae a TS aa eR Taa) g 


b. To obtain the kth and (k — 1)st terms of the sequence, just substitute k and k — | in 
place of n in the explicit formula for C4, C2, C3,.... 


c- 1 (% 

k k+I\k 
— 1 2k-1)\ _ 1/2k-2 
RIT (k-1+t\ k-1 J) k\k-1/ 


Then start with the right-hand side of the recurrence relation and transform it into the left- 
hand side: For each integer k = 2, 


Art Collection 3/Alamy Stock Photo 


Eugène Catalan 
(1814-1894) 


aa ae E acea] by substituting 
k+1 k+1|[k\ k-1 
2(2k—-1) 1 (2k — 2)! 
= = by the formula for n choose r 
k+1 k (k—1)'(2k—2—(k—1))! 
= l *(2(2k—1))- sami by rearranging the factors 
k+l (k(k— 1)(k— 1)! 
spoki ie ie oe Ga es 
k+1 k!(k— 1)! 2 k 2k 


1 21 1 1l 
k+1 2 k! (k-11)! k 


(2k): (2k— 1): (2k— 2)! by rearranging the factors 


! 2 
= aa because k(k— 1)! = k!,= = 1, 
en tae and 2k-(2k — 1): (2k — 2)! = (2k)! 
1 /2k 
= k+i\k by the formula for n choose r 
= C; by definition of C,, Cz, C3,.... 
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Examples of Recursively Defined Sequences 


Recursion is one of the central ideas of computer science. To solve a problem recursively 
means to find a way to break it down into smaller subproblems each having the same form 
as the original problem—and to do this in such a way that when the process is repeated 
many times, the last of the subproblems are small and easy to solve and the solutions of the 
subproblems can be woven together to form a solution to the original problem. 

Probably the most difficult part of solving problems recursively is to figure out how 
knowing the solution to smaller subproblems of the same type as the original problem 
will give you a solution to the problem as a whole. You suppose you know the solutions to 
smaller subproblems and ask yourself how you would best make use of that knowledge to 
solve the larger problem. The supposition that the smaller subproblems have already been 
solved has been called the recursive paradigm or the recursive leap of faith. Once you take 
this leap, you are right in the middle of the most difficult part of the problem, but generally, 
the path to a solution from this point, though difficult, is short. The recursive leap of faith 
is similar to the inductive hypothesis in a proof by mathematical induction. 


The Tower of Hanoi 


In 1883 a French mathematician, Edouard Lucas, invented a puzzle that he called the Tower 
of Hanoi (La Tour D’Hanot). The puzzle consisted of eight disks of wood with holes in their 
centers, which were piled in order of decreasing size on one pole in a row of three. A facsimile 
of the cover of the box is shown in the figure below. Those who played the game were sup- 
posed to move all the disks one by one from one pole to another, never placing a larger disk on 
top of a smaller one. The directions to the puzzle claimed it was based on an old Indian legend: 

On the steps of the altar in the temple of Benares, for many, many years Brahmins have 
been moving a tower of 64 golden disks from one pole to another; one by one, never plac- 
ing a larger on top of a smaller. When all the disks have been transferred the Tower and 
the Brahmins will fall, and it will be the end of the world. 

The puzzle offered a prize of ten thousand francs (about $45,000 US today) to anyone 
who could move a tower of 64 disks by hand while following the rules of the game. (See 
Edouard Lucas Figure 5.6.1 on the following page.) Assuming that you transferred the disks as efficiently 
(184251831) as possible, how many moves would be required to win the prize? 


Paul Fearn / Alamy Stock Photo 


Science et Nature, vol. 1 (1884) number 8 (January 8), 127—128. Image Courtesy of 


From the paper "La Tour d'Hanoi" by N. Claus (pseudonym for Edouard Lucas), in 
Paul Stockmeyer. 
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FIGURE 5.6.1 


Solution An elegant and efficient way to solve this problem is to think recursively. Sup- 
pose that you, somehow or other, have found the most efficient way possible to transfer a 
tower of k — 1 disks one by one from one pole to another, obeying the restriction that you 
never place a larger disk on top of a smaller one. What is the most efficient way to transfer 
a tower of k disks from one pole to another? The answer is sketched in Figure 5.6.2, where 
pole A is the initial pole and pole C is the target pole. 


Step 1 : Transfer the top k — 1 disks from pole A to pole B. If k > 2, execution of this step 
will require a number of moves of individual disks among the three poles. But the 
point of thinking recursively is not to get caught up in imagining the details of how 
those moves will occur. 


Step 2 : Move the bottom disk from pole A to pole C. 


Step 3 : Transfer the top k— 1 disks from pole B to pole C. (Again, if k > 2, execution of 
this step will require more than one move.) 


To see that this sequence of moves is most efficient, observe that to move the bottom 
disk of a stack of k disks from one pole to another, you must first transfer the top k — 1 disks 
to a third pole to get them out of the way. Thus transferring the stack of k disks from pole A 
to pole C requires at least two transfers of the top k — 1 disks: one to transfer them off the 
bottom disk to free the bottom disk so that it can be moved and another to transfer them 
back on top of the bottom disk after the bottom disk has been moved to pole C. If the bot- 
tom disk were not moved directly from pole A to pole C but were moved to pole B first, at 
least two additional transfers of the top k — 1 disks would be necessary: one to move them 
from pole A to pole C so that the bottom disk could be moved from pole A to pole B and 
another to move them off pole C so that the bottom disk could be moved onto pole C. This 
would increase the total number of moves and result in a less efficient transfer. 

Hence the minimum sequence of moves must include going from the initial position (a) 
to position (b) to position (c) to position (d). It follows that 


the minimum the minimum the minimum the minimum 
Note Defining the . number of moves number of number of number of 
sequence symbolically is 
a crucial step in solving needed to tranfer | _ | moves needed $ moves needed = moves needed sii 
the problem. The recur- a tower of k disks to go from to go from to go from Bi 
rence relation and initial from pole A to position (a) position (b) position (c) 
diti ifi w ae ngs 
conditionis ate specitied pole C to position (b) to position (c) to position (d) 


in terms of the sequence. 
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Tower of 


A B C 
Initial Position 
(a) 


Tower of k— 1 


f disks 


A B Ç 
Position after Transferring k — 1 Disks from A to B 
(b) 


Tower of k— 1 


/ disks 


A B C 
Position after Moving the Bottom Disk from A to C 
(c) 
Tower of 


A B C 
Position after Transferring k — 1 Disks from B to C 
(d) 


FIGURE 5.6.2 Moves for the Tower of Hanoi 
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For each integer n = 1, let 


__ | the minimum number of moves needed to transfer 
” a tower of n disks from one pole to another 


Note that the numbers m, are independent of the labeling of the poles; it takes the same 
minimum number of moves to transfer n disks from pole A to pole C as to transfer n disks 
from pole A to pole B, for example. Also the values of m, are independent of the number of 
larger disks that may lie below the top n, provided these remain stationary while the top n 
are moved. Because the disks on the bottom are all larger than the ones on the top, the top 
disks can be moved from pole to pole as though the bottom disks were not present. 

Going from position (a) to position (b) requires m,_; moves, going from position (b) to 
position (c) requires just one move, and going from position (c) to position (d) requires mg—1 
moves. By substitution into equation (5.6.1), therefore, 

my = Mmg + 1+ mg 


= 2m,_,+ 1 for every integer k = 2. 


The initial condition, or base, of this recursion is found by using the definition of the se- 
quence. Because just one move is needed to move one disk from one pole to another, 


foe the minimum number of moves needed to move | _ 
i a tower of one disk from one pole to another 


Hence the complete recursive specification of the sequence mı, m2, m3, ...is as follows: 
For every integer k = 2, 

(1) m, = 2m; +1 recurrence relation 

(2)m=1 initial conditions. 


Here is a computation of the next five terms of the sequence: 


(3) m =2m +1=2-1+1=3 by (1) and (2) 

(4) m, = 2m +1 =2:3+1=7 by (1) and (3) 

(5) mg = 2m, +1 =2:-7+1=15 by (1) and (4) 

(6) m; = 2mgt+ 1 =2:-15+1=31 by (1) and (5) 

(7) mę = 2m; +1 =2-31+1 = 63 by (1) and (6). 
Going back to the legend, suppose the priests work rapidly and move one disk every 
second. Then the time from the beginning of creation to the end of the world would be 
Moa seconds. In the next section we derive an explicit formula for m„. Meanwhile, we can 


compute 77164 on a calculator or a computer by continuing the process started above. (Try 
it!) The approximate result is 


1.844674 X 10'° seconds = 5.84542 X 10!! years 
584.5 billion years, 


Ih 


which is obtained by the estimate of 


60 - 60 - 24 - (365.25) = oo 
Ty 


1 7 % 
minutes hours days seconds 
seconds per per per per per 
minute hour day year year 
seconds in a year (figuring 365.25 days in a year to take leap years into account). a 
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The Fibonacci Numbers 


One of the earliest examples of a recursively defined sequence occurs in the writings of 
Leonardo of Pisa, commonly known as Fibonacci, who was the greatest European math- 
ematician of the Middle Ages and promoted the use of Hindu-Arabic numerals for calcula- 
tion. In 1202 Fibonacci posed the following problem: 

A single pair of rabbits (male and female) is born at the beginning of a year. Assume 
the following conditions: 


1. Rabbit pairs are not fertile during their first month of life but thereafter give birth to one 
new male/female pair at the end of every month. 


2. No rabbits die. 


Bettmann/Getty Images 


How many rabbits will there be at the end of the year? 
Fibonacci (Leonardo of 


Pisa) (ca. 1175-1250) Solution One way to solve this problem is to plunge right into the middle of it using 
recursion. Suppose you know how many rabbit pairs there were at the ends of previous 
months. How many will there be at the end of the current month? 

The crucial observation is that the number of rabbit pairs born at the end of month k is 
the same as the number of pairs alive at the end of month k — 2. Why? Because it is exactly 
the rabbit pairs that were alive at the end of month k — 2 that were fertile during month k. 
The rabbits born at the end of month k — 1 were not. 


month k—2 k~ 1 k 
| 


t l 
Each pair alive here f gives birth to a pair here + 


Now the number of rabbit pairs alive at the end of month k equals the ones alive at the end 
of month k — 1 plus the pairs newly born at the end of the month. Thus 


Note Itis essential to the number the number the number 
rephrase this observation of rabbit of rabbit of rabbit 
in terms of a sequence. pairs alive | = | pairs alive +| pairs born 
at the end at the end at the end 
of month k of month k— 1 of month k 
the number the number 
of rabbit of rabbit 
= | pairs alive + | pairs alive : 56:2 
at the end at the end 
of month k— 1 of month k— 2 


For each integer n = 1, let 


F = the number of rabbit pairs 
ý alive at the end of month n 


and let 


Fo = the initial number of rabbit pairs 
1. 


Then by substitution into equation (5.6.2), for every integer k = 2, 


Fy = Fy + Fk- 
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Now Fo = 1, as already noted, and F; = 1 also, because the first pair of rabbits is not fer- 
tile until the second month. Hence the complete specification of the Fibonacci sequence is 
as follows: For every integer k = 2, 


(1) Fy = Fy-y + Fg-2 recurrence relation 
(2) Fy =1, Fy =1 initial conditions. 


To answer Fibonacci’s question, compute F», F3, and so forth through F3: 


(3) Fp, =F, + Fo =1+1 =2 by (1) and (2) 

(4) Fz = F + FF, =24+1 = by (1), (2), and (3) 
(5) Fy = F3+ Fp =34+2 = by (1), (3), and (4) 
(6) F; = Fyt+ F3=5+4+3 = 8 by (1), (4), and (5) 
(7) Fg =F5+F4=8+5 = 13 by (1), (5), and (6) 
(8) F7 = Fe t+ F; = 13+8 =21 by (1), (6), and (7) 
(9) Fg = F7+ Fe = 214+ 13 = 34 by (1), (7), and (8) 
(10) Fo = Fg + F} = 34 +21 = 55 by (1), (8), and (9) 
(1) Fio = Fo + Fg = 55 +34 = 89 by (1), (9), and (10) 
(12) Fy, = Fio t+ Fo = 89+55 = 144 by (1), (10), and (11) 
(13) Fy. = Fy, + Fio = 1444+ 89 = 233 by (1), (11), and (12) 

At the end of the twelfth month there are 233 rabbit pairs, or 466 rabbits in all. E 


ZEWA A Compound Interest 


On your twenty-first birthday you get a letter informing you that on the day you were born 
an eccentric rich aunt deposited $100,000 in a bank account earning 4% interest com- 
pounded annually and she now intends to turn the account over to you, provided you can 
figure out how much it is worth. What is the amount currently in the account? 


Solution To approach this problem recursively, observe that 


the amount in the amount in the interest 
the account at | _ | the account at earned on the 
the end of any | | the end of the account during 
particular year previous year the year 


Now the interest earned during the year equals the interest rate, 4% = 0.04 times the 
amount in the account at the end of the previous year. Thus 


the amount in the amount in the amount in 
the account at the account at the account at 

= + (0. 3 : 6. 
the end of any the end of the (0.04) the end of the lad 


particular year previous year previous year 


For each positive integer n, let 


step is to define the An = 


sequence symbolically. 


Note Again, a crucial 
at the end of yearn 


the amount in the 2 


and let 


_ if initial amount 
Ap = 


in the account l = $100,000. 
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Then for any particular year k, substitution into equation (5.6.3) gives 
Ax = Åk—1 F (0.04) -A-1 
= (1 +0.04)-A;—1 = (1.04) -A}—1 by factoring out A;—1- 

Consequently, the values of the sequence Ag, Aj, A>, . . . are completely specified as follows: 
for each integer k = 1, 

(1) Ag = (1.04)-A,_1 recurrence relation 

(2) Ag = $100,000 initial condition. 
The number 1.04 is called the growth factor of the sequence. 

In the next section we derive an explicit formula for the value of the account in any 
year n. The value on your twenty-first birthday can also be computed by repeated substitu- 
tion as follows: 

(3) Ay = 1.04-Ap = (1.04)-$100,000 = $104,000 by (1) and (2) 
(4) Ay 1.04-A,; = (1.04)-$104,000 = $108,160 by (1) and (3) 
(5) A; = 1.04-A, = (1.04)-$108,160 = $112,486.40 by (1) and 4) 


(22) A» = 1.04+Ayo = (1.04)-$210,684.92 = $219,112.31 by (1) and 21) 
(23) Ap, = 1.04+Ay9 = (1.04)-$219,112.31 = $227,876.81 by (1) and (22) 


The amount in the account is $227,876.81 (to the nearest cent). Fill in the dots (to check the 
arithmetic) and collect your money! E 


Compound Interest with Compounding Several Times a Year 


When an annual interest rate of i is compounded m times per year, the interest rate paid per 
period is i/m. (For instance, if 3% = 0.03 annual interest is compounded quarterly, then 
the interest rate paid per quarter is 0.03/4 = 0.0075.) 

For each integer k = 1, let P, = the amount on deposit at the end of the kth period, 
assuming no additional deposits or withdrawals. Then the interest earned during the kth 
period equals the amount on deposit at the end of the (k — 1)st period times the interest 
rate for the period: 

interest earned during kth period = P,_, (). 
The amount on deposit at the end of the kth period, P,, equals the amount at the end of the 
(k — 1)st period, P,_ , plus the interest earned during the kth period: 


P= Past Pea( 4) = Peat +4) 5.6.4 


Suppose $10,000 is left on deposit at 3% compounded quarterly. 

a. How much will the account be worth at the end of one year, assuming no additional 
deposits or withdrawals? 

b. The annual percentage yield (APY) is the percentage increase in the value of the ac- 
count over a one-year period. What is the APY for this account? 

Solution 


a. For each integer n = 1, let P,, = the amount on deposit after n consecutive quarters, 
assuming no additional deposits or withdrawals, and let Pp be the initial $10,000. Then 
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by equation (5.6.4) with i = 0.03 and m = 4, a recurrence relation for the sequence 
Po, P, Po,...48 

(1) P = P-1(1 + 0.0075) = (1.0075)-P,_; for every integer k = 1. 

The amount on deposit at the end of one year (four quarters), P4, can be found by 

successive substitution: 

(2) Po = $10,000 

(3) P, = 1.0075: Po = (1.0075) - $10,000.00 = $10,075.00 by (1) and (2) 

(4) Py = 1.0075-P, = (1.0075)-$10,075.00 = $10,150.56 by (1) and (3) 

(5) P, = 1.0075-P = (1.0075)-$10,150.56 = $10,226.69 by (1) and (4) 

(6) P4 = 1.0075-P3 = (1.0075)-$10,226.69 = $10,303.39 by (1) and (5) 


Hence after one year there is $10,303.39 (to the nearest cent) in the account. 
b. The percentage increase in the value of the account, or APY, is 


10,303.39 — 10,000 
10,000 


= 0.03034 = 3.034%. E 


Recursive Definitions of Sum and Product 


Addition and multiplication are called binary operations because only two numbers can be 
added or multiplied at a time. Careful definitions of sums and products of more than two 
numbers use recursion. 


Definition 


Given numbers aj, a2,..., Ap, Where n is a positive integer, the summation from 
i = 1 to n of the a;, denoted Da a;, is defined as follows: 


tall 


a) mla iid = Il, 
=1 


1 
SG =a, and Sa; = ( 
i=l i=l 


The product from i = 1 to n of the a;, denoted IT/= |i, is defined by 


L 


n=l 


1 n 
[fea =a, and Ta = (ila) y iii > il 
i=1 i=1 i=l 


The effect of these definitions is to specify an order in which sums and products of 
more than two numbers are computed. For example, 


4 3 2 
Sa; = (Sa) +a, = (2a) +a) + dg = ((a; + a) + a3) + a4. 
i=l i=l i 


i=1 
The recursive definitions are used with mathematical induction to establish various 
properties of general finite sums and products. 


Example 5.6.9 Wukela 


Prove that for any positive integer n, if a), do,...,a, and b}, bz, . . . , b, are real numbers, then 


Sath) = Sait Sb: 
i=1 i=1 i=1 
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Solution The proof is by mathematical induction. Let the property P(n) be the equation 


n n n 
SGtb) = Sar Sb. <— P(n) 
i=1 i=1 i=1 


We must show that P(n) is true for every integer n = 1. We do this by mathematical induc- 
tion on n. 


Show that P(1) is true: To establish P(1), we must show that 


1 1 1 
(a+b) = Xat Xb: <— P(1) 
i=l i=l i=l 


Now 


1 
> (a,+ bi) =a +b; by definition of È 
i=1 


also by definition of È. 


| 
i 
+ 
1 


Hence P(1) is true. 


Show that for every integer k = 1, if P(k) is true then P(k +1) is also true: 
Suppose that k is any integer with k = 1 and that aj, a2, . . . , ay, ay, and by, bo, ..., by, Dk+1 
are real numbers such that 


k k k 
> (a; + b;) = > a;+ = bj. <— P(k) inductive hypothesis 
i=1 i=1 i=1 


We must show that 


k+l k+l k+l 
X (aitb) = Xat > bs <— P(k+1) 
i=l i=l i=l 


[We will show that the left-hand side of this equation equals the right-hand side.] 
Now the left-hand side of the equation is 


k+1 k 
> (ai + b) = >» (a; + b;) + (dg41 + bevy) by definition of = 
i=1 i=l 


k k 
= (Sa oP By n) + (Ag + bga) by inductive hypothesis 
i=1 i=1 


by the associative and 
commutative laws of algebra 


k k 
= (Satan) a (Sb 
i=l 


i=1 


k+l k+1 
= Xa+ Ýb; by definition of È 
i=l i=l 
which equals the right-hand side of the equation. [This is what was to be shown.] a 
1. A recursive definition for a sequence consists of a 2. A recurrence relation is an equation that defines 
and : each later term of a sequence by reference to 


in the sequence. 
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3. 


4. 


EXERCISE SET 5.6 


Initial conditions for a recursive definition of a 
sequence consist of one or more of the ______ of 
the sequence. 


To solve a problem recursively means to divide 
the problem into smaller subproblems of the same 
type as the initial problem, to suppose —______, 
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and to figure out how to use the supposition to 


. A crucial step for solving a problem recursively 


is to define a in terms of which the 
recurrence relation and initial conditions can be 
specified. 


Find the first four terms of each of the recursively defined 
sequences in 1-8. 


1. 


10. 


Tl. 


12. 


ay, = 2a,_, + k, for every integer k = 2 
4i = 1 


- bg = bg-1 + 3p, for every integer k = 2 


b,=1 


e. = kei). for every integer k = 1 


Co = 1 

d, = Kd) for every integer k = 1 

do =3 

Sk = Sp, + 2Sk-2, for every integer k = 2 
s= 1l, s,=1 

ty = tk—-1 + 2tg—2, for every integer k = 2 
to = =l, ti =2 


Uy = kug- — ug-2, for every integer k = 3 
uy = l; W = 1 


© Vk = Vk-1 + vg—2 + 1, for every integer k = 3 


v=l, n=3 


Let do, a), a2, . . . be defined by the formula 

an = 3n + 1, for every integer n = 0. Show that 
this sequence satisfies the recurrence relation 
a, = ay, + 3, for every integer k = 1. 


Let bo, b4, b2, . . . be defined by the formula 
b, = 4", for every integer n = 0. Show that 
this sequence satisfies the recurrence relation 
b; = 4b,_,, for every integer k = 1. 


Let co, C1, C2,... be defined by the formula 

Cn = 2" — 1 for every integer n = 0. Show that 
this sequence satisfies the recurrence relation 
Ck = 2c,—-; + 1 for every integer k = 1. 


—])" 
Let so, 51, 52,... be defined by the formula s,, = c = 


for every integer n = 0. Show that this sequence satis- 
fies the following recurrence relation for every integer 
k= 1: 


7 Sk-1 
Sp = =a. 


13. 


14. 


15. 


16. 


17. 


18. 


Let tọ, t4, t2, . . . be defined by the formula 

t, = 2 +n for every integer n = 0. Show that this 
sequence satisfies the following recurrence rela- 
tion for every integer k = 2: 


ty = 2-1 — tk-2 


Let do, di, do, . . . be defined by the formula 

d, = 3" — 2" for every integer n = 0. Show that 
this sequence satisfies the following recurrence 
relation for every integer k = 2: 


dy = Sdy—| = 6d;—2 


For the sequence of Catalan numbers defined in 
Example 5.6.4, prove that for each integer n = 1, 


C= 1 2n+2 
” 4n+2\ n+l) 


Use the recurrence relation and values for the 
Tower of Hanoi sequence mı, m2, m3, . . . discussed 
in Example 5.6.5 to compute m7 and mg. 


Tower of Hanoi with Adjacency Requirement: 
Suppose that in addition to the requirement that 
they never move a larger disk on top of a smaller 
one, the priests who move the disks of the Tower 
of Hanoi are also allowed only to move disks one 
by one from one pole to an adjacent pole. Assume 
poles A and C are at the two ends of the row and 
pole B is in the middle. Let 


the minimum number of moves 
needed to transfer a tower of n 
disks from pole A to pole C 


a. Find q, a2, and a3. b. Find ay. 
c. Find a recurrence relation for a), a2, a3,.... 
Justify your answer. 


dn = 


Tower of Hanoi with Adjacency Requirement: 
Suppose the same situation as in exercise 17. Let 


the minimum number of moves 
needed to transfer a tower of n 
disks from pole A to pole B 


bn = 
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H* e. 


19. 


20. 


21. 
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a. Find b, bo, and b3. 

b. Find b4. 

c. Show that b; = a,_; + 1 + b,_, for each inte- 
ger k = 2, where dj, do, a3, . . . is the sequence 
defined in exercise 17. 

d. Show that b} = 3b,_, + 1 for each integer 

k22. 

Show that b = 3b,_, + 1 for each integer k = 2. 


Four-Pole Tower of Hanoi: Suppose that the 
Tower of Hanoi problem has four poles in a row 
instead of three. Disks can be transferred one by 
one from one pole to any other pole, but at no time 
may a larger disk be placed on top of a smaller 
disk. Let s„ be the minimum number of moves 
needed to transfer the entire tower of n disks from 
the left-most to the right-most pole. 
a. Find Sis S2 and S3. b. Find S4. 
c. Show that s, = 2s,_, + 3 for every integer 

k 2 3. 


Tower of Hanoi Poles in a Circle: Suppose that 
instead of being lined up in a row, the three poles 
for the original Tower of Hanoi are placed in a 
circle. The monks move the disks one by one from 
one pole to another, but they may only move disks 
one over in a clockwise direction and they may 
never move a larger disk on top of a smaller one. 
Let c, be the minimum number of moves needed 
to transfer a pile of n disks from one pole to the 
next adjacent pole in the clockwise direction. 
a. Justify the inequality c, = 4c,_; + 1 for each 
integer k = 2. 
b. The expression 4c;,_, + 1 is not the minimum 
number of moves needed to transfer a pile of 
k disks from one pole to another. Explain, for 
example, why c3 # 4c) + 1. 


Double Tower of Hanoi: In this variation of the 
Tower of Hanoi there are three poles in a row and 
2n disks, two each of n different sizes, where n is 
any positive integer. Initially one of the poles con- 
tains all the disks placed on top of each other in 
pairs of decreasing size. Disks are transferred one 
by one from one pole to another, but at no time 
may a larger disk be placed on top of a smaller 
disk. However, a disk may be placed on top of one 
of the same size. Let t, be the minimum number 
of moves needed to transfer a tower of 2n disks 
from one pole to another. 

a. Find ¢, and t3. b. Find f3. 

c. Find a recurrence relation for t4, t2, f3,.... 
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22. Fibonacci Variation: A single pair of rabbits 


23. 


In 24-34, Fo, Fy, Fo). - 


24. 


25. 


26. 


27. 


(male and female) is born at the beginning of a 
year. Assume the following conditions (which are 
somewhat more realistic than Fibonacci’s): 

(1) Rabbit pairs are not fertile during their first 
months of life but thereafter give birth to four 
new male/female pairs at the end of every 
month. 

(2) No rabbits die. 


a. Let 7, = the number of pairs of rabbits alive at 
the end of month n, for each integer n = 1, and 
let ro = 1. Find a recurrence relation for ro, r4, 
r2, ... . Justify your answer. 

b. Compute ro, F1, F2, 73, F4, Fs, and rg. 

c. How many rabbits will there be at the end of 
the year? 


Fibonacci Variation: A single pair of rabbits 
(male and female) is born at the beginning of a 
year. Assume the following conditions: 

(1) Rabbit pairs are not fertile during their first 
two months of life, but thereafter give birth 
to three new male/female pairs at the end of 
every month. 

(2) No rabbits die. 


a. Lets, = the number of pairs of rabbits alive at 
the end of month n, for each integer n = 1, and 
let sọ = 1. Find a recurrence relation for so, S1, 
S2, . . . . Justify your answer. 

b. Compute So, 51, 52, S3, 54, and ss. 

c. How many rabbits will there be at the end of 
the year? 


. is the Fibonacci sequence. 


Use the recurrence relation and values for Fo, F1, 
F ,... given in Example 5.6.6 to compute F;3 and 


The Fibonacci sequence satisfies the recurrence 
relation Fg = F,_, + Fy—2, for every integer k = 2. 
a. Explain why the following is true: 
Fg+1 = Fy + Fy, for each integer k = 1. 
b. Write an equation expressing Fz+2 in terms of 
F41 and Fy; 
c. Write an equation expressing F;+3 in terms 
Fg+2 and Fg+1.- 


Prove that F} = 3F,_3 + 2Fz—4 for every integer 
k=4. 


Prove that F% — Fe_, = F,Fp+1 — Fy- F41, for 
every integer k = 1. 


28. 


29. 


30. 


* 31. 


H* 32. 


33. 


H 34. 


H* 35. 


36. 


37. 


38. 


Prove that Fea E F = Fi = 2F,F,_, for each 
integer k = 1. 


Prove that Fg = Fe = F,_,F +42, for every integer 
k= 1. 


Use mathematical induction to prove that for each 
integer n = 0, Fy42F a — Foti = (~ 1)". 
Use strong mathematical induction to prove that 


F, < 2” for every integer n = 1. 


Prove that for each integer n = 0, gcd(F,4.1, Fp) = 1. 
(The definition of gcd is given in Section 4.10.) 


It turns out that the Fibonacci sequence satisfies 
the following explicit formula: For every integer 
Fa = 0, 


Pp 1 (E eon 
ae 2 2 l 


Verify that the sequence defined by this formula 
satisfies the recurrence relation Fy = Fọ—1 + Fy-2 
for every integer k = 2. 


(For students who have studied calculus) Find 


[Fn ; denis sone 
tim(“E4), assuming that the limit exists. 


ne 


(For students who have studied calculus) Prove 
F 

that lim(2=) exists. 

(For students who have studied calculus) Define 


Xo, Xj, X2, . . - as follows: 


xX, = V2+x,-, for each integer k = 1 
xo = 0. 


Find lim, _,.. x,. (Assume that the limit exists.) 


Compound Interest: Suppose a certain amount of 

money is deposited in an account paying 4% an- 

nual interest compounded quarterly. For each posi- 

tive integer n, let R, = the amount on deposit at 

the end of the nth quarter, assuming no additional 

deposits or withdrawals, and let Rọ be the initial 

amount deposited. 

a. Find a recurrence relation for Ro, Ry, Ro,.... 
Justify your answer. 

b. If Ro = $5,000, find the amount of money on 
deposit at the end of one year. 

c. Find the APY for the account. 


Compound Interest: Suppose a certain amount of 
money is deposited in an account paying 3% an- 
nual interest compounded monthly. For each 


39. 


40. 


41. 


42. 
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positive integer n, let S, = the amount on deposit 

at the end of the nth month, and let Sọ be the ini- 

tial amount deposited. 

a. Find a recurrence relation for So, S1, S2,..., 
assuming no additional deposits or withdraw- 
als during the year. Justify your answer. 

b. If Sọ = $10,000, find the amount of money on 
deposit at the end of one year. 

c. Find the APY for the account. 


With each step you take when climbing a stair- 
case, you can move up either one stair or two 
stairs. As a result, you can climb the entire 
staircase taking one stair at a time, taking two at a 
time, or taking a combination of one- and two- 
stair increments. For each integer n = 1, if the 
staircase consists of n stairs, let c,, be the number 
of different ways to climb the staircase. Find a 
recurrence relation for c), C2, C3, . . . . Justify your 
answer. 


A set of blocks contains blocks of heights 1, 2, 
and 4 centimeters. Imagine constructing towers by 
piling blocks of different heights directly on top 
of one another. (A tower of height 6 cm could be 
obtained using six 1-cm blocks, three 2-cm blocks 
one 2-cm block with one 4-cm block on top, one 
4-cm block with one 2-cm block on top, and so 
forth.) Let t, be the number of ways to construct 

a tower of height n cm using blocks from the set. 
(Assume an unlimited supply of blocks of each 
size.) Find a recurrence relation for t, t2, f3,.... 
Justify your answer. 


Assume the truth of the distributive law (Ap- 
pendix A, F3), and use the recursive definition of 
summation, together with mathematical induction, 
to prove the generalized distributive law that for 
every positive integer n, if aj, do,..., A, and c are 
real numbers, then 


Assume the truth of the commutative and as- 
sociative laws (Appendix A, F1 and F2), and use 
the recursive definition of product, together with 
mathematical induction, to prove that for every 
positive integer n, if aj, do,..., a, and bj, bo,..., 
b,, are real numbers, then 
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43. Assume the truth of the commutative and as- and mathematical induction to prove that for each 
sociative laws (Appendix A, F1 and F2), and use positive integer n, if a), a2, . . . , a, are real numbers, 
the recursive definition of product, together with then 
mathematical induction, to prove that for each i i 
positive integer n, if a), a2, . . ., a, and c are real Xa = > | a;| 

i=l i=l 


numbers, then 
n ( n ) 45. Prove that any sum of even integers is even. 
n 


46. Prove that any sum of an odd number of odd 


H 44. The triangle inequality for absolute value states that ine ects iodi, 


for all real numbers a and b, |a+b| = |a| + |b]. H 47. Deduce from exercise 46 that for any positive 
Use the recursive definition of summation, the integer n if there is a sum of n odd integers that is 
triangle inequality, the definition of absolute value, even, then n is even. 


ANSWERS FOR TEST YOURSELF 


1. recurrence relation; initial conditions 2. earlier terms 3. values of the first few terms 4. that the smaller subproblems 


have already been solved; solve the initial problem 5. sequence 


LWA Solving Recurrence Relations by Iteration 


The keener one’s sense of logical deduction, the less often one makes hard and fast 
inferences. —Bertrand Russell, 1872-1970 


Suppose you have a sequence that satisfies a certain recurrence relation and initial condi- 
tions. It is often helpful to know an explicit formula for the sequence, especially if you need 
to compute terms with very large subscripts or if you need to examine general properties 
of the sequence. Such an explicit formula is called a solution to the recurrence relation. In 
this section, we discuss methods for solving recurrence relations. For example, in the text 
and exercises of this section, we will show that the Tower of Hanoi sequence of Example 
5.6.5 satisfies the formula 
m, = 2"-1, 
and that the compound interest sequence of Example 5.6.7 satisfies 


A, = (1.04)"+ $100,000. 


The Method of Iteration 


The most basic method for finding an explicit formula for a recursively defined sequence 
is iteration. Iteration works as follows: Given a sequence do, 44, a2, . . . defined by a recur- 
rence relation and initial conditions, you start from the initial conditions and calculate suc- 
cessive terms of the sequence until you see a pattern developing. At that point you guess 
an explicit formula. 


Finding an Explicit Formula 


Let do, a), a2, . . . be the sequence defined recursively as follows: For each integer k = 1, 


(1) ay = ay, +2 recurrence relation 


(2) a=l initial condition. 


Use iteration to guess an explicit formula for the sequence. 
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Tip Do no arithmetic 

except 

e replace n-1 and 1-n 
by n, 

e reformat repeated 
numbers, 


e eliminate parentheses. 
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Solution Recall that to say 
a, = ak- +2 foreach integer k = 1 


means 


ag = agı +2 no matter what positive integer is 
placed into the box 


In particular, 


a, = a +2, 
ay =a,+2, 
a3 = a T 2, 


and so forth. Now use the initial condition to begin a process of successive substitutions 
into these equations, not just of numbers (as was done in Section 5.6) but of numerical 
expressions. 

The reason for using numerical expressions rather than numbers is that in these prob- 
lems you are seeking a numerical pattern that underlies a general formula. The secret 
of success is to leave most of the arithmetic undone. However, you do need to eliminate 
parentheses as you go from one step to the next. Otherwise, you will soon end up with a 
bewilderingly large nest of parentheses. Also, it is nearly always helpful to use shorthand 
notations for regrouping additions, subtractions, and multiplications of numbers that re- 
peat. Thus, for instance, you would write 


5:2 instead of 2+2+2+2+2 
and 25 instead of 2:2-2-2-2. 


Notice that you don’t lose any information about the number patterns when you use these 
shorthand notations. 
Here’s how the process works for the given sequence: 


ag= 1 the initial condition 
aq, =ap9+2=14+2 by substitution 
ee 
pa 
a =a; +2=(1+2)+2 =1+2+2 eliminate parentheses 
n 
pm 
a3 =a+2=(1+2+2)+2 =1]1+2+2+2 eliminate parentheses again; write 
EA 


3-2 instead of 2 + 2 + 2? 


e mwm 


gi 
a4 =a3+ 2 =(1+2+2+2)+2=1+2+2+2+2 © eliminate parentheses again; 
definitely write 4- 2 instead of 
2 +2 + 2 + 2—the length of the 
string of 2’s is getting out of hand. 


Since it appears helpful to use the shorthand k-2 in place of 2 +2 +: -+ 2 (k times), we 
do so, starting again from ap. 
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aj =1 =1+0-2 _ the initial condition 
a; =aj+ 2 =14 2 =1+4+1-2 by substitution 
—— 


\ 


SS 
airt 2=(14+2)+2 =1+2-2 
SS 
=—_—_ 
a3 =a2+2=(14+2-2)+2=14+3-2 
—S—4 
n 
a4 = a3 + 2 = (1+ 3-2) +2 =14+4-2 Atthis point it certainly seems likely that 
the general pattern is 1 + n- 2; check 


(eat whether the next calculation supports this. 


a 
as =a4+ 2 =(1+4-2)+2=1+5-2_  Ttdoes! So go ahead and write an answer. 
It’s only a guess, after all. 


a2 


Guess: a, =1+n-2 = 1+ 2n for every integer n. 


The answer obtained for this problem is just a guess. To be sure of the correctness of this 
guess, you will need to check it by mathematical induction. Later in this section, we will 
show how to do this. | 


A sequence like the one in Example 5.7.1, in which each term equals the previous term 
plus a fixed constant, is called an arithmetic sequence. In the exercises at the end of this 
section you are asked to show that the nth term of an arithmetic sequence always equals 
the initial value of the sequence plus n times the fixed constant. 


Definition 


A sequence do, 44, 4), . . . is called an arithmetic sequence if, and only if, there is a 
constant d such that 


a, = aķ-ı +d foreach integer k = 1. 


It follows that 


a, =aọ+dn for every integer n = 0. 


See wa An Arithmetic Sequence 


Under the force of gravity, an object falling in a vacuum falls about 9.8 meters per second 
(m/sec) faster each second than it fell the second before. Thus, neglecting air resistance, a 
skydiver’s speed upon leaving an airplane is approximately 9.8 m/sec one second after de- 
parture, 9.8 + 9.8 = 19.6 m/sec two seconds after departure, and so forth. If air resistance 
is neglected, how fast would the skydiver be falling 60 seconds after leaving the airplane? 


Solution Let s, be the skydiver’s speed in m/sec n seconds after exiting the airplane, 
assuming there is no air resistance. Then sọ is the initial speed, and since the diver would 
travel 9.8 m/sec faster each second than the second before, 


Sg = Sp-1 + 9.8 m/sec for every integer k = 1. 
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It follows that so, 5), S2, . . . is an arithmetic sequence with a fixed constant of 9.8, and thus 
Sn = So +(9.8)n for each integer n = 0. 


Hence sixty seconds after exiting and neglecting air resistance, the skydiver would travel 
at a speed of 


S60 = 0+ (9.8)(60) = 588 m/sec. 


Now 588 m/sec is over half a kilometer per second, which is very fast for a human being 
to travel. Fortunately for the skydiver, taking air resistance into account reduces the speed 
considerably. E 


In an arithmetic sequence, each term equals the previous term plus a fixed constant. In 
a geometric sequence, each term equals the previous term times a fixed constant. Geomet- 
ric sequences arise in a large variety of applications, such as compound interest, certain 
models of population growth, radioactive decay, and the number of operations needed to 
execute certain computer algorithms. 


eee The Explicit Formula for a Geometric Sequence 


Let r be a fixed nonzero constant, and suppose a sequence do, 41, 42, . .. is defined recur- 
sively as follows: 


a, = ra,_, foreach integer k = 1, 
ay = a. 
Use iteration to guess an explicit formula for this sequence. 


Solution ao = 


a; = rao = ra 


Naan oo 
N 

2 

d} = ra; —=r\ra)=— ra 
2 1 (ra) 

g= 

2 3 

az = ra, =r (ra) = ra 
3 2 (r"a) 

a 
Ga 


a4 = ra; =r (rea) = rta 


n n 


Guess: a, =r"a=ar" for any arbitrary integer n > 0 


In the exercises at the end of this section, you are asked to prove that this formula is 
correct. a 


Definition 


A sequence do, a), d>,...1s called a geometric sequence if, and only if, there is a 
constant r such that 


a, = ra,-, for each integer k = 1. 
It follows that 


a, = aor" for each integer n = 0. 
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See eee A Geometric Sequence 


As shown in Example 5.6.7, if a bank pays interest at a rate of 4% per year compounded an- 
nually and A, denotes the amount in the account at the end of year n, then A, = (1.04) Ay_}, 
for each integer k = 1, assuming no deposits or withdrawals during the year. Suppose the 
initial amount deposited is $100,000, and assume that no additional deposits or withdraw- 
als are made. 


a. How much will the account be worth at the end of 21 years? 


b. In how many years will the account be worth $1,000,000? 


Solution 


a. Ao, Aj, A2, . . . is a geometric sequence with initial value 100,000 and constant multi- 
plier 1.04. Hence, 


A, = $100,000-(1.04)" for every integer n = 0. 
After 21 years, the amount in the account will be 
A>; = $100,000: (1.04)! = $227,876.81. 


This is the same answer as that obtained in Example 5.6.7 but is computed much more 
easily (at least if a calculator with a powering key, such as [A] or |x’ |, is used). 


b. Let t be the number of years needed for the account to grow to $1,000,000. Then 
$1,000,000 = $100,000: (1.04). 


Dividing both sides by 100,000 gives 
10 = (1.04, 
and taking natural logarithms of both sides results in 


Note Properties of loga- i 
rithms are reviewed in In(10) = In(1.04). 


Section 7.2. 
Then 


In(10) = ¢1n(1.04 because log, (x^) = a log, (x) (see 
n(10) n(1:04) exercise 35 of Section 7.2) 


and so 
Indo 
In(1.04) 
Hence the account will grow to $1,000,000 in approximately 58.7 years. E 


An important property of a geometric sequence with constant multiplier greater 
than 1 is that its terms increase very rapidly in size as the subscripts get larger and 
larger. For instance, the first ten terms of a geometric sequence with a constant multi- 
plier of 10 are 


1, 10, 10%, 10°, 10°. 10%, 106, 10", 108, 10°. 


Thus, by its tenth term, the sequence already has the value in = 1,000,000,000 = 1 billion. 
The following box indicates some quantities that are approximately equal to certain powers 
of 10. 
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10’ = number of seconds in a year 

10° = number of bytes of memory in a personal computer 

10'' = number of neurons in a human brain 

10'’ = age of the universe in seconds (according to one theory) 


10°! = number of seconds to process all possible positions of a checkers game if 
moves are processed at a rate of | per billionth of a second 


10°! = number of atoms in the universe 


10''' = number of seconds to process all possible positions of a chess game if moves 
are processed at a rate of 1 per billionth of a second 


Using Formulas to Simplify Solutions Obtained by Iteration 


Explicit formulas obtained by iteration can often be simplified by using formulas such as 
those developed in Section 5.2. For instance, according to the formula for the sum of a geo- 
metric sequence with initial term 1 (Theorem 5.2.2), for each real number r except r = 1, 
n+l 1 
l+rt+r+---+r" = —— for every integer n = 0. 


r=, 

And according to the formula for the sum of the first n integers (Theorem 5.2.1), 

—_ nant il) 
2 


An Explicit Formula for the Tower of Hanoi Sequence 


P23 ern for every integer n = 1. 


Recall that the Tower of Hanoi sequence m, m2, m3,...of Example 5.6.5 satisfies the 
recurrence relation 


my = 2my-, +1 for each integer k = 2 
and has the initial condition 
mM, = l. 


Use iteration to guess an explicit formula for this sequence, and make use of a formula 
from Section 5.2 to simplify the answer. 


Solution By iteration, 


m=1 
mo=2m;+1=2:1+1 = 20 +1, 
"a 
aS 
Mg = 2m. + 1=2(24+1) +1 = 29 4+2+1, 
r 
pA 
p 
m@ = 2m3 + 1= 2(2 +2+1) +1 =20+2+2+1, 
a 


p eal 


Se 
me = 2mgt 1= 2(Ż +2 + 241) + 1= 294 24+ 274+ 241. 
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These calculations show that each term up to ms is a sum of successive powers of 2, start- 
A ing with 2° = 1 and going up to 2%, where k is 1 less than the subscript of the term. The 
pattern would seem to continue to higher terms because each term is obtained from the 
preceding one by multiplying by 2 and adding 1; multiplying by 2 raises the exponent of 
each component of the sum by 1, and adding 1 adds back the 1 that was lost when the previ- 
ous | was multiplied by 2. For instance, for n = 6, 


Caution! Be careful 
when you use the distribu- 
tive law. For instance, 


202141 ¢24141 

because me = 2ms +1 = (2+2 +27 +241) t¢1 = 2°4+274+27 4274241, 
2(2+1)+1=2:3+1=7, 

whereas Thus it seems that, in general, 


Y+1+1=44+2=6. 
m, = 27142" 740-42 4241, 


By the formula for the sum of a geometric sequence (Theorem 5.2.2), 


n 


tg grt e 22 S aL 


Hence the explicit formula seems to be 


m, =2"—1 for every integer n = 1. m 


Using Recursion to Compute the Number of Edges of K, 


In Example 4.9.9 the handshake theorem was used to prove that the number of edges of K, is 
=l : : : ieee ees as 
m 1 and in exercise 31 of Section 5.3 you were asked to prove it using mathematical in- 
duction. This result can also be obtained using recursion and the formula for the sum of the 


first n positive integers. Observe that the first few values of K,, can be pictured as follows: 


e e o FAT AI 
Ky K3 Ky 


Ky 


You can obtain K; from K4 by adding one new vertex and drawing four new edges, one 
each between the new vertex and each vertex of K4. 


New vertex 


Thus 
the number of edges of K; = the number of edges of K, + the 4 new edges. 


By the same reasoning, if for each integer n = 1, s,, is the number of edges of K, then 


Sk = Sk—-1 + (k— 1) for each integer k = 2 recurrence relation 


sı = the number of edges in K, = 0 initial condition. 


Use iteration to find an explicit formula for 5), S2, 53,.... 
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Solution Because 


Sp = Sy- +(k— 1) for each integer k = 2 


and oa 
s0 = Om 

then, in particular, ee ea 
sg =H tl= 010 


i 
sog =s +2=(0+1)+2=0+1+; 
— 


E we 
sa = 93 +3=(004142)4+3=0414246; 
— mm 
weet 


———S Sas, 
sg =94t+4= (04+14243)4+4=041424344, 


Guess: sa =0-+14+2+---+@—-D) 


By Theorem 5.2.1, 


(n—1)n  n(n-1) 


(iste Ga, z 


Hence it appears that 


L n(n— 1) 


Sh 5 for every integer n = 1, 


which agrees with the results obtained in Sections 4.9 and 5.3. E 


Checking the Correctness of a Formula 
by Mathematical Induction 


As you can see from some of the previous examples, the process of solving a recurrence re- 
lation by iteration can involve complicated calculations. It is all too easy to make a mistake 
and come up with the wrong formula. That is why it is important to confirm your calcula- 
tions by checking the correctness of your formula. The most common way to do this is to 
use mathematical induction. 


ZCEWJGENAA Using Mathematical Induction to Verify the Correctness of a Solution 
to a Recurrence Relation 


In Example 5.6.5 we obtained a formula for the Tower of Hanoi sequence. Use mathemati- 
cal induction to show that this formula is correct. 


Solution What does it mean to show the correctness of a formula for a recursively de- 
fined sequence? Given a sequence of numbers that satisfies a certain recurrence relation 
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and initial condition, the job is to show that each term of the sequence satisfies the pro- 
posed explicit formula. In this case, you need to prove the following statement: 


If mı, m2, m3, ...is the sequence defined by 
my = 2m, + 1 for each integer k = 2, and 
mı = 1; 


then m, = 2” — 1 for every integer n = 1. 


Proof of Correctness: Let mı, m, m3,...be the sequence defined by specifying that 
m; = 1 and m = 2m_,+ 1 for each integer k = 2, and let the property P(n) be the 
equation 


m, = 2"—-1, <— P(n) 
We will use mathematical induction to prove that for every integer n = 1, P(n) is true. 


Show that P(1) is true: 
To establish P(1), we must show that 


m=2'-1. P0) 
Now the left-hand side of P(1) is 
m=1 by definition of mı, m», m3,.-- 
and the right-hand side of P(1) is 
2!—-1=2-1=1. 
Thus the two sides of P(1) equal the same quantity, and hence P(1) is true. 


Show that for every integer k = 1, if P(k) is true then P(k + 1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 1. That is:] 
Suppose that k is any integer with k = 1 such that 


mg = zk =f; <— P(k) inductive hypothesis 
[We must show that P(k + 1) is true. That is:] We must show that 
m = 21-1.  Pk+1) 


But the left-hand side of P(k + 1) is 


M+) = 2M K+ 1)-1 Tl by definition of m,, mz, m3,... 


= 2m, +1 

= 2(2* —-1)+1 by substitution from the inductive hypothesis 

= 2194] by the distributive law and the fact that 2-2" = 2*+! 
= ae =| by basic algebra 


which equals the right-hand side of P(k + 1). [Since the basis and inductive steps have been 
proved, it follows by mathematical induction that the given formula holds for every integer 
n=1.] E 


Discovering That an Explicit Formula Is Incorrect 


The following example shows how the process of trying to verify a formula by mathemati- 
cal induction may reveal a mistake. 
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Using Verification by Mathematical Induction to Find a Mistake 
Let co, C1, C2, . . . be the sequence defined as follows: 
Cy = 2c,-,; +k for each integer k = 1, 
co = I. 
Suppose your calculations suggest that co, cj, C2,... satisfies the following explicit 
formula: 
C, = 2" +n for every integer n = 0. 


Is this formula correct? 


Solution Start to prove the statement by mathematical induction and see what develops. 
The proposed formula satisfies the basis step of the inductive proof since on the one hand, 
Co = 1 by definition and on the other hand, 2+0=14+0=1. 

In the inductive step, you suppose that k is any integer with k = 0 such that 


Ge = gk +k, This is the inductive hypothesis. 


and then you must show that 
Copy = B+ (k41). 


To do this, you start with c,,,, substitute from the recurrence relation, and use the induc- 
tive hypothesis: 


Ceey = 2c t (K+ 1) by the recurrence relation 

= 2(2*+k)+(k+1) by substitution from the inductive hypothesis 

= 2¢*D43¢4 4 by basic algebra. 
To finish the verification, therefore, you need to show that 

Wel 3k+1 = 214 (k+1). 
Now this equation is equivalent to 
2k =0 by subtracting 2**! + k+ 1 from both sides 
which is equivalent to 
k=0 by dividing both sides by 2. 
But this is false since k may be any nonnegative integer. For instance, when k = 1, then 
k+1 = 2, and 
Co =2:3+2=8 whereas 27+2=4+2=6. 


So the formula does not give the correct value for c). Hence the sequence co, C1, C2,..- 
does not satisfy the proposed formula. a 


Once you have found a proposed formula to be false, you should look back at your cal- 
culations to see where you made a mistake, correct it, and try again. 


TEST YOURSELF 


1. To use iteration to find an explicit formula for a 2. At every step of the iteration process, it is impor- 
recursively defined sequence, start with the tant to eliminate 
and use successive substitution into the 
look for a numerical pattern. 


to 


3. Ifa single number, say a, is added to itself k times 
in one of the steps of the iteration, replace the sum 
by the expression 
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4. Ifa single number, say a, is multiplied by itself k 
times in one of the steps of the iteration, replace 
the product by the expression —____ 


5. A general arithmetic sequence do, 41, d2,... with 
initial value ap and fixed constant summand d 
satisfies the recurrence relation and has the 
explicit formula 


EXERCISE SET 5.7 


6. 


A general geometric sequence do, 41, d2,... with 
initial value ag and fixed constant multiplier r 
satisfies the recurrence relation and has the 
explicit formula 


. When an explicit formula for a recursively defined 


sequence has been obtained by iteration, its cor- 
rectness can be checked by ____. 


1. The formula 
n(n + 1) 
2 


is true for every integer n = 1. Use this fact to 

solve each of the following problems: 

a. Ifkis an integer and k = 2, find a formula for 
the expression 1+2+3+---+(k-—1). 

b. If nis an integer and n = 1, find a formula for 
the expression 5+2+4+6+8+---+2n. 

c. Ifnis an integer and n = 1, find a formula for 
the expression 3+ 3:24+3-3+:::+3:n+n, 


1+24+34-4+n= 


2. The formula 
r 
ltrtr te tm 


is true for every real number r except r = 1 and 

for every integer n = 0. Use this fact to solve each 

of the following problems: 

a. Ifiis an integer and i = 1, find a formula for 
the expression | + 2 + 274.4201, 

b. If is an integer and n = 1, find a formula for 
the expression 3”. 1+ 3" 7+---+37+341. 

c. Ifnis an integer and n = 2, find a formula for 
the expression 2” + 2”-7-3 +2" 3-3 +--+ 
2°-34+2-34+3. 

d. If is an integer and n = 1, find a formula for 
the expression 


gn — gn lg gn 2 _ on 34. t(j) 1-2 + (- 1)". 


In each of 3-15 a sequence is defined recursively. Use 
iteration to guess an explicit formula for the sequence. 
Use formulas from Section 5.2 to simplify your answers 
whenever possible. 


3. dy = kay, for each integer k = 1 
ag = 1 
by 
4. b= Tebe for each integer k = 1 
bo =1 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


10. 


Tl. 


12. 


13. 


* 14. 


15. 


16. 


17. 


18. 


19. 


< Ck = 3cg—1 + I, for each integer k = 2 


c&=1 


. dą = 2d,_, + 3, for each integer k = 2 


d =2 


. ep = 4e,_, +5, for each integer k = 1 


e= 2 


» fe=Se-1 + 25, for each integer k = 2 


fi=l 


Sk-1 


» 8k=e FD for each integer k = 2 


ail 


h; = Qk h,—,, for each integer k = 1 
ho =]1 


Dk = Pe_1 + 2-3" for each integer k = 2 
pi =2 


Sg = Sp + 2k, for each integer k = 1 
So = 3 


tk = t.-1 + 3k + 1, for each integer k = 1 
to = (0) 


Xk = 3x,-1 + k, for each integer k = 2 
x= 1 


Yk = Ye-1 +, for each integer k = 2 
yal 


Solve the recurrence relation obtained as the an- 
swer to exercise 17(c) of Section 5.6. 


Solve the recurrence relation obtained as the an- 
swer to exercise 21(c) of Section 5.6. 


Suppose d is a fixed constant and do, 44, do,... 

is a sequence that satisfies the recurrence relation 
a, = a, +d, for each integer k = 1. Use math- 
ematical induction to prove that a, = dj + nd, for 
every integer n = 0. 


A worker is promised a bonus if he can increase 
his productivity by 2 units a day every day for a 


period of 30 days. If on day 0 he produces 
170 units, how many units must he produce on day 
30 to qualify for the bonus? 


20. A runner targets herself to improve her time on a 
certain course by 3 seconds a day. If on day 0 she 
runs the course in 3 minutes, how fast must she 
run it on day 14 to stay on target? 


21. Suppose r is a fixed constant and dp, a, dz... is 
a sequence that satisfies the recurrence relation 
a, = ray—,, for each integer k = 1 and ap = a. Use 
mathematical induction to prove that a, = ar”, for 
every integer n = 0. 


22. As shown in Example 5.6.8, if a bank pays interest 
at a rate of i compounded m times a year, then the 
amount of money P, at the end of k time periods 
(where one time period = 1/mth of a year) satis- 
fies the recurrence relation P, = [1 + (i/m)] P,—| 
with initial condition Py = the initial amount 
deposited. Find an explicit formula for P,,. 


23. Suppose the population of a country increases 
at a steady rate of 3% per year. If the population 
is 50 million at a certain time, what will it be 
25 years later? 


24. A chain letter works as follows: One person sends 
a copy of the letter to five friends, each of whom 
sends a copy to five friends, each of whom sends 
a copy to five friends, and so forth. How many 
people will have received copies of the letter after 
the twentieth repetition of this process, assuming 
no person receives more than one copy? 


25. A certain computer algorithm executes twice 
as many operations when it is run with an input 
of size k as when it is run with an input of size 
k— 1 (where kis an integer that is greater than 1). 
When the algorithm is run with an input of size 1, 
it executes seven operations. How many opera- 
tions does it execute when it is run with an input 
of size 25? 


26. A person saving for retirement makes an initial 
deposit of $1,000 to a bank account earning inter- 
est at a rate of 3% per year compounded monthly, 
and each month she adds an additional $200 to the 
account. 

a. For each nonnegative integer n, let A,, be the 
amount in the account at the end of n months. 
Find a recurrence relation relating A; to Ay_}. 
H b. Use iteration to find an explicit formula for A,,. 
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c. Use mathematical induction to prove the cor- 
rectness of the formula you obtained in part (b). 
d. How much will the account be worth at the 
end of 20 years? At the end of 40 years? 
He. In how many years will the account be worth 
$10,000? 


27. A person borrows $3,000 on a bank credit card at 
a nominal rate of 18% per year, which is actually 
charged at a rate of 1.5% per month. 

Ha. What is the annual percentage yield (APY) for 
the card? (See Example 5.6.8 for a definition 
of APY.) 

b. Assume that the person does not place any ad- 
ditional charges on the card and pays the bank 
$150 each month to pay off the loan. Let B,„ be 
the balance owed on the card after n months. 
Find an explicit formula for B,,. 

Hc. How long will be required to pay off the debt? 

d. What is the total amount of money the person 
will have paid for the loan? 


In 28-42 use mathematical induction to verify the cor- 
rectness of the formula you obtained in the referenced 
exercise. 


28. Exercise 3 29. Exercise 4 30. Exercise 5 
31. Exercise 6 32. Exercise 7 33. Exercise 8 


34. Exercise 9 H 35. Exercise 10 36. Exercise 11 


H 37. Exercise 12 38. Exercise 13 39. Exercise 14 


40. Exercise 15 41. Exercise 16 42. Exercise 17 


In each of 43-49 a sequence is defined recursively. (a) Use 
iteration to guess an explicit formula for the sequence. 
(b) Use strong mathematical induction to verify that the 
formula of part (a) is correct. 
43. a, = ree for each integer k = 1 
dy = 2 
44. b, = a for each integer k = 2 
bı =1 
45. Ve = Vk/2 + Ve+1y/2\t 2, for each integer k = 2 
y= 1 


H 46. s, = 25,2, for each integer k = 2 


So = 1,5, =2 


47. ty = k—t,_,, for each integer k = 1 
to = (0) 


H 48. wg = wg—2 + k, for each integer k = 3 


w, =l,wm, =2 
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494 uy = Uk—2'Uk—1, for each integer k = 2 1 1P 

Ug = uy = 2 H 53. Compute | 1 i for small values of n (up to 

In 50 and 51 determine whether the given recursively a about 5 or 6). Conjecture explicit formulas for the 

fined sequence satisfies the explicit formula a, = (n — 1), entries in this matrix, and prove your conjecture 

for every integer n = 1. using mathematical induction. 

50. ag = 2a, + k~ 1, for each integer k = 2 54. In economics the behavior of an economy from one 
a,=0 period to another is often modeled by recurrence 

51. a, = 4a,_,—k + 3, for each integer k = 2 relations. Let Y, be the income in period k and C; be 
a, 20 the consumption in period k. In one economic mod- 


el, income in any period is assumed to be the sum 
of consumption in that period plus investment and 
government expenditures (which are assumed to be 
constant from period to period), and consumption in 
each period is assumed to be a linear function of the 
income of the preceding period. That is, 


52. A single line divides a plane into two regions. 
Two lines (by crossing) can divide a plane 
into four regions; three lines can divide it 
into seven regions (see the figure). Let P, 
be the maximum number of regions into which 


n lines divide a plane, where n is a positive 
integer. Y, = C+E where E is the sum 


of investment plus 
government expenditures 


Line 3 C = c+myY,-; where c and m are constants. 


Substituting the second equation into the first 

gives Y; = E + c + mY;—ı. 

a. Use iteration on the above recurrence relation 
Line 2 to obtain 


n 


Line 1 


m’—1 7 
Y, = (E+c) i +m Yo 
a. Derive a recurrence relation for P, in terms of 
Py, for each integer k = 2. 


b. Use iteration to guess an explicit formula for P,,. 


for every integer n = 1. 
b. (For students who have studied calculus) Show 
that if 0 < m < 1, then limY, = FE 


l-m 


ANSWERS FOR TEST YOURSELF 


1. initial conditions; recurrence relation 2. parentheses 3.k-a 4. a’ 5. a, = 4-1 t+ dia,=agtdn 6. ay = rag]; 


a, = aor" 7. mathematical induction 


EJ Second-Order Linear Homogeneous Recurrence 
Relations with Constant Coefficients 


Genius is 1% inspiration and 99% perspiration. —Thomas Alva Edison, 1932 


In Section 5.7 we discussed finding explicit formulas for recursively defined sequences 
using iteration. This is a basic technique that does not require any special tools beyond 
the ability to discern patterns. In many cases, however, a pattern is not readily discern- 
ible and other methods must be used. A variety of techniques are available for finding 
explicit formulas for special classes of recursively defined sequences. The method ex- 
plained in this section is one that works for the Fibonacci and other similarly defined 
sequences. 
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Definition 


A second-order linear homogeneous recurrence relation with constant coeffi- 
cients is a recurrence relation of the form 


a; = Aa,-,+ Baz-2 for every integer k = some fixed integer, 


where A and B are fixed real numbers with B + 0. 


“Second-order” refers to the fact that the expression for a, contains the two previous 
terms a,_, and ay_>, “linear” to the fact that a,_, and ag-z appear in separate terms and 
to the first power, “homogeneous” to the fact that the total degree of each term is the same 
(thus there is no constant term), and “constant coefficients” to the fact that A and B are 
fixed real numbers that do not depend on k. 


Second-Order Linear Homogeneous Recurrence Relations 
with Constant Coefficients 


State whether each of the following is a second-order linear homogeneous recurrence rela- 
tion with constant coefficients: 


a. ay = 3ay—1 + 2ay_ b. by = bg-1 + bg- + Dy-3 

c. Ck = 3Ch—1 era d. dy = dey + dys dy—2 

e€. ep = 2eg—2 f. fk = 2f-1 +1 

g- 8k = 8k-1 t 8k-2 h. hg = (~ Dh- + (k= 1h- 
Solution 


a. Yes; A = 3 and B = 2 
b. No; not second-order 
. Yes; A = + and B = — 


Nw 


c 

d. No; not linear 

Yes; A = Oand B =2 
No; not homogeneous 
Yes; A = land B= 1 


= o rm o 


No; nonconstant coefficients a 


The Distinct-Roots Case 


Consider a second-order linear homogeneous recurrence relation with constant coefficients: 
a, = Aa,_,; + Ba,» for every integer k = 2, 5.8.1 


where A and B are fixed real numbers. Relation (5.8.1) is satisfied when each a; = 0, but 
it has nonzero solutions as well. Suppose that for some number f with t # 0, the sequence 


A 


LE ae ee ee 


satisfies relation (5.8.1). This means that each term of the sequence equals A times the 
previous term plus B times the term before that. So for each integer k = 2, 


f= AF! + Br. 
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In particular, when k = 2, the equation becomes 
P= Att+B, 
or, equivalently, 
—At—B=0. 5.8.2 


This is a quadratic equation, and the values of ¢ that make it true can be found either by 
factoring or by using the quadratic formula. 

Now work backward. Suppose t is any number that satisfies equation (5.8.2). Does the 
sequence |, ¢, P, fF, ..., t, ... satisfy relation (5.8.1)? To answer this question, multiply 
equation (5.8.2) by f? to obtain 


peep ae Apa? B= O. 


This is equivalent to 


or 


Hence the answer is yes: 1, t, É, i, ..., ¢",... satisfies relation (5.8.1). 
This discussion proves the following lemma. 


Lemma 5.8.1 


Let A and B be real numbers. A recurrence relation of the form 
a, = Aa,-, + Ba,» for every integer k = 2 5.8.1 
is satisfied by the sequence 
n 


EL EE E 


where f is a nonzero real number, if, and only if, t satisfies the equation 


P—At—B=0. 582 


Equation (5.8.2) is called the characteristic equation of the recurrence relation. 


Definition 


Given a second-order linear homogeneous recurrence relation with constant 
coefficients 


a, = Aa,-, + Ba, for every integer k = 2, Sl 
the characteristic equation of the relation is 


P—At—B=0. 582 
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Using the Characteristic Equation to Find Solutions to a Recurrence Relation 


Consider the recurrence relation that specifies that the kth term of a sequence equals the 
sum of the (k — 1)st term plus twice the (k — 2)nd term. That is, 


a, = G1; +2aq,_, for each integer k = 2. 5.8.3 

Find all sequences that satisfy relation (5.8.3) and have the form 
LE Ê, ee oe 
where ż is nonzero. 
Solution By Lemma 5.8.1, relation (5.8.3) is satisfied by a sequence 1, t, oe ne ae 
if, and only if, t satisfies the characteristic equation 
f—t-2=0. 
Since 
P—-t-2=(t-2¢+)), 

the only possible values of t are 2 and —1. It follows that the sequences 


1, 2,27,27,...,2%,... and 1,—-1,(-1)?,(-D’,...,(-D%..... 


are both solutions for relation (5.8.3) and there are no other solutions of this form. Note that 
these sequences can be rewritten more simply as 


1,2,27,2°,...,2",.... and 1,—1,1,—1,...,(—1)’,.... E 


The example above shows how to find two distinct sequences that satisfy a given 
second-order linear homogeneous recurrence relation with constant coefficients. It turns 
out that any linear combination of such sequences produces another sequence that also 
satisfies the relation. 


Lemma 5.8.2 


If ro, 71, r2,... and So, S1, S2, ... are sequences that satisfy the same second-order 
linear homogeneous recurrence relation with constant coefficients, and if C and D 
are any numbers, then the sequence do, a1, a2, . . . defined by the formula 


a, = Cr,+Ds, for every integer n = 0 


also satisfies the same recurrence relation. 


Proof: Suppose ro, F1, r2,... and So, S1, 5,... are Sequences that satisfy the same 
second-order linear homogeneous recurrence relation with constant coefficients. In 
other words, suppose that for some real numbers A and B, 


Fk = År] + Br; and Sk = As;-} + Bs,_» 5.8.4 

for every integer k = 2. Suppose also that C and D are any numbers. Let do, a), 
an, . . . be the sequence defined by 

a, = Cr, + Ds, for every integer n = 0. 5.8.5 


[We must show that ao, dj, a, ...satisfies the same recurrence relation as 
To, Fi F2, .. -and So, S1, 82,.... That is, we must show that a, = Aap- + Bay—», for every 
integer k = 2.] 


(continued on page 356) 
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Example 5.8.3 


For every integer k = 2, 
Aag—ı + Ba,_-2 = A(Cr,-) + Ds;,—1) + BC(Crp_2 + Ds,—2) by substitution from (5.8.5) 
= C(An_, + Bry_2) + D(As,y_1 + BSg—2) by basic algebra 
= Cr, + Dsk by substitution from (5.8.4) 
= a by substitution from (5.8.5). 


Hence do, a), az, . . . satisfies the same recurrence relation as ro, F1, F2, ... and So, Si, 
S2, . . . [as was to be shown]. 


Given a second-order linear homogeneous recurrence relation with constant coeffi- 
cients, if the characteristic equation has two distinct roots, then Lemmas 5.8.1 and 5.8.2 
can be used together to find a particular sequence that satisfies both the recurrence relation 
and two specific initial conditions. 


Finding the Linear Combination That Satisfies the Initial Conditions 
Find a sequence that satisfies the recurrence relation of Example 5.8.2, 

ay = ay—; +2aķ-2 for every integer k = 2, 3.8.3 
and that also satisfies the initial conditions 


a =1 and a, =8. 


Solution Consider the following sequences from Example 5.8.2. 


1,2,27,23,...,2",.... and 1,—1,1,—1,...,(—-1).... 


Both satisfy relation (5.8.3) although neither satisfies the given initial conditions. However, 
by Lemma 5.8.2, any sequence do, dj, dz, ... that satisfies the explicit formula 


a, = C-2"+ D(-1)", 5.8.6 


where C and D are numbers, also satisfies relation (5.8.3). You can find C and D so that 
do, 41, A2, ... satisfies the initial conditions specified in this example by substituting n = 0 
and n = | into equation (5.8.6) and solving for C and D: 


a = 1 =C-2°+D(-1)° 
a, =8=C-2'+D(-1)'. 
When you simplify, you obtain the system 


1=C+D 
8 = 2C—D, 


which can be solved in various ways. For instance, if you add the two equations, you get 


and so C= 3. 
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Then, by substituting into 1 = C + D, you get 
D= —2. 
It follows that the sequence dp, aj, d2,... given by 
an = 3:2” +(—2)(- 1)" = 3-2" -2(- 1)", 


for each integer n = 0, satisfies both the recurrence relation and the given initial 
conditions. a 


The techniques of Examples 5.8.2 and 5.8.3 can be used to find an explicit formula for 
any sequence that satisfies a second-order linear homogeneous recurrence relation with 
constant coefficients for which the characteristic equation has distinct roots, provided that 
the first two terms of the sequence are known. This is made precise in the next theorem. 


Theorem 5.8.3 Distinct-Roots Theorem 


Suppose a sequence do, 44, do, ... satisfies a recurrence relation 
ay = Aa, + Bay» 5.8.1 


for some real numbers A and B with B # 0 and every integer k = 2. If the charac- 
teristic equation 


f—At—B=0 5.8.2 
has two distinct roots r and s, then do, aj, a2, . . . is given by the explicit formula 
An = Cr" + Ds", 


where C and D are the numbers whose values are determined by the values ap and 
dı. 


Note: To say “C and D are determined by the values of ag and a,” means that C and D are 
the solutions to the system of simultaneous equations 


a) = Cr°+Ds° and a = Cr! + Ds!, 
or, equivalently, 
a= C+D and q = Cr+ Ds. 


In exercise 19 at the end of this section you are asked to verify that this system always has 
a solution when r # s. 


Proof: Suppose that for some real numbers A and B, a sequence ap, 44, dp, . . . satis- 
fies the recurrence relation a, = Aa,_, + Ba,_», for every integer k = 2, and sup- 
pose the characteristic equation ? —At—B = 0 has two distinct roots r and s. We 
will show that 


for each integer n = 0, a, = Cr + Ds”, 


where C and D are numbers such that 


ay = Cr°+Ds° and a, = Cr! + Ds!. 


(continued on page 358) 
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Let P(n) be the equation 
dy = Cr" + Ds". <— Pin) 


We use strong mathematical induction to prove that P(n) is true for each integer 
n = 0. In the basis step, we prove that P(0) and P(1) are true. We do this because in 
the inductive step we need the equation to hold for n = 0 and n = 1 in order to prove 
that it holds for n = 2. 


Show that P(0) and P(A) are true: The truth of P(0) and P(1) is automatic because C 
and D are exactly those numbers that make the following equations true: 


dy = Cr°+Ds° and a = Cr'+Ds'. 


Show that for every integer k = 1, if P) is true for each integer i from 0 through 
k, then P(k + 1) is also true: Suppose that k is any integer with k = 1 and for each 
integer i from 0 through k, 


a; = Cr'+Ds' inductive hypothesis. 
We must show that 
G4 = CF + DF". pkt) 
Now by the inductive hypothesis, 
ag = Cr+ Ds* and a) = Cr} +D}, 


SO 
a4, = Aa, + Bazı by definition of ag, a), a2, . - 
= A(cr + Ds*) + B(Cr ~! + Ds H by inductive hypothesis 
= C(Ark + Br!) + D(As* + Bs% 1) by combining like terms 
= Cr+! + ps! by Lemma 5.8.1. 


This is what was to be shown. 


[The reason the last equality follows from Lemma 5.8.1 is that since r and s satisfy the 


2 


characteristic equation (5.8.2), the sequences r°, r, r,...and s°, s!, s, ... satisfy the 


recurrence relation (5.8.1).] 


Remark The ¢ of Lemma 5.8.1 and the C and D of Lemma 5.8.2 and Theorem 5.8.3 are 
referred to simply as numbers. This is to allow for the possibility of complex as well as real 
number values. If both roots of the characteristic equation of the recurrence relation are 
real numbers, then C and D will be real. If the roots are complex but both ag and a, are real 
numbers, then C and D will also be real and equal to each other. 

The next example shows how to use the distinct-roots theorem to find an explicit for- 
mula for the Fibonacci sequence. 


A Formula for the Fibonacci Sequence 


The Fibonacci sequence Fo, F), F3, . . . satisfies the recurrence relation 


F, = Fg- + Fy-2 for every integer k = 2 
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with initial conditions 


Find an explicit formula for this sequence. 


Solution The Fibonacci sequence satisfies the first part of the hypothesis of the distinct- 
roots theorem since the Fibonacci relation is a second-order linear homogeneous recur- 
rence relation with constant coefficients (A = 1 and B = 1). To check that it satisfies the 
second part of the hypothesis, examine the characteristic equation 


f-t-1=0. 
By the quadratic formula, 
1+V5 
1+v1-4(-1) 2 
t = — 
2 1- v5 
2 


and so the roots are distinct. Thus it follows from the distinct-roots theorem that the 
Fibonacci sequence is given by the explicit formula 


1+V5\" 1-V5\" 
iael z5) -o( ~”) for each integer n = 0, 5.8.7 


where C and D are the numbers whose values are determined by the fact that Fy = F, = 1. 
To find C and D, write 


AS) (2 


0 
=C:1+D-1=C+D 
2 3 


| 


and 


Hais (5 (15) 


2 
- (5) r (4), 
2 2 


Thus the problem is to find numbers C and D such that 
C+D=1 
and 


(5) sà =) on 
2 2 


This may look complicated, but in fact it is just a system of two equations in two unknowns. 
In exercise 7 at the end of this section, you are asked to solve the system to show that 


Lyi a pa iN 


. 2V5 2V5 
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Substituting these values for C and D into formula (5.8.7) gives 


no (ar) Cae ae) 
no 2V5 2 2V5 9 > 


Note The numbers or, simplifying, 

(1 + V5)/2 and 

(1 — V5)/2 are related 1 (/14+V5\""! 1 /1-V5\""! 

to the golden ratio of F, = v5 2 V5 2 5.8.8 


Greek mathematics. See 


exercise 24 at the end of for each integer n = 0. Remarkably, even though the formula for F, involves V5, all of the 
a values of the Fibonacci sequence are integers. m 


The Single-Root Case 
Consider again the recurrence relation 

a, = Aa,-, + Ba,_, for every integer k = 2, 5.8.1 
where A and B are real numbers, but suppose now that the characteristic equation 

f—At—B=0 5.8.2 
has a single real root r. By Lemma 5.8.1, one sequence that satisfies the recurrence relation is 
Ler, P,a, em 

But another sequence that also satisfies the relation is 


3 
0, r, 27°, 37°,...,nr",.... 


To see why this is so, observe that since r is the unique root of ĉł-—At-B= 0, the left-hand 
side of the equation can be written in the form (t — ry, and so 


fP-At—B=(t-r? =P -2rt+/. 5.8.9 
Equating coefficients in equation (5.8.9) gives 
A=2r and B=-”. 5.8.10 
Let so, 5}, 52,... be the sequence defined by the formula 
S, = nr” foreach integer n = 0. 


Then 
Asp—1 + Bsy_2 = Alk — Dr! + B(k— 2)r? by definition 


= 2r(k i aa rk 2)? by substitution from 5.8.10 
= 2(k— 1) —(k-—2)r* 

= (2k-2-k+2)4 

= kr by basic algebra 


=S by definition. 


Thus so, S1, S2, ... satisfies the recurrence relation. This argument proves the following 
lemma. 
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Lemma 5.8.4 


Let A and B be real numbers and suppose the characteristic equation 


rP—At—B=0 


. 3 
has a single root r. Then the sequences 1, PP a and 0, 7 2 e 
nr”, ... both satisfy the recurrence relation 


ay = Aag- ar Ba,-> 


for each integer k = 2. 


Lemmas 5.8.2 and 5.8.4 can be used to establish the single-root theorem, which tells 
how to find an explicit formula for any recursively defined sequence satisfying a second- 
order linear homogeneous recurrence relation with constant coefficients for which the 
characteristic equation has just one root. Taken together, the distinct-roots and single-root 
theorems cover all second-order linear homogeneous recurrence relations with constant 
coefficients. The proof of the single-root theorem is very similar to that of the distinct- 
roots theorem and is left as an exercise. 


Theorem 5.8.5 Single-Root Theorem 


Suppose a sequence do, 44, dz, ... satisfies a recurrence relation 
ay = Aag- ar Ba,-> 


for some real numbers A and B with B # 0 and for every integer k = 2. If the charac- 
teristic equation f —At—B =Ohasa single (real) root r, then do, a), a2, . . . is given 
by the explicit formula 


Q, = Cr’ + Dnr", 


where C and D are the real numbers whose values are determined by the values of 
do and any other known value of the sequence. 


Single-Root Case 


Suppose a sequence bo, b4, b>, ... satisfies the recurrence relation 
b, = 4b,_; — 4b, for every integer k = 2, 5.8.11 
with initial conditions 
bo =1 and bh =3. 
Find an explicit formula for bo, b4, by, .... 


Solution This sequence satisfies part of the hypothesis of the single-root theorem be- 
cause it satisfies a second-order linear homogeneous recurrence relation with constant 
coefficients (A = 4 and B = —4). The single-root condition is also met because the char- 
acteristic equation 


P—4t+4=0 


has the unique root r = 2 [since P—4t+4 = (t-27). 
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It follows from the single-root theorem that bop, b1, b2, . . . is given by the explicit formula 
b, = C:2"+Dn2" for each integer n = 0, 5.8.12 


where C and D are the real numbers whose values are determined by the fact that bọ = 1 
and b, = 3. To find C and D, write 


bp = 1=C-2°+D-0-2% =C 
and b; =3 =C-2'+D-1-2! = 2C+2D. 
Hence the problem is to find numbers C and D such that 
Cc=1 
and 2C+2D = 3. 


Substitute C = | into the second equation to obtain 


2+2D = 3, 
1 
and so D=— 
2 
Now substitute C = 1 and D = 5 into formula (5.8.12) to conclude that 
n 1 n n n å 
b, =2 + a =2°| 1+ A for each integer n = 0. E 
1. A second-order linear homogeneous recurrence equation for the relation has two distinct roots r 
relation with constant coefficients is a recur- and s (which could be complex numbers), then the 
rence relation of the form for every integer sequence is given by an explicit formula of the 
k= ____, where ____. form ____ 
2. Given a recurrence relation of the form 4. If a sequence ay, do, az, . . . is defined by a second- 
ay, = Aa, + Ba,» for every integer k = 2, the order linear homogeneous recurrence relation with 


characteristic equation of the relation is constant coefficients and the characteristic equa- 
tion for the relation has only a single root 7, then 


the sequence is given by an explicit formula of the 


3. If a sequence a, a2, a3,...is defined by a second- 
order linear homogeneous recurrence relation 


; i a form ___.. 
with constant coefficients and the characteristic ci 
EXERCISE SET 5.8 
1. Which of the following are second-order linear 2. Which of the following are second-order linear 
homogeneous recurrence relations with constant homogeneous recurrence relations with constant 
coefficients? coefficients? 
a. ay = 2a- — Sag-2 a. a, = (k— l)ay_) + 2kay_2 
b. by a KBy—| + Dio b. by = —by-} + Tb 
C. Ck = 3Ck-1'Ck-2 c c= 3c} 1+1 
e= ke as d. dy = 3d¢_) + d,- 
e. Fg = Tg- — Tg 2 e. rp = Fk—1 — Órg-3 
f. Sk = 105;—2 f. Sy = Sp-1 T 10s%—2 
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. Let dp, 41, a2, ... be the sequence defined by the 
explicit formula 
an= C:2"+D for every integer n = 0, 
where C and D are real numbers. 
a. Find C and D so that ag = 1 and a, = 3. What 
is a in this case? 
b. Find C and D so that ay = 0 and a, = 2. What 
is a in this case? 


. Let bo, bı, b2, . . . be the sequence defined by the 
explicit formula 


b, = C:3" + D(-2)" for each integer n = 0, 


where C and D are real numbers. 

a. Find C and D so that bọ = 0 and b, = 5. What 
is b> in this case? 

b. Find C and D so that bọ = 3 and b, = 4. What 
is b> in this case? 


. Let do, ay, do,.. 
explicit formula 


. be the sequence defined by the 


a, = C:2"+D foreach integer n = 0, 
where C and D are real numbers. Show that for 
any choice of C and D, 

ay = 3a; — 2a,» for every integer k = 2. 
. Let bo, bi, b2, . . . be the sequence defined by the 
explicit formula 


b, = C:3"+ D(-2)" for every integer n = 0, 


where C and D are real numbers. Show that for 
any choice of C and D, 


b, = by_; + 6b, for each integer k = 2. 
. Solve the system of equations in Example 5.8.4 to 
obtain 


_14+V5 


~==V5) 
2V5 i 


e 
2V5 


and D 


In each of 8-10: (a) suppose a sequence of the form 

1, t, t°, Č, ..., t”... where t # 0, satisfies the given recur- 
rence relation (but not necessarily the initial conditions), 
and find all possible values of t: (b) suppose a sequence sat- 
isfies the given initial conditions as well as the recurrence 
relation, and find an explicit formula for the sequence. 


8. ay = 2aķ—ı + 3aķ-2, for every integer k = 2 


a= 1,a, =2 


9. by = 7b- — 10b;-2, for every integer k = 2 


by = 2, b; = 2 


10. 
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Ck = Ck—1 + 6Cy—2, for every integer k = 2 
Co = 0, c= 3 


In each of 11-16 suppose a sequence satisfies the given 
recurrence relation and initial conditions. Find an explicit 
formula for the sequence. 


Tl. 


12. 


13. 


14. 


15. 


H 16. 


17. 


18. 


19. 


20. 


H 21. 


dą = 4d—2, for each integer k = 2 


do = 1; dı = -1 

ep = 9eg—2, for each integer k = 2 

eo = 0, e; = 2 

rp = 2rk—-1 — Fk-2, for each integer k = 2 
mn=lrn=4 

Sk = —As,-1 — 45,2, for every integer k = 2 
So = 0,5, = -1 


ty = 6tr—1 — 9tz—2, for each integer k = 2 

lho = 1, ti =3 

Sg = 2Sk—1 + 25,2, for every integer k = 2 
so = l1,s,=3 

Find an explicit formula for the sequence of 
exercise 39 in Section 5.6. 


Suppose that the sequences sọ, 5), S2, . . - and to, tı, 
t2, . . . both satisfy the same second-order linear 
homogeneous recurrence relation with constant 


coefficients: 
Sk = 5Sk-1— 4Sk-2 for each integer k = 2 
te = St, —4t,-2. for each integer k = 2. 


Show that the sequence 259 + 3%, 25, + 3t, 
259 + 3ħ, . . . also satisfies the same relation. In 
other words, show that 


25, + 3t; = 5(2sz—1 + 3t,_1) a A(2s,_9 + 3t,—>) 
for each integer k = 2. Do not use Lemma 5.8.2. 


Show that if r, s, aọ, and a, are numbers with r # s, 
then there exist unique numbers C and D so that 
C+D= do 
Cr+ Ds = a}. 

Show that if r is a nonzero real number, k and m 
are distinct integers, and a; and a, are any real 
numbers, then there exist unique real numbers C 
and D so that 

C + kD = ak 

Cr” + mDr” = ay. 


Prove Theorem 5.8.5 for the case where the values 
of C and D are determined by ag and ay. 
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Exercises 22 and 23 are intended for students who are Divide the rectangle into a rectangle and a square 

familiar with complex numbers. as shown in the preceding diagram. The square 

is | unit on each side, and the rectangle has 

sides of lengths 1 and ¢ — 1. The ancient Greeks 

considered the outer rectangle to be perfectly 

a, = 2a,-;— 2a,» for every integer k = 2 proportioned (saying that the lengths of its sides 

are in a golden ratio to each other) if the ratio 

of the length to the width of the outer rectangle 

23. Find an explicit formula for a sequence bo, bı, equals the ratio of the length to the width of the 
b>, ... that satisfies inner rectangle. That is, if the number @ satisfies 

the equation 


22. Find an explicit formula for a sequence do, a, 
a, . . . that satisfies 


with initial conditions dg = 1 and a, = 2. 


by = 2by-1 — 5bĮ-2 for each integer k = 2 


with initial conditions bọ = 1 and b; = 1. o o 1 
24. The numbers * H and 1 as that appear in 1 $71 


the explicit formula for the Fibonacci sequence 
are related to a quantity called the golden ratio in 
Greek mathematics. Consider a rectangle of length 
¢ units and height 1, where d > 1. 


a. Show that if ¢ satisfies the equation above, 
then it also satisfies the quadratic equation: 
2 - 
t—-t—-1=0. 

b. Find the two solutions of ? —t—1 = 0 and call 
them Q; and œz. 

c. Express the explicit formula for the Fibonacci 
sequence in terms of hı and do. 


ANSWERS FOR TEST YOURSELF 


1. ay = Aay_; + Ba, >; 2; A and B are fixed real numbers with B #0 2.f°—At-—B=0 3.a, = Cr"+Ds", where C and D 
are real or complex numbers 4. a, = Cr" + Dnr", where C and D are real numbers 


EX] General Recursive Definitions 
and Structural Induction 


GENIE: Oh, aren’t you acquainted with recursive acronyms? I thought everybody 
knew about them. You see, “GOD” stands for “GOD Over Djinn” —which can be 
expanded as “GOD Over Djinn, Over Djinn” —and that can, in turn, be expanded 
to “GOD Over Djinn, Over Djinn, Over Djinn” —which can, in its turn, be further 
expanded.... You can go as far as you like. 

ACHILLES: But lIl never finish! 

GENIE: Of course not. You can never totally expand GOD. 

—Douglas Hofstadter, Gödel, Escher, Bach, 1979 


Sequences of numbers are not the only objects that can be defined recursively. In this sec- 
tion we discuss recursive definitions for sets and functions. We also introduce structural 
induction, which is a version of mathematical induction that is used to prove properties of 
recursively defined sets. 
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Recursively Defined Sets 


To define a set of objects recursively, you identify a few core objects as belonging to the set 
and give rules showing how to build new set elements from old. More formally, a recursive 
definition for a set consists of the following three components: 


I. Base: A statement that certain objects belong to the set. 


II. Recursion: A collection of rules indicating how to form new set objects from those 
already known to be in the set. 


III. Restriction: A statement that no objects belong to the set other than those coming 
from the base and the recursion. 


See Recursive Definition of Boolean Expressions 


Note An example of The set of Boolean expressions was introduced in Section 2.4 as “legal” expressions in- 
a “legal” expression is volving letters from the English alphabet such as p, q, and r, and the symbols ^, V, ~, 
p ^ (q V ~r), and an ex- and ( ). To make precise which expressions are legal, the set of Boolean expressions over a 
ample ofan “illegal” one general alphabet is defined recursively. 

is \ ~pqrv. 


I. Base: Each symbol of the alphabet is a Boolean expression. 


II. Recursion: If P and Q are Boolean expressions, then the following are also Boolean 
expressions: 


IAPA OPVO I©O~P Id) (P) 


III. Restriction: There are no Boolean expressions over the alphabet other than those ob- 
tained from the base and the recursion. 


Derive the fact that the following is a Boolean expression over the English alphabet 
la; Dy C5005 EV ZE 


~(PAQYV Erap). 
Solution (1) By I, p, q, and r are Boolean expressions. 
(2) By (1), IIa), and Il), p ^ q and ~r are Boolean expressions. 
(3) By (2), Hd), and I(a), (p A q) and ~r ^ p are Boolean expressions. 
(4) By (3), IO), and Il), ~(p ^ q) and (~r ^ p) are Boolean expressions. 
(5) By (4 and II(b), ~(p A q) vV (~r ^ p) is a Boolean expression. E 


Sele Recursive Definition of Parenthesis Structures 


Certain configurations of parentheses in algebraic expressions are “legal” [such as ())Q 
and (\QQJ], whereas others are not [such as )( ))) and O) J. Here is a recursive definition to 
generate the set C of legal parenthesis structures. 


I. Base: () is in C. 


II. Recursion: 
Ha) If E is in C, so is (E). 
I(b) If E and F are in C, so is EF. 


II. Restriction: No parenthesis structures are in P other than those obtained from the base 
and the recursion. 


Show that (( ))\( ) is a parenthesis structure in C. 
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Solution (1) ByL,()isinC. 
(2) By (D) and I(a), (©) is in C. 
(3) By (2), (1), and II(b), ())() is in C. 


Recursion is used to give a formal definition for the set of all strings over a finite set. E 


Recursive Definition for the Set of All Strings over a Finite Set 


Let A be any finite set. Call the elements of A characters, and define the set S of all 
strings over A as follows: 


I. Base: A is a string in S, where A denotes the null string, the “string” with no 
characters. 
II. Recursion: New strings are formed according to the following rules: 
Ila) If wis any string in S and if c is any character in A, then 
uc is a String in S, 
where uc is called the concatenation of u and c, and is obtained by appending 
c on the right of u. 


Il(b) If uis any string in S, then both the concatenation of A and u, denoted 
Au, and the concatenation of u and A, denoted uA, are defined to equal 
u. Symbolically: 

Au = uà = u. 

Il) Ifu and v are any strings in S, and if c is any character in A, then the con- 
catenation of u and vc is defined to equal the concatenation of uv and c. 
Symbolically: 

u(vc) = (uv)c. 
III. Restriction: Nothing is a string in S other than objects obtained from the base 
and the recursion. 


The base for the recursive definition of strings indicates only that one character, namely A, is 
a string. The next theorem states that each individual character in the underlying set is a string. 


Theorem 5.9.1 Characters Are Strings 


If A is a finite set and S is the set of all strings over A, then every character in A is a 
string in S. 


Proof: 

(1) Suppose c is any character in A. 

(2) By part I of the definition of string, A is a string in S. 

(3) By part H(a) of the definition of string, Ac is a string in S. 
(4) By part I of the definition of string, Ac = c. 

(5) Thus c is a string in S. 
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Proving a Property of Strings 


Suppose A is a finite set, S is the set of all strings over A, and a and b are in A. Because of 
the way the definition of string is stated, it cannot be deduced immediately that b(aa) is a 
string in S. To prove that this is true, first show that ba and (ba)a are strings in S and use 
the fact that b(aa) = (ba)a. 
Solution (1) By Theorem 5.9.1, a and b are strings in S because both a and b are in A. 
(2) By (1) and I(@), ba is a string in S because b is in S anda E A. 
(3) By (2) and IIl@), (ba)a is a string in S because ba E S anda E A. 
(4) By Il@), baa) = (baja because b and a are strings in S anda E A. 


(5) By (3) and (4), b(aa) is a string in S because it equals (ba)a, which is a 
string in S. a 


Sets of Strings with Certain Properties 


In Gödel, Escher, Bach, Douglas Hofstadter introduces the following recursively defined 
set of strings of M’s, Ps, and U’s, which he calls the MI U-system.* 


I. Base: MI is in the MI U-system. 


II. Recursion: 


II(a) If x Zis in the MI U-system, where x is a string, then x IU is in the MI U-system. 
(In other words, you can add a U to any string that ends in J. For example, since 
MTis in the system, so is MIU.) 


Il(b) If Mx is in the MI U-system, where x is a string, then M xx is in the M I U-system. 
(In other words, you can repeat all the characters in a string that follow an initial 
M. For example, if M UT is in the system, so is MUIUT) 


Ile) If x///y is in the MI U-system, where x and y are strings (possibly null), then x 
U y is also in the MI U-system. (In other words, you can replace [II by U. For 
example, if M II I is in the system, so are MIU and MUL) 


Il) Ifx UU y is in the MI U-system, where x and y are strings (possibly null), then 
x Uy is also in the MI U-system. (In other words, you can replace U U by U. For 
example, if MITU U is in the system, so is MIU.) 


III. Restriction: No strings other than those derived from I and II are in the MI U-system. 
Derive the fact that M UTU is in the M I U-system. 
Solution (1) By I, MI is in the M/U-system. 

(2) By (1) and I(b), MII is in the MI U-system. 

(3) By (2) and II(b), MITI is in the MI U-system. 

(4) By (3) and II), M UI is in the M I U-system. 

(5) By @ and II(a), M UIU is in the MI U-system. a 


Proving Properties about Recursively Defined Sets 


When a set has been defined recursively, a version of mathematical induction, called struc- 
tural induction, can be used to prove that every object in the set satisfies a given property. 


*Douglas Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid (New York: Basic Books, 1979), pp. 33-35. 
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Structural Induction for a Recursively Defined Set 


Let S be a set that has been defined recursively, and let P(x) be a property that 
objects in S may or may not satisfy. To prove that every object in S satisfies P(x), 
perform the following two steps: 


Step 1 (basis step): Show that P(@) is true for each object a in the base for S. 
Step 2 (inductive step): Show that for each x in S, if P(x) is true and if y is obtained 
from x by applying a rule from the recursion, then P(y) is true. To perform this step, 


suppose that x is an arbitrarily chosen element of § 
for which P(x) is true. 
[This supposition is the inductive hypothesis.] 
Then 
show that if y is obtained from x by applying a 
rule from the recursion for S, then P(y) is true. 


Conclusion: Because no objects other than those obtained from the base and recur- 
sion are contained in S, steps 1 and 2 prove that P(x) is true for every object x in S. 


A Property of the Set of Integers 


Let S be the set of all integers defined recursively as follows: 

I. Base: 4 is in S. 

II. Recursion: Given any integer n in S, n +3 is in S. 
MI. Restriction: No integers are in S other than those derived from the base and the recursion. 
Use structural induction to prove that for every integer n in S, n mod 3 = 1. 


Solution 


Proof (by structural induction): Given any integer n in S, let property P(n) be the sen- 
tence “n mod 3 = 1.” 


Show that P(n) is true for each integer n in the base for S: 
The only object in the base for S is 4, and P(A) is true because 4 mod3 = 1 since4 = 3-1 +1. 


Show that for each integer n in S, if P(n) is true and if m is obtained from n by applying 
arule from the recursion for S, then P(m) is true: 


Suppose n is any integer in S such that P(n) is true. Then n mod 3 = 1. [This is the induc- 
tive hypothesis.] The recursion for S$ consists only of one rule, and when the rule is applied 
to n, the result is n + 3. To complete the inductive step, we must show that P(n + 3) is true. 
By inductive hypothesis, 


n= 3k+1_ for some integer k. 
It follows that 


(n+3)mod3 = [(3k+1)+3]mod3 by substitution 
= (3k+4)mod3 by basic algebra 
= [3(k +1)+ 1] mod 3 because k + 1 is an integer, 
1 


which means that P(n + 3) is true [as was to be shown]. 
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Conclusion: 
Because there are no integers in S other than those obtained from the base and the recur- 
sion for S, every integer n in S satisfies the equation n mod 3 = 1. a 


A Property of the Set of Parentheses 


Consider the set C of all legal configurations of parentheses defined in Example 5.9.2. Use 
structural induction to prove that every configuration in C contains an equal number of left 
and right parentheses. 


Solution 


Proof (by structural induction): Given any parenthesis structure x in C, let property P(x) 
be the sentence “x has an equal number of left and right parentheses.” 


Show that P(a) is true for each parenthesis structure in the base for C: 
The only object a in the base for C is ( ), which has one left parenthesis and one right pa- 
renthesis. Since these numbers are equal, P(a) is true. 


Show that for each parenthesis structure x in C, if P(x) is true and if y is obtained from 
x by applying a rule from the recursion for C, then P(y) is true: 
The recursion for C in Example 5.9.2 consists of two rules: H(a) and II(b). 


Suppose E and F are any parenthesis structures in C such that P(E) and P(F) are 
true. In other words, EF has an equal number, say n, of left and right parentheses, and 
F has an equal number, say m, of left and right parentheses. [This is the inductive 
hypothesis.] 

When rule II(a) is applied to E, the result is (Æ), which has n + 1 left parentheses and 
n+ 1 right parentheses. Since these numbers are equal, P((E)) is true. When rule I(b) is 
applied to E and F, the result is EF, which has an equal number, namely m + n, of left and 
right parentheses. So P(EF) is true. 

Thus when the recursion rules for C are applied to parenthesis structures that have an 
equal number of left and right parentheses, the results also have an equal number of left 
and right parentheses, which completes the inductive step. 


Conclusion: 

Because there are no parenthesis structures in C other than those obtained from the base 
and the recursion for C, every parenthesis structure in C has an equal number of left and 
right parentheses. E 


Consider the recursive definition for the set of all strings S over a finite set A given on 
page 366. A recursive definition can also be given for the length of a string. 


Definition Length of a String 


Given the set of all strings S over a finite set A, the length L of a string in S in de- 

fined as follows: 

1. LA) = 0. 

2. For every string u in S and for every character a in A, the length of ua is one more 
than the length of u. Symbolically: 


L(ua) = L(u)+1 where u€ Sanda €E A. 


The following theorem states that the length of a concatenation of two strings is the sum 
of the lengths of the strings. 
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Theorem 5.9.2 Additive Property of String Length 


If S is the set of all strings over a finite set A, then for all strings u and v in S, 
L(uv) = L(u) + L(y). 


Proof (by structural induction): Let S be the set of all strings over a finite set A. 
Given any string v in S, let the property P(v) be the sentence 


For every string u in S, L(uv) = L(u) + L(v). 


We will show that P(v) is true for every string v in S. 


Show that P(a) is true for each string a in the base for S: 
The only string in the base for Sis À, and if u is any string in S, then 


L(uà) = L(u) by part II(b) in the definition of string 
= L(u) + 0 by definition of L. 
= L(u) + L(A) 


This shows that P(A) is true. 


Show that for each string x in S, if P(x) is true and if y is obtained from x by ap- 
plying a rule from the recursion for S, then P(y) is true: 

The recursion for S consists of three rules denoted II(a), II(b), and II(c), but rule 
II(a) is the only one that generates new strings in S. Suppose v is any string in S such 
that P(v) is true. In other words, suppose that L(uv) = L(u) + L(v). [This is the induc- 
tive hypothesis.] 

When rule II(a) is applied to v, the result is vc, where c is a character in A. So, to 
complete the inductive step, we must show that P(vc) is true. Now 


L(u(ve)) = L((uv)c) by part II(c) of the definition of string 
= Ltuv) + 1 by definition of length of a string 
= (L(u) + Lv) +1 because u is assumed to satisfy property P 
= L(u) + (L(y) + 1) by the associative law for addition 


= L(u) + Live) by definition of length of a string. 


Hence P(vc) is true fas was to be shown]. 


Conclusion: 
Because there are no strings in S other than those obtained through the base and the 
recursion for S, we conclude that every string in S satisfies the additive property for 
string length. 


The definition of string only defined concatenation between a string and an element of 
the underlying set. The next theorem extends the operation to pairs of strings. 
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Theorem 5.9.3 The Concatenation of Any Two Strings Is a String 


If S is the set of all strings over a finite set A and u and v are any strings in S, then 
uv is a string in S. 


Proof (by structural induction): Let S be the set of all strings over a finite set A. 
Given any string v in S, let the property P(v) be the sentence 


For every string u in S, uv is a string in S. 


We will show that P(v) is true for every string v in S. 


Show that P(a) is true for each string a in the base for S: 

The only string in the base for S is A, and if u is any string in S, then, by rule II(b) 
in the definition of string, vA = u. Hence the concatenation of u and À is a string in 
S, and so P(A) is true. 


Show that for each string x in S, if P(x) is true and if y is obtained from x by ap- 
plying a rule from the recursion for S, then P(y) is true: 

The recursive definition for S$ consists of three rules denoted II(a), I(b), and H(c), 
but rule II(a) is the only one that generates new strings in S. Suppose v is any string 
in S such that P(v) is true. In other words, suppose that for every string u in S, uv isa 
string in S. [This is the inductive hypothesis.] 

When rule II(a) is applied to v, the result is vc, where c is a character in A. To 
complete the inductive step, we must show that P(vc) is true. To do so, we will show 
that u(vc) is a string in S. 

Now because uv is a string in S, it follows from rule I(a) that (uv)c is also a string 
in S. In addition, by rule II), 


(uv)c = u(vc). 
Therefore, u(vc) is a string in S, which means that P(vc) is true [as was to be shown]. 
Conclusion: 


Because there are no strings in S other than those obtained from the base and the re- 
cursion for S, we conclude that the concatenation of any two strings in S is a string in S. 


Part II(c) of the definition of string states that a concatenation of three strings of a cer- 
tain type is associative—but only when the rightmost string is an element in the underlying 
set. The next theorem generalizes the associativity of concatenation to any three strings. 


Theorem 5.9.4 Concatenation of Strings Is Associative 


If S is the set of all strings over a finite set A and u, v, and w are any strings in S, then 
u(vw) = (uv)w. 


Idea of a proof by structural induction: Let S be the set of all strings over a finite 
set A. Given any string w in S, let the property P(w) be the sentence 


For all strings u and v in S, u(vw) = (uv)w. 


(continued on page 372) 
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The proof must show (1) that P(A) is true, and (2) that if w is any string in S such that 
P(w) is true and if y is obtained from w by applying a rule from the recursion for S, 
then P(y) is true. Now when rule II(a) is applied to w the result is wc for some char- 
acter c in A. A crucial step is to show that u((vw)c) = (u(vw))c. This follows from 
the definition of string because u and vw are in S and c is in A. 

Exercise 21 at the end of this section asks you to write a complete proof. 


Recursive Functions 


A function is said to be defined recursively or to be a recursive function if its rule of defi- 
nition refers to itself. Because of this self-reference, it is sometimes difficult to tell whether 
a given recursive function is well defined. Recursive functions are of great importance in 
the theory of computation in computer science. 


McCarthy’s 91 Function 


The following function M : Z*— Z was defined by John McCarthy, a pioneer in the 
theory of computation and in the study of artificial intelligence: 


n—10 ifn > 100 


a ee +11) ifn<100 


for all positive integers n. Find M(99). 


Solution By repeated use of the definition of M, 


M(99) = M(M(110)) since 99 < 100 

= M(100) since 110 > 100 

= M(M(111)) since 100 < 100 

e = M(101) since 111 > 100 

= 201 since 101 > 100. 
John McCarthy The remarkable thing about this function is that it takes the value 91 for all positive inte- 
(eran ers less than or equal to 101. (You are asked to show this in exercise 24 at the end of this 

g q 

section.) For n > 101, M(n) is well defined because it equals n — 10. E 


SEUMGEXA:J The Ackermann Function 


wo aig 


In the 1920s the German logician and mathematician Wilhelm Ackermann first defined 
a version of the function that now bears his name. This function is important because its 
values are computable but cannot be evaluated using only for-next loops. The function is 
defined on the set of all pairs of nonnegative integers as follows: 


3 A(O,n) =n+1 for all nonnegative integers n 3.9.1 
= 
< A(m, 0) = A(m— 1, 1) for all positive integers m 392 
& A(m, n) = A(m— 1, A(m, n— 1)) for all positive integers m and n 5.9.3 
Wilhelm Ackermann 
(1896-1962) Find A(1, 2). 
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Solution 
A(1, 2) = A(0, A(1, 1)) by (5.9.3) with m = 1 andn = 2 
= A(0, A(O, ACI, 0))) by (5.9.3) with m = 1 andn = 1 
= A(0, A(O, A(O, 1))) by (5.9.2) with m = 1 


= A(0, A(O, 2)) by (5.9.1) withn = 1 
= A(0, 3) by (5.9.1) with n = 2 
=4 by (5.9.1) with n = 3. 


The special properties of the Ackermann function are a consequence of its phenomenal rate 
of growth. While the values of A(0, 0) = 1, ACL, 1) = 3, AQ, 2) = 7, and A(3, 3) = 61 are 
not especially impressive, 


36 


A(4, 4) = 2? 


and the values of A(n, n) continue to increase with extraordinary rapidity thereafter. E 


The argument is somewhat technical, but it is not difficult to show that the Ackermann 
function is well defined. The following is an example of a recursive “definition” that does 
not define a function. 


See A Recursive “Function” That Is Not Well Defined 
Consider the following attempt to define a recursive function G from Z* to Z. For each 
integer n = 1, 
1 if nis 1 
G@) =214+6 (5) if nis even 
G(3n—1) ifnisoddandn> 1. 


Is G well defined? Why? 
Solution Suppose G is a function. Then by definition of G, 
Ga) = 1, 
GQ) = 1+G(Q) =14+1=2, 
G(3) = G(8) = 1+ G4) = 1+0+G(Q)) =1+0 +2) =4, 
G(4) = 1+ GQ) =14+2=3. 


However, 
G(5) = G(14) = 1+ G(7) = 1 + G(20) 
=1+(1+G(0)) =1+(0+(0+4+G6))) = 3 + G6). 


Subtracting G(5) from both sides gives 0 = 3, which is false. Since the supposition that G 
is a function leads logically to a false statement, it follows that G is not a function. B 


A slight modification of the formula of Example 5.9.9 produces a “function” whose 
status of definition is unknown. Consider the following formula: For each integer n = 1, 


1 if nis 1 
n . { 
Tín = 9 T (5) if n is even 
T(3n+1) if nis odd. 
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In the 1930s, a student, Luther Collatz, became interested in the behavior of a related func- 
tion g, which is defined as follows: g(n) = n/2 if n is even, and g(n) = 3n + 1 if n is odd. 
Collatz conjectured that for any initial positive number n, computation of successive values 
of g(n), g°(n), gm), . . . would eventually produce the number 1. Determining whether this 
conjecture is true or false is called the 3n + 1 problem (or the 3x + 1 problem). If Collatz’s 
conjecture is true, the formula for T defines a function; if the conjecture is false, T is not 
well defined. As of the publication of this book the answer is not known, although com- 
puter calculation has established that it holds for extremely large values of n. 


TEST YOURSELF 


1. The base for a recursive definition of a set elements in the and apply the rules from 
is the until you obtain the given element. 

2. The recursion for a recursive definition of a set 5. To use structural induction to prove that every 
is element in a recursively defined set S satisfies a 


certain property, you show that 
each rule in the recursion, if 


and that, for 
then 


3. The restriction for a recursive definition of a set 
is 


: na 6. A function is said to be defined recursively if, and 
4. One way to show that a given element is in a 


recursively defined set is to start with an element or anly i, ` 
EXERCISE SET 5.9 
1. Consider the set of Boolean expressions defined in 5. The set of arithmetic expressions over the real 
Example 5.9.1. Give derivations showing that each numbers can be defined recursively as follows: 
of the following is a Boolean expression over the I. Base: Each real number r is an arithmetic 
English alphabet {a, b, c,..., x, y, z}. expression. 
a ~pV@GA(rv~s)) II. Recursion: If u and v are arithmetic expres- 
b. (pV gv ~(pA~s) An) sions, then the following are also arithmetic 
P : expressions: 
2. Consider the set C of parenthesis structures de- 
fined in Example 5.9.2. Give derivations showing Waj. (Fu) 
that each of the following is in C. Ib) (~u) 
a. OO) Il(c) (u+v) 
b. OO Id) (u-v) 
3. Let S be the set of all strings over a finite set A Ie) (u-v) 
and let a, b, and c be any characters in A. j 
a. Using Theorem 5.9.1 but not Theorem 5.9.3 or I) (5 
5.9.4, show that (ab)c = a(bc). 
b. Show that ab is a string in S. Then use the result III. Restriction: There are no arithmetic expres- 
of part (a) to conclude that a(bc) is a string in S. sions over the real numbers other than those 


(This exercise shows that parentheses are not obtained from I and II. 


needed when writing the string abc.) (Note that the expression (5 is allowed to be 


4. Consider the MI U-system discussed in Example 
5.9.4. Give derivations showing that each of the fol- 
lowing is in the MI U-system. 

a. MIUI 


b. MUIIU a. ((2: (0.3-4.2) + (7) b. ( 


an arithmetic expression even though the value 
of v may be 0.) Give derivations showing that 
each of the following is an arithmetic expression. 


oe et 
((4—7)-6) 
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. Let S be a set of integers defined recursively as 


follows: 

I. Base: 5 is in S. 
II. Recursion: Given any integer n in S, n + 4 is 
in S. 
Restriction: No integers are in S other than 
those derived from rules I and II above. 


Il. 


Use structural induction to prove that for every 
integer n in S, n mod 2 = 1. 


. Define a set S of strings over the set {0, 1} recur- 


sively as follows: 
I. Base: 1 E S 
II. Recursion: If s E S, then 
Ha) Os ES 
Ib) lsES 


Restriction: Nothing is in S other than objects 
defined in I and II above. 


il. 


Use structural induction to prove that every string 
in S ends ina 1. 


. Define a set S of strings over the set {a, b} recur- 


sively as follows: 
I. Base:a E S 
Il. Recursion: If s E S, then 
Ha) saEsS 
Ib) sbh ES 


Restriction: Nothing is in S other than objects 
defined in I and II above. 


HI. 


Use structural induction to prove that every string 
in S begins with an a. 


. Define a set S of strings over the set {a, b} recur- 


sively as follows: 
I. Base: A ES 
Il. Recursion: If s E S, then 
IIa) bs ES 
I(b) sh ES 
Hc) saa ES 
Id) aas E S 


Restriction: Nothing is in S other than objects 
defined in I and II above. 


Il. 


Use structural induction to prove that every string 
in S contains an even number of a’s. 


Define a set S of strings over the set of all integers 
recursively as follows: 


13. 


H* 14. 


375 


I. Base: 1 E S,2ES,3 E S,4 E S,5 ES, 
6ES,7ES,8ES,9ES 


II. Recursion: If s E S and t E S, then 
Ha) s0ES 
Il(b) st ES 
HI. Restriction: Nothing is in S other than objects 
defined in I and II above. 
Use structural induction to prove that no string in 
S represents an integer with a leading zero. 


. Define a set S of strings over the set of all integers 


recursively as follows: 
I. Base: LES ZESSES,TES IES 
II. Recursion: If s E S and t E S, then 
Ha) stEs 
(b) 2s ES 
HC) 4sES 
Id) 6s ES 
Ile) 8s ES 


Restriction: Nothing is in S other than objects 
defined in I and II above. 


Il. 


Use structural induction to prove that every string 
in S represents an odd integer when written in 
decimal notation. 


. Define a set S of integers recursively as follows: 


I. Base: 0E SSE S 
II. Recursion: If k E S and p E S, then 
Ha) k+pES 
Ib) k-pES 
Restriction: Nothing is in S other than objects 
defined in I and II above. 


MI. 


Use structural induction to prove that every integer 
in S is divisible by 5. 

Define a set S of integers recursively as follows: 

I. Base:OES 

II. Recursion: If k E€ S, then 

Ha) k+3€S 

Ib) kK-3 ES 

Restriction: Nothing is in S other than objects 
defined in I and II above. 


III. 
Use structural induction to prove that every integer 
in S is divisible by 3. 


Is the string MU in the MI U-system? Use struc- 
tural induction to prove your answer. 
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Determine whether either of the following pa- 
renthesis configurations is in the set C defined in 
Example 5.9.2. Use structural induction to prove 
your answers. 


a. OW 


23. 


Use the definition of McCarthy’s 91 function in 
Example 5.9.7 to show the following: 

a. M(86) = M(Q91) 

b. M91) = 91 


H b. OOO * 24. Prove that McCarthy’s 91 function equals 91 for 
: all positive integers less than or equal to 101. 
16. Give a tecursIve definition for the set of ali strings of 25. Use the definition of the Ackermann function in 
0’s and 1’s that have the same number of 0’s as 1’s. ; 
Example 5.9.8 to compute the following: 
17. Give a recursive definition for the set of all strings a. AC, 1) 
of 0’s and 1’s for which all the 0’s precede all the 1’s. b. A(2, 1) 
18. Give a recursive definition for the set of all strings 26. Use the definition of the Ackermann function to 
of a’s and b’s that contain an odd number of a’s. show the following: 
19. Give a recursive definition for the set of all strings aun = ES TONERS ao n 
ofo sandr s ihatcontainexsctieone b. A(2, n) = 3 + 2n, for each nonnegative integer n 
y ` c. A(3,n) = 8 2” — 3, for each nonnegative 
20. a. Let A be any finite set and let L be the length integer n 
function on the set of all strings over A. Prove 27. Compute TŒ), TG), T(4), T(5), T(6), and T(7) for 
al tor every Character a 1m4; La) = 1; the “function” T defined after Example 5.9.9 
b. IfA is a finite set, define a set S of strings over poe ara 
A as follows: 28. Student A tries to define a function F : Z* — Z by 
I. Base: Every character in A is a string in S. the rule 
I. Recursion: If s is any string in S, then for 1 ifnis | 
every character c in A, csc is a string in S. Fn) = (2) ifniseven 
III. Restriction: Nothing is in S except strings 2 
obtained from the base and the recursion. 14+ F(5n—-9) ifnisoddandn> 1 
Use structural induction to prove that given any for each integer n = 1. Student B claims that F is 
string s in S, the length of s, L(s), is an odd integer. not well defined. Justify student B’s claim. 
21. Write a complete proof for Theorem 5.9.4. 29. Student C tries to define a function G : Z* — Z by 
H 22. If Sis the set of all strings over a finite set A and the rule 


ANSWERS FOR TEST YOURSELF 


if u is any string in S, define the string reversal 1 if nis 1 

function, Rev, as follows: ü 

a. Rev(A) =A Gin) = 4G > if n is even 

b. For every string u in S and for every character 2+GGn—5) ifnisoddandn> 1 


ain A, Rev(ua) = aRev(u). 
Use structural induction to prove that for all 
strings u and v in S, Rev(uv) = Rev(v)Rev(u). 


for each integer n = 1. Student D claims that G is 
not well defined. Justify student D’s claim. 


1. a statement that certain objects belong to the set 


2.a collection of rules indicating how to form new set objects 


from those already known to be in the set 


3. a statement 


either the base or the recursion 4. base; recursion 5. each 


object in the base satisfies the property; the rule is applied to 


objects in the base; the objects defined by the rule also satisfy 


that no objects belong to the set other than those coming from the property 6. its rule of definition refers to itself 
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SET THEORY 


In the late nineteenth century, Georg Cantor was the first to realize the potential 
usefulness of investigating properties of sets in general as distinct from properties of 
the elements that comprise them. Many mathematicians of his time resisted accept- 
ing the validity of Cantor’s work. Now, however, abstract set theory is regarded as the 
foundation of mathematical thought. All mathematical objects (even numbers!) can be 
defined in terms of sets, and the language of set theory is used in every mathematical 
subject. 

In this chapter we add to the basic definitions and notation of set theory introduced 
in Chapter 1 and show how to establish properties of sets through the use of proofs and 
counterexamples. We also introduce the notion of a Boolean algebra, explain how to derive 
its properties, and discuss their relationships to logical equivalencies and set identities. 
The chapter ends with a discussion of a famous “paradox” of set theory and its relation to 
computer science. 


Pictorial Press Ltd./Alamy Stock Photo 


Georg Cantor 
(1845-1918) 


Source: David Eugene Smith Collection, Columbia University. 


KA] Set Theory: Definitions and the Element 
Method of Proof 


The introduction of suitable abstractions is our only mental aid to organize and 
master complexity. —E. W. Dijkstra, 1930-2002 


The words set and element are undefined terms of set theory just as sentence, true, and 
false are undefined terms of logic. The founder of set theory, Georg Cantor, suggested 
imagining a set as a “collection into a whole M of definite and separate objects of our intu- 
ition or our thought. These objects are called the elements of M.” Cantor used the letter M 
because it is the first letter of the German word for set: Menge. 
Following the spirit of Cantor’s notation (though not the letter), let S denote a set and a 
an element of S. Then, as indicated in Section 1.2, a E S means that a is an element of S, 
a €& S means that a is not an element of S, {1, 2, 3} refers to the set whose elements are 1, 
2, and 3, and {1, 2, 3,... } refers to the set of all positive integers. If S is a set and P(x) is a 
A property that elements of S may or may not satisfy, then a set A may be defined by writing 


Caution! Don’t forget to 
include the words “the set A={xES | P(x)}, 
of all.” Z a 


the set of all such that 


which is read “A is the set of all x in S such that P of x.” 
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Subsets: Proof and Disproof 


In Section 1.2 we defined what it means for a set A to be a subset of the set B. Here we 
rewrite the definition as a formal universal conditional statement: 


ACB © \%,ifx EA thenx €B. 


The negation is, therefore, existential: 


AGB = xX such that x E A and x É B. 


Recall that a proper subset of a set is a subset that is not equal to its containing set. That is: 


A is a proper subset of B = 
(1) A C B, and 
(2) there is at least one element in B that is not in A. 


Testing Whether One Set Is a Subset of Another 
Let A = {1} and B = {1, {1}}. 
a. Is A CB? 
b. If so, is A a proper subset of B? 


Solution 

a. Because A = {1}, A has only one element—namely, the symbol 1. This element is 
Note A set like {1}, with also one of the elements in set B. Hence every element in A is in B, and so A C B. 
just one element, is called 


b. B has two distinct elements, the symbol | and the set {1} whose only element is 1. 
Since | # {1}, the set {1} is not an element of A, and so there is an element of B that is 
not an element of A. Hence A is a proper subset of B. E 


a singleton set. 


Because we define what it means for one set to be a subset of another by means of a uni- 
versal conditional statement, we can use the method of direct proof to establish a subset 
relationship. Such a proof is called an element argument and is the fundamental proof 
technique of set theory. 


Element Argument: The Basic Method for Proving That 
One Set Is a Subset of Another 


Let sets X and Y be given. To prove that X C Y, 
1. suppose that x is a particular but arbitrarily chosen element of X, 
2. show that x is an element of Y. 


Proving and Disproving Subset Relations 


Define sets A and B as follows: 


A= {mE Z|m = 6r+ 12 for some r € Z} 
B = {n E Z |n = 3s for some s E Z}. 


a. Outline a proof that A C B. b. Prove that A C B. c. Disprove that B C A. 
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Solution 
a. Proof Outline: 
Starting Point: Suppose x is a particular but arbitrarily chosen element of A. 


To Show: Therefore, x is an element of B. 


b. Proof: 
Suppose x is a particular but arbitrarily chosen element of A. 


[We must show that x E B. By definition of B, this means 
we must show that x = 3-(some integer).] 


By definition of A, there is an integer, say r, such that x = 6r+ 12. 


[Given that x = 6r + 12, can we express x as 3: (some integer)? 
That is, does 6r + 12 = 3+(some integer)? Yes, 6r + 12 = 3-(2r+4).] 


Let s = 2r+4. 
[We must check that s is an integer.] 


Then s is an integer because products and sums of integers are integers, and so 3s E B 
by definition of B. 


[Now we must check that x = 3s.] 


Also 3s = 3(2r + 4) = 6r+12 =x, 
Thus, by definition of B, x is an element of B, 


[as was to be shown]. 


c. To disprove a statement means to show that it is false. And to show that B C A is 
false, you must find an element of B that is not an element of A. By the definitions of 
A and B, this means that you must find an integer x of the form 3 - (some integer) that 
cannot be written in the form 6-(some integer) + 12. A little experimentation reveals 
that various numbers work. For instance, you could let x = 3. Then x € B because 
3 = 3-1, but x € A because there is no integer r such that 3 = 6r + 12. For if there 
were such an integer, then 


Note Recall that the 6r+12=3 by assumption 


notation P(x) = Q(x) = 2r+4=1 by dividing both sides by 3 
means that every element 


that makes P(x) true also > 2r= -3 by subtracting 4 from both sides 
makes Q(x) true. => r = —3/2. by dividing both sides by 2. 
But —3/2 is not an integer. Thus 3 € B whereas 3 € A, and so B ZA. E 


Set Equality 


Recall that by the axiom of extension, sets A and B are equal if, and only if, they have 
exactly the same elements. We restate this as a definition that uses the language of subsets. 


Definition 


Given sets A and B, A equals B, written A = B, if, and only if, every element of A is 
in B and every element of B is in A. 
Symbolically: 


A=B & ACBandBCA. 
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This version of the definition of equality implies the following: 


To know that a set A equals a set B, you must know 
that A C B and you must also know that B C A. 


Set Equality 


Defi ts A and B as foll : 
Note A is the set of all ene ene ae lacie 


integers that can be writ- A = {mE Z| m = 2a for some integer a} 
ten as 2-(some integer). 

The symbols m and a in 

the formal definition for A Is A =B? 
are used for convenience, 

but they can be replaced Solution Yes. To prove this, both subset relations A C B and B C A must be proved. 


b ther symbols as 
T Part 1, Proof That A C B: 


long as the replacements : : oy 
are made consistently. For Suppose x is a particular but arbitrarily chosen element of A. 


example, we can write [We must show that x E B. By definition of B, this means 


A={n€Z|n=2b we must show that x = 2: (some integer) — 2.] 


B = {n E Z |n = 2b — 2 for some integer b}. 


for some integer b}. By definition of A, there is an integer, say a, such that x = 2a. 


[Given that x = 2a, can x also be expressed as 2: (some integer) — 2? 
In other words, is there an integer—say, b—such that 2a = 2b — 2? 
Solve for b to obtain b = (2a + 2)/2 = a + 1. Check to see if this 
works.] 


Letb=a+l1. 
[First check that b is an integer.] 
Then b is an integer because it is a sum of integers. 


[Then check that x = 2b —2.] 


Also, 2b— 2 = 2(a + 1)— 2 = 2a +2 -2 = 2a = x. 
Thus, by definition of B, x is an element of B 


[as was to be shown]. 


Part 2, Proof That B C A: This part of the proof is left as exercise 2 at the end of this 
section. E 


Venn Diagrams 


If sets A and B are represented as regions in the plane, relationships between A and B can 
be represented by pictures called Venn diagrams, which were introduced by the British 
mathematician John Venn in 1881. For instance, the relationship A C B can be pictured in 
one of two ways, as shown in Figure 6.1.1. 


(4) 2 


John Venn (a) (b) 
(1834-1923) FIGURE 6.1.1 ACB 


Pictures From History/Newscom 
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The relationship A É B can be represented in three different ways, as shown in 


Figure 6.1.2. 
C) C) 
(b) (c) 


(a) 
FIGURE 6.1.2 ACB 


If we allow the possibility that some subregions of Venn diagrams do not contain any 
points, then in Figure 6.1.1 diagram (b) can be viewed as a special case of diagram (a) by 
imagining that the part of B outside A does not contain any points. Similarly, diagrams (a) 
and (c) of Figure 6.1.2 can be viewed as special cases of diagram (b). To obtain (a) from (b), 
imagine that the region of overlap between A and B does not contain any points. To obtain 
(c), imagine that the part of B that lies outside A does not contain any points. However, in 
all three diagrams it would be necessary to specify that there is a point in A that is not in B. 


Relations among Sets of Numbers 


Since Z, Q, and R denote the sets of integers, rational numbers, and real numbers, respec- 
tively, then Z is a subset of Q because every integer is rational (any integer n can be written 
in the form “an and Q is a subset of R because every rational number is real (any rational 
© Q\R number can be represented as a length on the number line). Z is a proper subset of Q be- 
cause there are rational numbers that are not integers (for example, 5), and Q is a proper 
subset of R because there are real numbers that are not rational (for example, V2). These 


FIGURE 6.1.3 relationships are shown diagrammatically in Figure 6.1.3. a 


Operations on Sets 


Most mathematical discussions are carried on within some context. For example, in a certain 
situation all sets being considered might be sets of real numbers. In such a situation, the set of 
real numbers would be called a universal set or a universe of discourse for the discussion. 


Definition 


Let A and B be subsets of a universal set U. 


1. The union of A and B, denoted A U B, is the set of all elements that are in at least 
one of A or B. 


2. The intersection of A and B, denoted A N B, is the set of all elements that are 
common to both A and B. 


3. The difference of B minus A (or relative complement of A in B), denoted B — A, 
is the set of all elements that are in B and not A. 


4. The complement of A, denoted A‘, is the set of all elements in U that are not in A. 
Symbolically: 
AUB={x€U|x€Aorx € B} 
ANB={x€U|x EA andx E B} 
B—A = {xE U|xEBandx EA} 
Ao = {x E U| x EA}. 
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The symbols €, U, and N were introduced in 1889 by the Italian mathematician 
Giuseppe Peano. 

Venn diagram representations for union, intersection, difference, and complement are 
shown in Figure 6.1.4. 


æj x] | co] 


fototeca gilardi/Marka/Superstock 


Shaded region Shaded region Shaded region Shaded region 
represents A U B. represents A N B. represents B — A. represents A^. 
Giuseppe Peano FIGURE 6.1.4 


(1858-1932) 
SCUWGLARS Unions, Intersections, Differences, and Complements 


Let the universal set be the set U = {a, b, c, d, e, f, g}, and let A = {a, c, e, g} and 
B = {d, e, f, g}. Find A U B, AN B, B—A, and A‘. 


Solution 
AUB= {a,c,d,e, f, g} AM B= fe, 2} 
B-A = {d,f} A’ = {b, d, f} a 


There is a convenient notation for subsets of real numbers that are intervals. 


Interval Notation 
Given real numbers a and b witha S b: 


(a,b) ={xER|a<x<b} [ad] ={xER|asx=b} 
(a, b] = {x ER|a<xsb} [a, b) = {x ER|asx<b}. 


Note The symbol œ% The symbols % and —% are used to indicate intervals that are unbounded either on 
does not represent a the right or on the left: 

number. It just indicates 

the unboundedness of the (a,~)= {xER | >o Be SER | s=) 

interval. (—%, b) = {xER|x<b} (—~,b] = {xER|x =b}. 


Although the notation for the interval (a, b) is identical to the notation for the ordered pair 
(a, b), context makes it unlikely that the two will be confused. 


Set Ray An Example with Intervals 


Let the universal set be R, the set of all real numbers, and let 
A=(-1,0] = {xER| 1<x=0} and B=[0,1)= {x R|OSx< 1}. 


These sets are shown on the number lines below. 


< o—+ H > 
A 
-2 -l 0 1 2 
< H +—© > 
B 


Find A U B, AN B, B—A, and A*. 
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a op ee a Solution 
<4 AUB= {x E R|x E (-1, 0] or x € [0, 1)} = {x ER|x E€ (-1, 1)} = (-1, 1). 
AUB 
9.2! 0. 1 2 
< + > AMB= {x ER|x € (—1, 0] and x € [0, 1)} = {0}. 
ANB 
Ot Oo LT 2 
< o—o—}+> B-A={xER|x€ [0, 1) andx ¢ (—1, 0]} = {x E R|O0 <x < 1} = (0, 1) 


a A° = {x E R| it is not the case that x E (—1, 0]} 


= {x E€ R|it is not the case that (—1 < x and x <0)} _ DY definition of the 
double inequality 


={xE R|x = —-lorx>0} = (—~,—-1] U (0, ~) by De Morgan’s law E 


The definitions of unions and intersections for more than two sets are very similar to 
the definitions for two sets. 


Definition 


Unions and Intersections of an Indexed Collection of Sets 
Given sets Ag, Aj, A2, . . . that are subsets of a universal set U and given a nonnegative 


n integer n, 
Note \_) A;is read “the 
i=0 $ 
union of the A-sub-i from Ua = {x E U|x E A; for at least one i = 0, 1, 2,..., n} 
i equals zero to n.” =U 
Ai = {x E U|x E A; for at least one nonnegative integer i} 
E 


(A: = {x E U|x E A; for every i = 0, 1, 2,..., n} 
i=0 


A; = {x E U|x E A; for every nonnegative integer i}. 
i=0 


I 


An alternative notation for U A; is Ag U A; U... U A, and an alternative notation for 
[N Ais Ag MALO... A,” 


i=0 
eae” Finding Unions and Intersections of More than Two Sets 


For each positive integer i, let A; = {x ER| l <x< i} = (-4 3). 


ii 


a. Find A; U A, U A; and Ay N A, N 43. b. Find U A; and () Aj. 


i=1 i=l 
Solution 
a. A; U A, U A; = {x E R]x is in at least one of the intervals (—1, 1), 


or (=3, 3}, or (=3,3)} 


={xE R| =1<x% <1} because all the elements in (-3,4 


RS 


= (-l, 1) and (—4, 1) are in{—1, 1} 
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A, N Az N A; = {x E R | xis in all of the intervals (1, 1), and (—3, 5), and (—3, 3)} 


={kER|-3<x<3}  becase(-4 l) c(-4} cL D 


œ 


b. UJA; = fx € R | xis in at least one of the intervals (-i, i) 


i, F 
i=1 
where i is a positive integer} 
={xER | =] << 1} because all the elements in every interval 
=(-1,1) (—i, ;) are in (1, 1) 


4: = {x €E R | xis in all of the intervals ( oe 


z 4), where i is a positive integer} 
i=1 


= {0} because the only element in every interval is 0 E 


The Empty Set 


We have stated that a set is defined by the elements that compose it. This being so, 
can there be a set that has no elements? It turns out that it is convenient to allow such 
a set. Otherwise, every time we wanted to take the intersection of two sets or to define 
a set by specifying a property, we would have to check that the result had elements 
and hence could be defined as a set. For example, if A = {1,3} and B = {2, 4}, then 
A N B has no elements. Neither does {x € R | v= 1} because no real numbers have 
negative squares. 

It may seem strange to talk about a set with no elements, but it often happens in math- 
ematics that the definitions formulated to fit one set of circumstances are satisfied by some 
extreme cases not originally anticipated. Yet changing the definitions to exclude those cas- 
es would seriously undermine the simplicity and elegance of the theory taken as a whole. 

In Section 6.2 we will show that there is only one set with no elements. Because it is 
unique, we can give it a special name. We call it the empty set (or null set) and denote it 
by the symbol Ø. Thus {1, 3} M {2,4} = Ø and {x E R| = -]} = Ø. 


A Set with No Elements 


Describe the following sets. 
a. D={xER|3<x<2}. b. E={xEZ|2<x<3}. 
Solution: 


a. Recall that a < x < b means that a < x and x < b. So D consists of all real numbers 
that are both greater than 3 and less than 2. Since there are no such numbers, D has no 
elements and thus D = Ø. 


b. E is the set of all integers that are both greater than 2 and less than 3. Since no inte- 
gers satisfy this condition, E has no elements, and so E = Ø. E 


Partitions of Sets 


In many applications of set theory, sets are divided into nonoverlapping (or disjoint) pieces. 
Such a division is called a partition. 
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Definition 


Two sets are called disjoint if, and only if, they have no elements in common. 
Symbolically: 


A and B are disjoint = ANB = Ø. 


Example 6.1.9 Disjoint Sets 


Let A = {1, 3,5} and B = {2, 4, 6}. Are A and B disjoint? 


Solution Yes. By inspection A and B have no elements in common, or, in other words, 
{1,3,5} N {2, 4, 6} = Ø. | 


Definition 


Sets Aj, A2, A3, . . . are mutually disjoint (or pairwise disjoint or nonoverlapping) 
if, and only if, no two sets A; and A; with distinct subscripts have any elements in 
common. More precisely, for all integers i and j = 1, 2, 3,... 


A; A; = © whenever i # j. 


Seay Mutually Disjoint Sets 
a. Let A; = {3,5}, A2 = {1, 4, 6}, and Az = {2}. Are Aj, A>, and A3 mutually disjoint? 
b. Let B; = {2, 4, 6}, B2 = {3, 7}, and B3 = {4, 5}. Are Bj, B2, and B mutually disjoint? 


Solution 


a. Yes. A; and A, have no elements in common, A, and A, have no elements in common, 
A and A, and A; have no elements in common. 


> b. No. B, and B; both contain 4. E 


Suppose A, Aj, A2, A3, and Ay are the sets of points represented by the regions shown 

in Figure 6.1.5. Then A;, A2, A3, and A; are subsets of A, and A = A; U A; U A; U Ay. 

Suppose further that boundaries are assigned to the regions representing A,, A2, A3, and 

FIGURE6.1.5 A Parti- A, in such a way that these sets are mutually disjoint. Then A is called a union of mutual- 


tion of a Set ly disjoint subsets, and the collection of sets {A;, A2, A3, A4} is said to be a partition of A. 


Definition 


A finite or infinite collection of nonempty sets {A;, A>, A3,... } is a partition of a 
set A if, and only if, 


1. A is the union of all the A;; 
2. the sets A4, Ao, A3,...are mutually disjoint. 


Partitions of Sets 


a. Let A = {1, 2,3, 4,5, 6}, Ay = {1,2}, A, = {3, 4}, and A; = {5, 6}. Is {A), A>, A3} a 
partition of A? 
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b. Let Z be the set of all integers and let 


Ty = {n E Z| n = 3k, for some integer k}, 
Tı = {n E Z |n = 3k +1, for some integer k}, and 
Ty = {n E Z |n = 3k +2, for some integer k}. 


Is {To, Ti, To} a partition of Z? 


Solution 
a. Yes. By inspection, A = A; U A> U A; and the sets Aj, A>, and A} are mutually disjoint. 


b. Yes. By the quotient-remainder theorem, every integer n can be represented in exactly 
one of the three forms 


n=3k or n=3k+1 or n= 3k+2, 


for some integer k. This implies that no integer can be in any two of the sets Tọ, T}, or 
T». So To, T}, and T, are mutually disjoint. The theorem also implies that every integer 
is in one of the sets Tp, T}, or T2. So Z = Ty U Ti U Tp. E 


Power Sets 


There are various situations in which it is useful to consider the set of all subsets of a par- 
ticular set. The power set axiom guarantees that this is a set. 


Definition 


Given a set A, the power set of A, denoted P(A), is the set of all subsets of A. 


Se kaAva Power Set of a Set 


Find the power set of the set {x, y}. That is, find P(x, y}. 


Solution ({x, y}) is the set of all subsets of {x, y}. In Section 6.2 we will show that 
© is a subset of every set, and so Ø E P({x, y}). Also any set is a subset of itself, so 
{x, y} E P({x, y}). The only other subsets of {x, y} are {x} and {y}, so 


P({x, yp = {D, {x}, {y}, {x, y}} m 


An Algorithm to Check Whether One Set Is a Subset 
of Another (Optional) 


You may get some additional insight into the concept of subset by considering an algorithm 
for checking whether one finite set is a subset of another. Order the elements of both sets 
and successively compare each element of the first set with each element of the second set. 
If some element of the first set is not found to equal any element of the second, then the 
first set is not a subset of the second. But if each element of the first set is found to equal 
an element of the second set, then the first set is a subset of the second. The following 
algorithm formalizes this reasoning. 
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Algorithm 6.1.1 Testing Whether A C B 


[The input sets A and B are represented as one-dimensional arrays a[1], a[2], . . . , a[m] 
and b[1], b[2], . .. , b[n], respectively. Starting with a[1] and for each successive a{i] in 
A, a check is made to see whether a{i] is in B. To do this, a[i] is compared to successive 
elements of B. If ali] is not equal to any element of B, then the output string, called 
answer, is given the value “A Ç B.” If ali] equals some element of B, the next succes- 
sive element in A is checked to see whether it is in B. If every successive element of 
A is found to be in B, then the answer never changes from its initial value “A C B.”] 


Input: m [a positive integer], a[l], a[2],..., alm] [a one-dimensional array repre- 
senting the set A], n [a positive integer], b{1], b[2],..., b[n] [a one-dimensional array 
representing the set B] 

Algorithm Body: 


i:= 1, answer := “A C B” 
while (i = m and answer = “A C B”) 
J := 1, found := “no” 
while (j = n and found = “no” 
if a[i] = b[j] then found := “yes” 
j:=j+1 
end while 


[Tf found has not been given the value “yes” when execution reaches this point, 
then ali] € B.] 


if found = “no” then answer := “A Ç B” 
i:=i+1 
end while 
Output: answer [a string] 


Tracing Algorithm 6.1.1 


Trace the action of Algorithm 6.1.1 on the variables i, j, found, and answer for m = 3, 
n = 4, and sets A and B represented as the arrays a[l] = u, a[2] = v, a[3] = w, b[1] = w, 
b[2] = x, b[3] = y, and b[4] = u. 


Solution 
i 1 2 3 
J 1 2| 3J] 41/5/1234 5 
found no yes no 
answer | ACB ACB Po 


In the exercises at the end of this section, you are asked to write an algorithm to check 
whether a given element is in a given set. To do this, you can represent the set as a one- 
dimensional array and compare the given element with successive elements of the array to 
determine whether the two elements are equal. If they are, then the element is in the set; if 
the given element does not equal any element of the array, then the element is not in the set. 
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TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. The notation A C B is read “ ” and means 


that 7 


2. To use an element argument for proving that a set 
X is a subset of a set Y, you suppose that 
show that 


3. To disprove that a set X is a subset of a set Y, you 
show that there is 


4. An element x is in A U B if, and only if, 


5. An element x is in A N B if, and only if, 


EXERCISE SET 6.1* 


and 


6 
7. 
8 
9 
10. 
Tl. 


. An element x is in B — A if, and only if, 


. The empty set is a set with 


. The power set of a set A is 


An element x is in A‘ if, and only if, 


Sets A and B are disjoint if, and only if, 


A collection of nonempty sets Aj, A>, A3,... 
is a partition of a set A if, and only if, 


1. In each of (a)—-(f), answer the following questions: 
Is A C B? Is B CA? Is either A or B a proper 
subset of the other? 

A = {2, {2}, /2)"}, B = {2, {2}, {{2}}} 

A = {3, \/5*—4?, 24 mod 7}, B = {8 mod 5} 

= {{1, 2}, {2,3}}, B = {1, 2, 3} 

{a, b, c}, B = {{a}, {b}, {ch} 

= {V16, {4}}, B = {4} 

{xE R|cos x EZ}, 
B= {xE R|sinx E Z} 


mppnpg yp 


2. Complete the proof from Example 6.1.3: Prove 
that B C A where 


A = {m € Z| m = 2a for some integer a} 
and 


B = {n E Z |n = 2b — 2 for some integer b} 


3. Let sets R, S, and T be defined as follows: 
R = {x € Z| x is divisible by 2} 
S = {y € Z| y is divisible by 3} 
T = {z EZ | zis divisible by 6}. 
Prove or disprove each of the following statements. 


a RCT b. TCR e TOS 


4. Let A = {n E Z |n = 5r for some integer r} and 
B = {m E Z | m = 20s for some integer s}. Prove 
or disprove each of the following statements. 

a ACB b. BCA 


. Let C= {nE Z]|n= 6r 


5 for some integer r} 
and D = {m E Z | m = 3s + 1 for some integer s}. 


Prove or disprove each of the following statements. 
a CCD b. DCC 


. Let A = {x E Z | x = 5a +2 for some integer a}, 


B ={y E Z | y = 10b — 3 for some integer b}, and 
C={z E€ Z|z = 10c +7 for some integer c}. 


Prove or disprove each of the following statements. 
a ACB b. BCA Hc. B=C 


Let A = {x E Z | x = 6a + 4 for some integer a}, 
B = {y E Z | y = 18b — 2 for some integer b}, and 
C = {z E Z |z = 18c + 16 for some integer c}. 


Prove or disprove each of the following statements. 
a ACB b. BCA c. B=C 


. Write in words how to read each of the following 


out loud. Then write each set using the symbols 
for union, intersection, set difference, or set 
complement. 

a. {x @U|x GA and x €B} 

b. {x €U|x GA orx € B} 

c. {x © U|x GA and x ¢ B} 

d. {xE U|x ÉA} 


Complete the following sentences without using 
the symbols U, N, or —. 

a. x ÉA U B if, and only if, 
b. x ÆA N B if, and only if, 
c. x É A-B if, and only if, 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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10. 


Tl. 


12. 


13. 


14. 


15. 


16. 


17. 
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Let A = {1, 3,5, 7,9}, B = {3, 6, 9}, and 
C = {2, 4, 6, 8}. Find each of the following: 


a. AUB b. ANB c AUC d ANC 
e. A-B f. B=A g BUC h BAC 
Let the universal set be R, the set of all real 


numbers, and let A = {x E R| 0 <x < 2}, B = 
{x ER|1 <x <4}, andC={x EC R|3 Sx<9}. 
Find each of the following: 


a AUB bANB G AS d AUC 
e ANC f. BS g. A N B° 
h. ASUBS i. (AN BY j. (AUBY 


Let the universal set be R, the set of all real 
numbers, and let A = {x E R | -3 Sx < 0}, B= 
{x ER|-1<x<2},andC ={x E R|6<x 8}. 
Find each of the following: 


a AUB bANB « AS d AUC 
e ANC f. B° g. A N B° 
h. ASUBS i. (ANB) j. (AU BY 


Let S be the set of all strings of 0’s and I’s of 
length 4, and let A and B be the following sub- 
sets of S: A = {1110, 1111, 1000, 1001} and 

B = {1100, 0100, 1111, 0111}. Find each of the 
following: 
a ANB b., AUB 


c. A-B d. B-A 


In each of the following, draw a Venn diagram for 
sets A, B, and C that satisfy the given conditions. 
a ACB, CCB, ANC=Ø 

b. CCA, BAC=Ø 


In each of the following, draw a Venn diagram for 

sets A, B, and C that satisfy the given conditions. 

a ANB=Ø, ACC, CNB#D 

b. ACB, CCB, ANC#D 

c ANB Ø, BAC Ø, 
ANC=0,ACB, CB 


Let A = {a, b, c}, B = {b, c, d}, and C = {b, c, e}. 

a. Find A U (B N ©), (A U B) N C, and 
(A U B) AN (A U C). Which of these sets are 
equal? 

b. Find A N (B U C), (A A B) U C, and 
(A N B) U (A N C). Which of these sets are 
equal? 

c. Find (A — B) — C and A — (B — C). Are these 
sets equal? 


Consider the following Venn diagram. For each 
of (a)—(f), copy the diagram and shade the region 
corresponding to the indicated set. 
a ANB b. BUC 

d. A-(BUC) œe. (AUB) 


c. A‘ 
f. ASM BS 


18. 


19. 


21. 


22. 


23. 


389 


a. Is the number 0 in @? Why? 
b. Is @ = {Ø}? Why? 

c. Is ØE {GO}? Why? 

d. Is Ø E @? Why? 


Let A; = {i, ry for each integer i = 1, 2, 3, 4. 

a. A, U A, U A3 U A44 =? 

b. Ay NA,MA3N Ay =? 

c. Are Aj, Ay, A3, and Ay mutually disjoint? 
Explain. 


. Let B; = {x E R| 0 Sx < i} for each integer 


i = 1,2,3,4. 

a. Bı UB, U B} U B4 =? 

b. Bi AN B} A B; NA B=? 

c. Are B4, By, B3, and By, mutually disjoint? 
Explain. 


Let C; = {i, —i} for each nonnegative integer i. 


4 4 
a UC=? b. NCG =? 

i=0 i=0 
c. Are Co, Ci, Co, . . . mutually disjoint? Explain. 
d. Lc; =? e. NG =? 

i=0 i=0 


f. Lea g- NC =? 
i=0 i=0 


Let D; = {x E R|- i< x< i} = [—i, i] for each 
nonnegative integer i. 


4 
a. UD; =? 
i=0 


c. Are Do, Di, Do, ... mutually disjoint? Explain. 
d. UD; =? e. (NPD: =? 
i=0 i=0 


f. UD =? g- ND: =? 

i=0 i=0 
Let V; = {xER|—F =x = 3} = [~i 7] foreach 
positive integer i. 


4 
a | J¥;=? 
i=0 


4 
b. ()V;=? 


i=0 
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24. 


25. 


26. 


27. 


28. 
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c. Are Vi, V>, V3, úa 


d. LJV; =? 
i=0 


eee 
i=0 


. mutually disjoint? Explain. 


e. Av =? 
i=0 

g NV: =? 
i=0 


Let W; = {x E R | x > i} = (i, ©) for each non- 
negative integer i. 
4 


4 
a. LJW,=?2 b. QW: =? 
i=0 i 


c. Are Wo, Wi, Wa, ... mutually disjoint? Explain. 


n 


e. () W; =? 


i=0 i=0 
f; LW; = ? g- () W; =? 
i=0 i=0 


Let R; = fx E R|1 <x < 1+}} = [1, 1 ++] for 
each positive integer i. 


4 
a. UR; =? 

i=0 
c. Are Ri, Ro, R3, š 


d. UR, =? 
i=0 


f. UR = ? 
i=0 


4 
b. NR = ? 


i=0 


.. mutually disjoint? Explain. 
e. B =? 
i=0 
g- CR: =? 
i=0 


Let S; = {x ER] 1 <x<14+F}=(1, 144) for 
each positive integer i. 
4 


4 
a. US =? b. ()S:=? 


i=0 i=0 


c. Are Si, S2, $3,...mutually disjoint? Explain. 


d. US; =? e ANS =? 
i=0 i=0 

f. US; = 2 g- NS: = ? 
i=0 i=0 


a. Is {{a, d, e}, {b, c}, {d, f}} a partition of 
{a, b, c,d, e, f}? 
b. Is {{w, x, v}, {u, y, q}, {p, z}} a partition of 
{p, q, u, v, w, x, y, Z}? 
c. Is {{5, 4}, {7, 2}, {1, 3, 4}, {6, 8}} a partition of 
{1, 2, 3, 4, 5, 6, 7, 8}? 
d. Is {{3, 7, 8}, {2, 9}, {1, 4, 5}} a partition of 
{1, 2, 3, 4, 5, 6, 7, 8, 9}? 
e. Is {{1, 5}, {4, 7}, {2, 8, 6, 3}} a partition of 
{1, 2, 3, 4, 5, 6, 7, 8}? 


Let E be the set of all even integers and O the set 
of all odd integers. Is {E, O} a partition of Z, the 
set of all integers? Explain your answer. 
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29. Let R be the set of all real numbers. Is {R*, R7, {0} 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


a partition of R? Explain your answer. 
Let Z be the set of all integers and let 


Ao = {n E Z | n = 4k, for some integer k} 


A, = {n E Z|n = 4k+ 1, for some integer k} 

A, = {n E Z |n = 4k + 2, for some integer k} 
and 

A, = {n E Z | n = 4k + 3, for some integer k}. 


Is {Ap, Ay, A>, A3} a partition of Z? Explain your 
answer. 


Suppose A = {1, 2} and B = {2, 3}. Find each of 
the following: 

a. P(A N B) 

b. P(A) 

c. P(A U B) 

d. P(A X B) 


a. Suppose A = {1} and B = {u, v}. Find P(A X B). 
b. Suppose X = {a, b} and Y = {x, y}. Find 

P(X X Y). 
a. Find P(Ø). 
c. Find P(P(P (O). 
Let A, = {1}, A. = {u, v}, and A3 = {m, n}. Find 
each of the following sets: 
a. A, U (A, X A3) 
b. (A; U A2) X A3 
Let A = {a, b}, B = {1, 2}, and C = {2, 3}. Find 
each of the following sets. 


b. Find PP). 


a. AX (BUC) b. (AX B)U (A X ©) 
c. AX (BNC) d. (AX B)N(AXC) 
Trace the action of Algorithm 6.1.1 on the vari- 


ables i, j, found, and answer for m = 3,n = 3, 
and sets A and B represented as the arrays 


a[l] = u, a[2] = v, a[3] = w, b[1] = w, b[2] = u, 
and b[3] = v. 
Trace the action of Algorithm 6.1.1 on the vari- 


ables i, j, found, and answer for m = 4, n = 4, 
and sets A and B represented as the arrays 

a[l] = u, a[2] = v, a[3] = w, a[4] = x, DU] = r, 
b[2] = u, b[3] = y, b[4] = z. 

Write an algorithm to determine whether a 


given element x belongs to a given set that is repre- 
sented as the array a[1], a[2],..., a[n]. 
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ANSWERS FOR TEST YOURSELF 


1. the set A is a subset of the set B; for every x, if x E A then 5. xis in A and x is in B (Or: xis in both A and B) 6.xisinB 

x E B (Or: every element of A is also an element of B) and xis notin A 7.x is in the universal set and is not in A 

2. xis any [particular but arbitrarily chosen] element of X; x is 8. no elements 9. the set of all subsets of A 10. A AN B = Ø 
an element of Y 3. an element in X that is notin Y 4. xis in (Or: A and B have no elements in common) 11. A is the union 
A or xis in B (Or: x is in at least one of the sets A and B) of all the sets Aj, A2, A3, . . . and A; N A; = Ø whenever i # j 


FA Properties of Sets 


... only the last line is a genuine theorem here—everything else is in the fantasy. 
—Douglas Hofstadter, Gödel, Escher, Bach, 1979 


It is possible to list many relations involving unions, intersections, complements, and differ- 
ences of sets. Some of these are true for all sets, whereas others fail to hold in some cases. 
In this section we show how to establish basic set properties using element arguments, and 
we discuss a variation used to prove that a set is empty. In the next section we will show how 
to disprove a proposed set property by constructing a counterexample and how to use an al- 
gebraic technique to derive new set properties from set properties already known to be true. 

We begin by listing some set properties that involve subset relations. As you read them, 
keep in mind that the operations of union, intersection, and difference take precedence 
over set inclusion. Thus, for example, A N B C C means (A N B) CC. 


Theorem 6.2.1 Some Subset Relations 


1. Inclusion of Intersection: For all sets A and B, 


ayy~ANBCA and (K)ANBCB. 
2. Inclusion in Union: For all sets A and B, 
@QACAUB and (b|)BCAUB. 
3. Transitive Property of Subsets: For all sets A, B, and C, 
if A C B and B C C, then A CC. 


The conclusion of each part of Theorem 6.2.1 states that one set X is a subset of another 
set Y and so to prove them, you suppose that x is any [particular but arbitrarily chosen] ele- 
ment of X, and you show that x is an element of Y. 

In most proofs of set properties, the secret of getting from the assumption that x is in X 
to the conclusion that x is in Y is to think of the definitions of basic set operations in terms 
of how they act on elements, that is, in procedural terms. For example, the union of sets X 
and Y, X U Y, is defined as 


XUY={x|xEX or xEr}. 


This means that any time you know an element x is in X U Y, you can conclude that x must 
be in X or x must be in Y. Conversely, any time you know that a particular x is in some set X 
or is in some set Y, you can conclude that x is in X U Y. Thus, for any sets X and Y and any 
element x, 


xEXUY if,andonlyif, xEX or xEY. 
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Procedural versions of the definitions of the other set operations are derived similarly and 
are summarized below. 


Procedural Versions of Set Definitions 


Let X and Y be subsets of a universal set U and suppose x and y are elements of U. 
LxE€XUY & xEXorxEY 

2.xEXNY © xEX anddxEY 

3,.xE€X-Y & xEXandxéY 

4x€xX © xX 

5. x,y) EXXY & xEX and yEY 


Proving a Subset Relation 


Consider trying to prove Theorem 6.2.1(a): For all sets A and B, A N B C A. First notice 
that the statement is universal. It makes a claim about all sets A and B. So the proof has 
the following outline: 


Starting Point: Suppose A and B are any [particular but arbitrarily chosen] sets. 
To Show: AQBCA 


Now to prove A N B C A you must use the definition of subset. In other words, you must 
show that 


Vx, if x E A N B, then x € A. 
This statement is also universal, and to prove it you use an element argument: 
suppose x is any element in A N B 
and 
show that x is in A. 


You can fill in the gap between “suppose” and the “show” by using the procedural version 
of the definition of intersection along with your knowledge of logic and the definition of 
subset. Examples 6.2.1(a) and 6.2.1(b) show proofs for Theorem 6.2.1(1)(a) and Theorem 
6.2.1(2)(a). Each contains blanks to fill in with explanations or parts of proof steps. 


SCWJGLYAI Fill in the Blanks for Proofs of Subset Relations 
Fill in the blanks in the proofs shown below. 
a. Theorem 6.2.1(1)(a): For all sets A and B, AN B CA. 


Proof: 
Statement Explanation 
Suppose A and B are any sets. starting point 
We must show that AM BCA. conclusion to be shown 
Let x be any element in A N B. start of an element argument 
Then x is in A and x is in B. O 
In particular, x is in A. _ ii) | 
Thus every element in A N Bis in A. because x could be any element of A N B 
Therefore, A BCA. (iii) 
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b. Theorem 6.2.1(2)(a): For all sets A and B, A CA UB. 


Proof: 


Statement 


Explanation 


Suppose _@ . 


starting point 


We must show that A CA U B. 


conclusion to be shown 


(it) 


Then the following statement is true: 
“xis in A or x is in B?” 


start of an element argument 


For an or statement to be true only 
(iii) component needs to be true. 


Thus x is in A U B. 
Hence every element in A isin (i). 


Therefore, 0) 


by definition of union 


because x could be any element of A 


by definition of subset 


Solution 


a. (i) by definition of intersection (ii) If an and statement is true, then each individual 
component is true. (iii) by definition of subset 


b. (i) A and B are any sets. (ii) Let x be any element in A. (Or: Suppose x is any element 
inA.) (iii)one (vy) AUB (WACAUB E 


In his book Gödel, Escher, Bach,* Douglas Hofstadter introduces the fantasy rule for 
mathematical proof. Hofstadter points out that when you start a mathematical argument 
with if, let, or suppose, you are stepping into a fantasy world where not only are all the 
facts of the real world true but whatever you are supposing is also true. Once you are in 
that world, you can suppose something else. That sends you into a subfantasy world where 
not only is everything in the fantasy world true but also the new thing you are supposing. 
Of course, you can continue stepping into new subfantasy worlds in this way indefinitely. 
You return one level closer to the real world each time you derive a conclusion that makes 
a whole if-then or universal statement true. Your aim in a proof is to continue deriving 
such conclusions until you return to the world from which you made your first supposition. 

Occasionally, mathematical problems are stated in the following form: 


Suppose (statement 1). Prove that (statement 2). 


When this phrasing is used, the author intends the reader to add statement 1 to his or her 
general mathematical knowledge and not to make explicit reference to it in the proof. In 
Hofstadter’s terms, the author invites the reader to enter a fantasy world where statement 1 
is known to be true and to prove statement 2 in this fantasy world. Thus the solver of such 
a problem would begin a proof with the starting point for a proof of statement 2. Consider, 
for instance, the following restatement from Example 6.2.1(a): 


Suppose A and B are arbitrarily chosen sets. 
Prove that AM BCA. 


The proof would begin “Suppose x E A N B,” it being understood that sets A and B have 
already been chosen arbitrarily. 

The proofs in Example 6.2.1 are called element arguments because they show one set to 
be a subset of another by demonstrating that every element in the one set is also an element 
in the other. In higher mathematics, element arguments are the standard method for estab- 
lishing relations among sets. High-school students are often allowed to justify set properties 


*Gödel, Escher, Bach: An Eternal Golden Braid (New York: Basic Books, 1979). 
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by using Venn diagrams. This method is appealing, but for it to be mathematically rigorous 
may be more complicated than you might expect. Appropriate Venn diagrams can be drawn 
for two or three sets, but the verbal explanations needed to justify conclusions inferred from 
them are normally as long or longer than a straightforward element proof. 

In general, Venn diagrams are not very helpful when the number of sets is four or more. 
For instance, if the requirement is made that a Venn diagram must show every possible 
intersection of the sets, it is impossible to draw a symmetric Venn diagram for four sets, 
or, in fact, for any nonprime number of sets. In 2002, computer scientists/mathematicians 
Carla Savage and Jerrold Griggs and undergraduate student Charles Killian solved a long- 
standing open problem by proving that it is possible to draw such a symmetric Venn diagram 
for any prime number of sets. For n > 5, however, the resulting pictures are extremely com- 
plicated! However, the fact that such symmetric diagrams exist has applications in the area 
of computer science called coding theory. 


Set Identities 


An identity is an equation that is universally true for all elements in some set. For ex- 
ample, the equation a +b = b +a is an identity for real numbers because it is true for 
all real numbers a and b. The collection of set properties in the next theorem consists 
entirely of set identities. That is, they are equations that are true for all sets in some 
universal set. 


Theorem 6.2.2 Set Identities 


Let all sets referred to below be subsets of a universal set U. 


1. Commutative Laws: For all sets A and B, 
(a) AUB=BUA and b) ANB=BNA. 
2. Associative Laws: For all sets A, B, and C, 
(a) (AUB)UC=AU(BUC) and 
(b) AN B)NC=AN(BNC). 
3. Distributive Laws: For all sets A, B, and C, 
(a) AU(BNC)=(AUB)N(AUC) and 
b AN(BUC)=(ANB)U(ANC). 
4. Identity Laws: For every set A, 
MAUB=A aml (>) ANMW=A, 
5. Complement Laws: For every set A, 
@) AUAPS iW ail O AMA =e. 
6. Double Complement Law: For every set A, 
“oy =A, 
7. Idempotent Laws: For every set A, 


@ AUA=A and (b) ANA=A. 
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8. Universal Bound Laws: For every set A, 
(a) AUU=U and (b+) AN Ø= Ø. 
9. De Morgan’s Laws: For all sets A and B, 
(a) (AU B) = ASB and (b) (AN BY =A‘ UB’. 
10. Absorption Laws: For all sets A and B, 
(a) AU(ANB)=A and (b) AN(AUB)=A. 
11. Complements of U and Ø: 
@ U =ø ad b Ø =V. 
12. Set Difference Law: For all sets A and B, 
A-B=ANB*. 


The conclusion of each part of Theorem 6.2.2 is that one set equals another set. As we 
noted in Section 6.1, 


Two sets are equal < each isa subset of the other. 


The method derived from this fact is the most basic way to prove equality of sets. 


Basic Method for Proving That Sets Are Equal 
Let sets X and Y be given. To prove that X = Y: 
1. Prove that X C Y. 
2. Prove that Y C X. 


Sew Proof of a Distributive Law 


Consider trying to prove that for all sets A, B, and C, 
AU(BNC)=(AUBN(AUC). 


Solution The proof of this fact is somewhat more complicated than the proofs in Ex- 
ample 6.2.1, so we first derive its logical structure, then find the core arguments, and end 
with a formal proof as a summary. As in the subset relation examples, the statement to be 
proved is universal. Thus, by the method of generalizing from the generic particular, the 
proof has the following outline: 


Starting Point: Suppose A, B, and C are arbitrarily chosen sets. 
To Show: AU(BNC)=(AUB)N(AUC). 


Now two sets are equal if, and only if, each is a subset of the other. Hence, the following 
two statements must be proved: 


AU(BNC)C(AUB)N(AUC) 
and 
(AUB)N(AUC)CAU(BNC). 
Showing the first subset relation requires showing that 
Vx, ifxE AU(BMC)thenx € (AUB)N(AUC). 
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Showing the second containment requires showing that 
Vx, if x E (AUB)N(AUC)thenx EA U (BN C). 
Note that both of these statements are universal. So to prove the first containment, you 
suppose you have any element x in A U (B N C), 
and then you 
show that x E (AU B)NA (AU C). 
And to prove the second containment, you 
suppose you have any element x in (A U B) N (A U C), 
and then you 
show that x E A U (B N C). 


In Figure 6.2.1, the structure of the proof is illustrated by the kind of diagram that is often 
used in connection with structured programs. The analysis in the diagram reduces the proof to 
two concrete tasks: filling in the steps indicated by dots in the two center boxes of Figure 6.2.1. 


Suppose A, B, and C are sets. [Show A U (B N C) = (A U B) A (A U C). That is, 
show A U (B N C) C (A U B) N (AUC) and 
(AAUB)N(AUC)CAU(BNC).] 


Show AU(BNC)C(AUB)N(AUC). [That is, show Vx, if 
x €AU(BNC) then 
xE(AUB)N(AUC).] 


Suppose x E A U (B N C). [Show x E (A U B) AN (A U C).] 


Thus x E (A U B) N (A U O). 


Hence A U (BNC) C(AUB)N(AUC). 


Show (AU B)N(AUC)CAU(BNC). [That is, show Vx, if 
xE(AUB)N(AUC) 
thenx EAU (BN C).] 


Suppose x € (A U B) A (A U C). [Showx EA U (B N C).] 


Thus x E€ A U (B N C). 


Hence (A UB) N (AUC) CAU (BNC). 


Thus (A UB) N (AU C)=AU(BNC). 


FIGURE 6.2.1 
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The following proof shows the steps for filling in the two innermost boxes of Fig- 
ure 6.2.1. As you read it, notice how the procedural version of the definition of union is 
used. For example: 


If you know that x € B, then you can conclude that x E A U B 
because the statement “x E A or x E B” is true. 


Similarly: 


If you know that x E A, then you can conclude that x E A U (BM C) 
because the statement “x E A or x E B N C” is true. 


Also suppose you know that an element—say x—is in a union of two sets but you don’t 
know which set x is in. If you want to deduce a conclusion about x, you need to show that 
the conclusion follows regardless of which set x is in. So you need to break your argument 
into two cases: x is in the first set and x is in the second set. 

The proof has a few blanks for you to fill in as practice for writing set theory proofs on 
your own. To make the proof more concise, the symbols N and U are used in place of the 
words “intersection” and “union,” respectively. 


Theorem 6.2.2(3)(a) A Distributive Law for Sets 
For all sets A, B, and C, 


AUBNC)=A4VUBNAUC). 


Proof: Suppose A, B, and C are any sets. 

(1) Proof that A U(BNC)C(AUB)NAUC): 
Let x E AU (B N C). [We must show that x © @ 7] 
By definition of U, x € D orxE BNC. 


Case I (x E A): Since x € A, then both statements x E A U B and x E A U C are 
true by definition of U. Hence x € (A U B) N (A U C) by definition of N. 


Case 2 (x E B N C): Since x E B N C, then x E€ B and x E C by definition of N. 
Since x € B, then x E A UB by definition of U. Similarly, since x € C, then 
x E A U C by definition of U. Hence x € (A U B) N (A U C) by definition of N. 


Therefore, in both cases 1 and 2, x E (A U B) KON (AUC). 


Because x could be any element in A U (B N C), this argument shows that every 
element of AU (B N C)is in (A U B) N (A U C). Hence, 


AU(BNC)CAUB)N(AUC) 
by definition of ®©. 
(2) Proof that (A UB)N (AUC)CAU(BNC): 
Let x E (A U B) AN (A UC). [We must show that x E AU (B N C).] 
We consider the two cases: x € A and x € A 


Case 1 (x E A): In this case, because x is in A, we can conclude immediately that 
x E A U (B N C) by definition of U. 


(continued on page 398) 


*The reason to consider the two cases x € A and x ¢ A is that when x € (A U B) N (A U C), then, by defini- 
tion of N, x E (A U B) and x € (A U C). Now one way for this statement to be true is for x to be in A, but, 
since that may not be the case, the proof must also consider the possibility that x is not in A. 
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Case 2 (x € A): In this case, we know that x € (A U B) N (A U C). Thus, by defini- 
tion of ® ,xEAUBandxEAUC. 


Because x is in A U B, then x is in at least one of A or B, and since x is not in A, then 
xis in B. Similarly, because x is in A U C, then x is in at least one of A or C, and since 
xis not in A, then x is in C. 


It follows that x E B o) x E C, and, thus, x E B N C by definition of N. 
Since x € B N C, then by definition of ©, x € A U (B N C). 
Therefore, in both cases 1 and 2, x E A U (B N C). 


Because x could be any element in (A U B) N (A U C), this argument shows that every 
element of (A U B) N (A U C) is in A U (B N C). Hence, (A U B) N (AUC) ® 
A U (B N C). Thus, (A U B) N (A U C) ®_A U (BN C) by definition of subset. 


(3) Conclusion: Since both subset relations have been proved, it follows, by defini- 
tion of set equality, that (a) 


Solution 

dd) a (AUB)NAUC) b. A Cy d. subset 
Da N b. and c. U d. C 

3) a AU(BNC)=(AUB)NAUC) 


In the study of artificial intelligence, the types of reasoning used previously to derive 
the proof of the distributive law are called forward chaining and backward chaining. First 
what is to be shown is viewed as a goal to be reached starting from a certain initial posi- 
tion: the starting point. Analysis of this goal leads to the realization that if a certain job is 
accomplished, then the goal will be reached. Call this job subgoal 1: SG. (For instance, if 
the goal is to show that A U (BM C) = (A U B) N (A U C), then SG, would be to show 
that each set is a subset of the other.) Analysis of SG, shows that when yet another job is 
completed, SG, will be reached. Call this job subgoal 2: SG. Continuing in this way, a 
chain of argument leading backward from the goal is constructed. 


starting point — SG; — SG, —> SG, — | goal 


At a certain point, backward chaining becomes difficult, but analysis of the current 
subgoal suggests it may be reachable by a direct line of argument, or, in other words, by 
forward chaining, beginning at the starting point. Using the information contained in the 
starting point, another piece of information, /,, is deduced; from that another piece of 
information, J, is deduced; and so forth until finally one of the subgoals is reached. This 
completes the chain and proves the theorem. A completed chain is illustrated below. 


hhh by > SG, SG, — $6, — [aT] 


Since set complement is defined in terms of not, and since unions and intersections are 
defined in terms of or and and, it is not surprising that there are analogues of De Morgan’s 
laws of logic for sets. 


Selsey Proof of a De Morgan’s Law for Sets 
Prove that for all sets A and B, (A U BY = ASN B®. 
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Solution As in previous examples, the statement to be proved is universal, and so the 
starting point of the proof and the conclusion to be shown are as follows: 


Starting Point: Suppose A and B are arbitrarily chosen sets. 
To Show: (AUB) =ASN BS 


To do this, you must prove both that (A U B)° C A® N B° and that AN BS C (A U BY. To 
prove the first subset relation means to show that 


Vx, if x E (A U BY then x E ASN B®. 


And to prove the second subset relation means to show that 


Vx, if x E A® N B“ then x € (A U BY. 
Since each of these statements is universal and conditional, for the first subset relation, you 
suppose x € (A U BY, 
and then you 
show that x E€ A® N B*. 
And for the second subset relation, you 
suppose x E A“ N B°, 
and then you 


show that x € (A U BY. 


To fill in the steps of these arguments, you use the procedural versions of the definitions 
of complement, union, and intersection, and at crucial points you use De Morgan’s laws 
of logic. 


Theorem 6.2.2(9)(a) A De Morgan’s Law for Sets 
For all sets A and B, (A U B} = ASN BY. 


Proof: Suppose A and B are sets. 


Proof that (A U BY C A‘ N B‘: 
[We must show that Yx, if x E (A U BY then x E ASN BS] 


Suppose x € (A U B)°. [We must show that x E A° N B°.] By definition of complement, 
xEAUB. 


Now to say that x É A U B means that 
it is false that (x is in A or x is in B). 
By De Morgan’s laws of logic, this implies that 


xis notin A and x is not in B, 


which can be written 


xA and xE€B. 


(continued on page 400) 
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Hence x € A‘ and x € B° by definition of complement. It follows, by definition of 
intersection, that x E€ A® N B“ [as was to be shown]. So (A U B) C A* N B° by defi- 
nition of subset. 


Proof that A‘ N BS C (A U BY: 

[We must show that Yx, if x E ASM BS then x E (A U B)*.] 

Suppose x € A® N B®. [We must show that x E (A U B)*.] By definition of intersec- 
tion, x E A‘ and x € B‘*, and by definition of complement, 


xA and xE€B. 
In other words, 
xis notin A and x is not in B. 
By De Morgan’s laws of logic this implies that 
it is false that (x is in A or x is in B), 
which can be written 
xE€AUB 
by definition of union. Hence, by definition of complement, x E€ (A U BY [as was to 
be shown]. It follows that AS N B® C (A U BY by definition of subset. 


Conclusion: Since both set containments have been proved, (A U B) = ASM B° by 
definition of set equality. 


The set property given in the next theorem says that if one set is a subset of another, then 
their intersection is the smaller of the two sets and their union is the larger of the two sets. 


Theorem 6.2.3 Intersection and Union with a Subset 
For any sets A and B, if A C B, then 
(aa~aANB=A and (b)AUB=B. 


Proof: 
Part (a): Suppose A and B are sets with A C B. To show part (a) we must show both 
that AM BCA and that A C A N B. We already know that A N B C A by the in- 
clusion of intersection property. To show that A C A N B, let x be any element in A. 
[We must show that x is in A N B.] But, because of the hypothesis that A C B, we can 
conclude that x is also in B by definition of subset. Hence 

xEA and xEB, 
and thus 

xEANB 


by definition of intersection /as was to be shown]. 


Proof: 
Part (b): The proof of part (b) is left as an exercise. 
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The Empty Set 


In Section 6.1 we introduced the concept of a set with no elements and promised that in this 
section we would show that there is only one such set. To do so, we start with the most basic 
property of a set with no elements: It is a subset of every set. To see why this is true, just 
ask yourself, “Could it possibly be false? Could there be a set without elements that is not 
a subset of some given set?” The crucial fact is that the negation of a universal statement is 
existential: If a set B is not a subset of a set A, then there exists an element in B that is not 
in A. But if B has no elements, then no such element can exist. 


Theorem 6.2.4 A Set with No Elements Is a Subset of Every Set 


If E is a set with no elements and A is any set, then E C A. 


Proof (by contradiction): Suppose not. [We take the negation of the theorem and 
suppose it to be true.] Suppose there exists a set E with no elements and a set A such 
that E É A. [We must deduce a contradiction.] Then there would be an element of E 
that is not an element of A [by definition of subset]. But there can be no such element 
since E has no elements. This is a contradiction. [Hence the supposition that there are 
sets E and A, where E has no elements and E ¢ A, is false, and so the theorem is true.] 


The truth of Theorem 6.2.4 can also be understood by appeal to the notion of vacuous 
truth. If Z is a set with no elements and A is any set, then to say that E C A is the same as 
saying that 

Vx, if x E E, then x E A. 


But since E has no elements, this conditional statement is vacuously true. 


How many sets with no elements are there? Only one. 


Corollary 6.2.5 Uniqueness of the Empty Set 


There is only one set with no elements. 


Proof: Suppose E, and E; are both sets with no elements. By Theorem 6.2.4, E; C Ey 
since E, has no elements. Also Æ, C E; since E, has no elements. Thus E£; = E, by 
definition of set equality. 


It follows from Corollary 6.2.5 that the set of all the pink elephants on earth is equal to the 
set of all the real numbers whose square is — 1 because neither set has any elements! Since 
there is only one set with no elements, we are justified in calling it by a special name, the 
empty set (or null set) and in denoting it by the special symbol Ø. 

Note that whereas Ø is the set with no elements, the set {@} has one element, the empty 
set. This is similar to the convention in the computer programming languages Lisp and 
Scheme, in which ( ) denotes the empty list and (( )) denotes the list whose one element is 
the empty list. 

Suppose you need to show that a certain set equals the empty set. By Corollary 6.2.5 
it suffices to show that the set has no elements. For since there is only one set with no ele- 
ments (namely ), if the given set has no elements, then it must equal Ø. 
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Element Method for Proving a Set Equals the Empty Set 


To prove that a set X is equal to the empty set Ø, prove that X has no elements. To 
do this, suppose X has an element and derive a contradiction. 


Seu wee Proving That a Set Is Empty 
Prove Theorem 6.2.2(8)(b). That is, prove that for any set A, A N Ø = Ø. 


Solution Let A be a [particular, but arbitrarily chosen] set. To show that A N Ø = Ø, it 
suffices to show that A N Ø has no elements [by the element method for proving a set equals 
the empty set]. Suppose not. That is, suppose there is at least one element—say x—such that 
x E A N Ø. Then, by definition of intersection, x E A and x € ©. In particular, x E Ø. 
But this is impossible since Ø has no elements. [This contradiction shows that the suppo- 
sition that there is an element x in AN © is false. So A N © has no elements, as was to be 


shown.] Thus AN Ø = Ø. | 


Se wey A Proof for a Conditional Statement 
Prove that for all sets A, B, and C, if A C B and B C C“, then A N C = Ø. 


Solution Because the statement to be proved is both universal and conditional, you start 
with the method of direct proof: 


Suppose A, B, and C are arbitrarily chosen sets 
that satisfy the condition: A C B and B C CÀ. 
Show that A N C = Ø. 


Since the conclusion to be shown is that a certain set is empty, you can use the principle for 
proving that a set equals the empty set. A complete proof is shown below. 


Proposition 6.2.6 
For all sets A, B and C,if A C BandBCC‘, thn ANC=2@. 


Proof: Suppose A, B, and C are any sets such that A C B and B C C*. We must show 
that A N C = ©. Suppose not. That is, suppose there is an element x in A N C. By 
definition of intersection, x E A and x E C. Then, since A C B, x E B by definition 
of subset. Also, since B C C*%, then x E C% by definition of subset again. It follows 
by definition of complement that x € C. Thus x E C and x € C, which is a con- 
tradiction. So the supposition that there is an element x in A M C is false, and thus 
AMC = © [as was to be shown]. 


Seu veey A Generalized Distributive Law 


Prove that for all sets A and B}, B2, B3,..., Ba, where n is a positive integer, 


i i=1 
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Solution Compare this proof to the one given in Example 6.2.2. Although the notation 
is more complex, the basic ideas are the same. 


Proof: Suppose A and B}, Bo, B3,..., B, are any sets and n is a positive integer. 


Part 1, Proof that A U a) c ANA UB): 
i i=l 


i=1 


Suppose x is any element in A U N B) [We must show that x is in (\(A U B;).] 
i=1 i= 


i=l 


By definition of union, x E A or x E (NB; 
i=1 


Case 1, x E A: In this case, it is true by definition of union that for every integer i = 1, 
2,...,n, x EA U B;. Hence x € NA U B). 
i=1 


Case 2, x E (NB: In this case, by definition of the general intersection, we have that for 
i=l 
every integer i = 1,2,...,n,x © Bi. Hence, by definition of union, for every integer i = 1, 
n 


2,...,n,x EA U B; and so, by definition of general intersection, x © N (A U B)). 


i=1 
n 


Thus, in either case, x E (N (A U B) [as was to be shown]. 


i=1 


Part 2, Proof that (N) (A U B;) CA U fala; 
i=l i=l 


Suppose x is any element in (N (A U B;) [We must show that x is in A U ()B;.] 


i=1 i=1 
By definition of intersection, x E A U B; for every integer i = 1, 2,...,n. Either x E A or 
xEA. 
Case 1, x E A: In this case, x E A U (Az) by definition of union. 
i=l 
Case 2, x É A: By definition of intersection, x E A U B; for every integer i = 1,2,...,7. 
Since x É A, x must be in each B; for every integer i = 1, 2, .. . , n. Hence, by definition of 


intersection, x E ( )B;, and so, by definition of union, x E A U fa ai} 
i=1 i=1 


Conclusion: Since both set containments have been proved, it follows by definition of set 


equality that A U (Az) = NA U B). E 
i=1 i=1 
TEST YOURSELF 
1. To prove that a set X is a subset of a set A N B, 3. To prove that a set A U B is a subset of a set X, 
you suppose that x is any element of X and you you start with any element x in A U B and con- 
show that x E A xEB. sider the two cases and . You then 


; show that in either case —___ 
2. To prove that a set X is a subset of a set A U B, 


you suppose that x is any element of X and you 4. To prove that a set A N B is a subset of a set X, 
show that x E A xEB. you suppose that and you show that. 
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5. To prove that a set X equals a set Y, you prove that 6. To prove that a set X does not equal a set Y, you 
and that : need to find an element that is in and not 
—— or that is in and not —___ 
EXERCISE SET 6.2 
1. a. To say that an element is in A N (B U C) Then x is in B. @ 
means that it is in and in It follows that xis in C. | (b) 


b. To say that an element is in (A N B) UC 


means thatitisin ©. orin 2. Thus every element in A since x could be any 


c. To say that an element is in A — (B N C) is in C. element of A 
means that it is in © and not in_@_. Therefore, A C C [as was | (© 

d. To prove that (A UB) NCCAU(BN OC), to be shown]. — 
we suppose that x is any element in_“_. Then 
we must show that _@)_. 4. Theorem: For all sets A and B, if A C B, then 

e. If A, B, and C are any sets such that B C C, to AUBCB. 
prove that A N B C A N C, we suppose that Proof: 
x is any element in_“)_. Then we must show 
that _@ . Statement Explanation 

2. The following are two proofs that for all sets A Suppose A, B, and C starting point 

and B, A — B CA. The first is less formal, and the are any sets such that 

second is more formal. Fill in the blanks. ACB. 

a. Proof: Suppose A and B are any sets. To show We dust show that concusiondo be howi 
that A — B C A, we must show that every ele- AUBCB 
ment in “is in . But any element in 
A-Bisin_@)_ and not in__ (by definition Let x be any element start of an element proof 
of A — B). In particular, such an element is in A. inA UB. 

b. Proof: Suppose A and B are any sets and Then xis in A or xis (a) 
x E A—B. [We must show that _] By defini- in B. DE 


tion of set difference, x € _2) andx ¢ ©. 
In particular, x € _® [which is what was to be 


In case x is in A, then (b) 


shown]. xisin B. 
In 3 and 4, supply explanations of the steps in the given In case x is in B, then | tautology (p — p) 
proofs. xis in B. 
3. Theorem: For all sets A, B, and C, if So in either case xis | proof by division into 
ACB,BCC,thenA CC. in B. cases 
Proof: Thus every element in since x could be any ele- 
Statement Explanation oes. mato A UA 


Therefore, A U B C B (c) 


[as was to be shown]. 


Suppose A, B, and C are starting point 
any sets such that A C B 


and B C C. 5. Prove that for all sets A and B, (B — A) = B N AS. 

We must show that conclusion to be shown H 6. Let N and U stand for the words “intersection” 

AGC. and “union,” respectively. Fill in the blanks in 
Let x be any element start of an element the following proof that for all sets A, B, and C, 
ae ane: AN(BUC)=(ANB)U(NC). 
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Proof: Suppose A, B, and C are any sets. 
(D ProofthatA 1 (BUC)C(ANB)UANC): 
Letx E A N (B U C). [We must show that x © _@)), 


By definition of N, x € ©) andx EBU C. 


Thus x € A and, by definition of U, x E B or KON 
Case 1 (x E A and x € B): In this case, x E A N B 
by definition of N. 

Case 2 (x E A and x € C): In this case,x E A N C 
by definition of N. 

By cases 1 and 2, x E A N Bor x E A N C, and so, 
by definition of U, ® 

[So A N (B U C) C (A N B) U (A N C) by definition 
of subset.] 

(2)Proofthat(A 1 B) U(ANC)CAN(BUC): 
Let x E (A N B) U (A N C). [We must show that 
xEAN(BUC)] 

By definition of Ọ, x EAN B@_xE ANC. 
Case 1 (x E A AN B): In this case, by definition of 
N, x EA and x E B. 

Since x E B, then x E€ B U C by definition of U. 
Case 2 (x E A N C): In this case, by definition of 
N, xEA (D) y EC. 

Since x E C, then x E€ B U C by definition of U. 
In both cases x E A and x E BUC, and so, by 
definition of N, © 

[So (AN B)U(ANC)CAN (BU C) by defi- 
nition of @) 7 


(3) Conclusion: [Since both subset relations have 
been proved, it follows, by definition of set equality, 
that (a).] 


Use an element argument to prove each statement in 7-22. 
Assume that all sets are subsets of a universal set U. 


H 7. 
8. 


H9. 


10. 


Tl. 


For all sets A and B, (A N BY = AS U BS. 


For all sets A and B, (AM B) U (A A B9 =A. 
(This property is used in Section 9.9.) 


For all sets A, B, and C, 

(A-B) U (C-B) = (A U C)-B. 
For all sets A, B, and C, 
AUB)NCCAU(BNC). 


For all sets A, B, and C, 
AN(B-C)C(ANB)-(ANC). 


12. 


H 19. 


21. 


22. 


23. 


H 24. 


25. 
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For all sets A, B, and C, 
(AU B)—-CC(A-C) U(B-C). 


. For all sets A, B, and C, 


(A-B) N (C-B) =(ANC)-B. 


. For all sets A and B, A U (A N B) =A. 
. For every set A, A U Ø =A. 
. For all sets A, B, and C, if 


ACBthnANCCBNC. 


. For all sets A, B, and C, if A C B then 


AUCCBUC. 


. For all sets A and B, if A C B then B° C A“. 


For all sets A, B, and C, if A C Band A C C then 
ACBNC. 


. For all sets A, B, and C, if A C C and B C C then 


AUBCC. 
For all sets A, B, and C, 
AX(BUC)=(AX B)U(AXC). 


For all sets A, B, and C, 
AX(BNC)=(AXB)N(AXC). 


Find the mistake in the following “proof” that 
for all sets A, B, and C, if A C Band B C C then 
ACC. 


“Proof: Suppose A, B, and C are any sets such that 
AC Band BCC. Since A C B, there is an element 
x such that x EA and x E€ B, and since B C C, 
there is an element x such that x © B and x E C. 
Hence there is an element x such that x E A and 
x€CandsoA CC.” 


Find the mistake in the following “proof.” 


“Theorem:” For all sets A and B, AS UBS C 
(A U BY. 


“Proof: Suppose A and B are any sets, and 
x E A° U B®. Then x E A° or x E B° by definition 
of union. It follows that xA or x EB by 
definition of complement, and so x AUB 
by definition of union. Thus x€ (A U BY 
by definition of complement, and hence 
ASU BS C (A U BY.” 


Find the mistake in the following “proof” that for 
all sets A and B, (A — B) U (A N B) CA. 
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“Proof: Suppose A and B are any sets, and suppose 
x E(A—B)U (AN B). If x EA then x CA-B, 
and so, by definition of difference, x G&A and 
x€B. In particular, x€A, and, therefore, 
(A — B) U (A N B) CA by definition of subset.” 


26. Consider the Venn diagram below. 


U 


a. Illustrate one of the distributive laws by 
shading in the region corresponding to 
A U (B N C) on one copy of the diagram and 
(A U B) A (A U C) on another. 

b. Illustrate the other distributive law by 
shading in the region corresponding to 
A N (B U C) on one copy of the diagram and 
(A N B) U (AN C) on another. 

c. Illustrate one of De Morgan’s laws by shading 
in the region corresponding to (A U B)* on one 
copy of the diagram and A° N B® on the other. 
(Leave the set C out of your diagrams.) 

d. Illustrate the other De Morgan’s law by shad- 
ing in the region corresponding to (A N B)* on 
one copy of the diagram and A° U B] on the 
other. (Leave the set C out of your diagrams.) 


27. Fill in the blanks in the following proof that for all 
sets A and B, (A — B) N (B — A) = Ø. 
Proof: Let A and B be any sets and suppose 
(A—-B)AN (B-A) + ©. That is, suppose there 
is an element x in ®. By definition of (b) 4 
x€A-—B and x €_©. Then by definition of 
set difference, x E A and x € B and x E€ KON and 
xE © In particular x € A and x € (f) , which 
is a contradiction. Hence [the supposition that 
(A-B) N (B—A) # Ø is false, and so] £8). 


Use the element method for proving a set equals the 
empty set to prove each statement in 28-38. Assume that 
all sets are subsets of a universal set U. 


28. For all sets A and B, (AN B) N (ANB) = Ø. 
(This property is used in Section 9.9.) 
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29. 


30. 


31. 
32. 
33. 


34. 


35. 


36. 


37. 


38. 


For all sets A, B, and C, 
(A—C)N(B-C)N(A-B) = Ø. 

For every subset A of a universal set U, 

ANAS = Ø. 

If U denotes a universal set, then US = Ø. 

For every set A, A X Ø = ©. 

For all sets A and B, if A C B then A AN BS = Ø. 

For all sets A and B, if B C A“ then A N B = Ø. 


For all sets A, B, and C, if A C B and B N C = Ø 
then A N C = Ø. 


For all sets A, B, and C, if C C B — A, then 
ANC=Ø. 


For all sets A, B, and C, 
if B N C CA, then (C— A) N (B — A) = Ø. 


For all sets A, B, C, and D, 
if A N C = Ø then (A X B) N (C X D) = Ø. 


Prove each statement in 39-44. 


H 39. 


40. 


H 4. 


42. 


For all sets A and B, 

a. (A-B) U(B-A) U(ANB)=AUB 

b. The sets (A — B), (B — A), and (A N B) are 
mutually disjoint. 


For every positive integer n, if A and By, B2, B3,... 
are any sets, then 


i i=1 


For every positive integer n, if Aj, A2, A3,... and 
B are any sets, then 


For every positive integer n, if A4, A2, A3, . . . and 
B are any sets, then 


ha-m -=(Aa)-z 
i=l 


i=1 
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43. For every positive integer n, if A and By, B2, B3,... 44. For every positive integer n, if A and By, B2, B3,... 


are any sets, then 


i=1 


ANSWERS FOR TEST YOURSELF 


are any sets, then 


Ua xay=ax( Us) Quaxay=ax( Aa). 
i=1 


i=] i=1 


l.and 2.or 3.xE&A;xE€B;xEX AandB);xE€X S5.XCY;YCX 
4.x EA NB (Or: x is an element of both 6. X; in Y; Y; in X 


[Æ] Disproofs and Algebraic Proofs 


Example 6.3.1 


Ifa fact goes against common sense, and we are nevertheless compelled to 
accept and deal with this fact, we learn to alter our notion of common sense. 
— Phillip J. Davis and Reuben Hersh, The Mathematical Experience, 1981 


In Section 6.2 we gave examples only of set properties that were true. Occasionally, however, 
a proposed set property is false. We begin this section by discussing how to disprove such 
a proposed property. Then we prove an important theorem about the power set of a set and 
go on to discuss an “algebraic” method for deriving new set properties from set properties 
already known to be true. We finish the section with an introduction to Boolean algebras. 


Disproving an Alleged Set Property 


Recall that to show a universal statement is false, it suffices to find one example (called a 
counterexample) for which it is false. 


Finding a Counterexample for a Set Identity 
Is the following set property true? 


For all sets A, B, and C, (A — B) U(B-C) = A-C. 


Solution Observe that the property is true if, and only if, 
the given equality holds for all sets A, B, and C. 
So it is false if, and only if, 
there are sets A, B, and C for which the equality does not hold. 


One way to solve this problem is to picture sets A, B, and C by drawing a Venn diagram 
such as that shown in Figure 6.3.1 on the next page. If you assume that any of the eight 
regions of the diagram may be empty of points, then the diagram is quite general. 

Find and shade the region corresponding to (A — B) U (B — C). Then shade the region 
corresponding to A — C. These are shown in Figure 6.3.2 on the next page. 

Comparing the shaded regions seems to indicate that the property is false. For in- 
stance, if there is an element in B that is not in either A or C then this element would be in 
(A — B) U (B — C) (because of being in B and not C), but it would not be in A — C since 
A — C contains nothing outside A. Similarly, an element that is in both A and C but not B 
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FIGURE 6.3.1 


would be in (A — B) U (B — C) (because of being in A and not B), but it would not be in 
A — C (because of being in both A and C). 


U U 


FIGURE 6.3.2 


Construct a concrete counterexample in order to confirm your answer and make sure 
that you did not make a mistake either in drawing or analyzing your diagrams. One way is 
to put one of the integers from 1 through 7 into each of the seven subregions enclosed by 
the circles representing A, B, and C. If the proposed set property had involved set comple- 
ments, it would also be helpful to label the region outside the circles, and so we place the 
number 8 there. (See Figure 6.3.3.) Then define discrete sets A, B, and C to consist of all 
the numbers in their respective subregions. 


FIGURE 6.3.3 


Counterexample 1: Let A = {1, 2,4, 5}, B = {2, 3,5, 6}, and C = {4, 5, 6, 7}. 
Then 


A-B= {1,4}, B—C= {2,3}, and A—C= {1,2}. 


Hence 
(A-B) U (B-C) = {1, 4} U {2, 3} = {1,2,3,4}, whereas A-—C ={l, 2}. 
Since {1, 2, 3,4} # {1, 2}, we have that (A — B) U (B-C) #A-C. 
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A more economical counterexample can be obtained by observing that as long as the 
set B contains an element, such as 3, that is not in A, then regardless of whether B con- 
tains any other elements and regardless of whether A and C contain any elements at all, 
(A-B)U(B-C)#A-C. 


Counterexample 2: Let A = Ø, B = {3}, and C = Ø. Then 
A-B=6, B-C={3}, and A-C=2@. 


Hence (A-B) U (B-C) = Ø U {3} = {3}, whereas A-C = Ø. 
Since {3} # Ø, we have that (A — B) U (B— C) + A-C. 


Note Check that when Another economical counterexample requires only that A = C = a singleton set, such 
A=C={4}andB=©, as {4}, while B is the empty set. E 
(A—B) U(B-C) #A-C. 


Problem-Solving Strategy 


How can you discover whether a given universal statement about sets is true or false? There 
are two basic approaches. Either you plunge in and start trying to prove the statement, ask- 
ing yourself, “What do I need to show?” and “How do I show it?” or you try to find a set 
of conditions that must be fulfilled to construct a counterexample. With either approach 
you may have immediate success or you may run into difficulty. The trick is to be ready 
to switch to the other approach if the one you are working on does not look promising. 
For more difficult questions, you may alternate several times between the two approaches 
before arriving at the correct answer. 


The Number of Subsets of a Set 


The following theorem states the important fact that if a set has n elements then its power 
set has 2” elements. The proof uses mathematical induction and is based on the following 
observations. Suppose X is a set and z is an element of X. 


1. The subsets of X can be split into two groups: those that do not contain z and those that 
do contain z. 


2. The subsets of X that do not contain z are the same as the subsets of X — {z}. 


3. The subsets of X that do not contain z can be matched up one for one with the subsets of X 
that do contain z by matching each subset A that does not contain z to the subset A U {z} 
that contains z. Thus there are as many subsets of X that contain z as there are subsets of 
X that do not contain z. For instance, if X = {x, y, z}, the following table shows the cor- 
respondence between subsets of X that do not contain z and subsets of X that contain z. 


Subsets of X That Subsets of X That 
Do Not Contain z Contain z 
Ø > ØU {z} = {z} 
{x} < {x} U {z} = {xz} 
{y} < Wye = {yz} 
{x, y} < {x,y} U {z} = {x,y,z} 
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Theorem 6.3.1 


For every integer n = 0, if a set X has n elements, then P(X) has 2” elements. 


Proof (by mathematical induction): Let the property P(n) be the sentence 


Any set with n elements has 2” subsets. < P(n) 


Show that P(O) is true: 
To establish P(O), we must show that 


Any set with 0 elements has 2° subsets. < PO) 


Now the only set with zero elements is the empty set, and the only subset of the 
empty set is itself. Thus a set with zero elements has one subset. Since | = 2°, we 
have that P(0) is true. 


Show that for every integer k = 0, if P(k) is true then P(k +1) is also true: 
[Suppose that P(k) is true for a particular but arbitrarily chosen integer k = 0. That is:] 
Suppose that k is any integer with k = O such that 


Any set with k elements has 2* subsets. < P(k) inductive hypothesis 


[We must show that P(k + 1) is true. That is:] We must show that 


okt 


Any set with k + 1 elements has subsets. <—P(k+1) 


Let X be a set with k+ 1 elements. Since k+ 1 = 1, we may pick an element 
z in X. Observe that any subset of X either contains z or does not. Furthermore, 
any subset of X that does not contain z is a subset of X — {z}. And any subset A of 
X — {z} can be matched up with a subset B, equal to A U {z}, of X that contains z. 
Consequently, there are as many subsets of X that contain z as do not, and thus there 
are twice as many subsets of X as there are subsets of X — {z}. It follows that since 
X — {z} has k elements, then, by inductive hypothesis, 


the number of subsets of X — {z} = oa 
Therefore, 


the number of subsets of X = 2- (the number of subsets of X — {z}) 


= 2: 25 by substitution 
= JEF by basic algebra. 

[This is what was to be shown.] 

[Since we have proved both the basis step and the inductive step, we conclude that the 
theorem is true.] 


“Algebraic” Proofs of Set Identities 


Let U be a universal set and consider the power set of U, P(U). The set identities given 
in Theorem 6.2.2 hold for all elements of P(U). Once a certain number of identities and 
other properties have been established, new properties can be derived from them algebra- 
ically without having to use element method arguments. It turns out that only identities 
(1-5) of Theorem 6.2.2 are needed to prove any other identity involving only unions, 
intersections, and complements. With the addition of identity (12), the set difference law, 
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any set identity involving unions, intersections, complements, and set differences can be 
established. 

To use known properties to derive new ones, you need to use the fact that such proper- 
ties are universal statements. Like the laws of algebra for real numbers, they apply to a 
wide variety of different situations. Assume that all sets are subsets of P(U), then, for 
instance, one of the distributive laws states that 


forall sets A, B,andC, AN(BUC)=(ANB)U(ANC). 


This law can be viewed as a general template into which any three particular sets can be 
placed. Thus, for example, if A4, A2, and A; represent particular sets, then 


A, N (Ap U A3) = (Ay N Az) U (A, As), 
i oe a Ww oe Ww ee 
AN(BUQ= ANB UANYC 


where A, takes the place of A, A, takes the place of B, and A; takes the place of C. Simi- 
larly, if W, X, Y, and Z are any particular sets, then, by the distributive law, 


(WOAX)NVYUZ)=(WAXNY)IUC(WNX)NZ), 


t t l T J t t 
A N BUC)= (A NB) U {fA np 


where W N X takes the place of A, Y takes the place of B, and Z takes the place of C. 


Deriving a Set Difference Property 
Construct an algebraic proof that for all sets A, B, and C, 
(AU B)-—C=(A-C)U(B-C). 
Cite a property from Theorem 6.2.2 for each step of the proof. 
Solution Let A, B, and C be any sets. Then 
(AU B)-C=(AUB)NC by the set difference law 
=C°N(AUB) by the commutative law for N 
=(C° NA) U (C° N B) by the distributive law 
= (AMC) U (B N C°) by the commutative law for N 
= (A-—C)U(B-C) by the set difference law. a 


Deriving a Set Identity Using Properties of Ø 
Construct an algebraic proof that for all sets A and B, 
A-(AN B)=A-B. 
Cite a property from Theorem 6.2.2 for every step of the proof. 


Solution Suppose A and B are any sets. Then 


A-(ANB)=AN(AN BY) by the set difference law 
=A N (AS UB‘) by De Morgan’s law 
= (ANA) U(ANB’) _ by the distributive law 
= ØU (AN B®’) by the complement law 
=(ANB)U®D by the commutative law for U 
=AN BS by the identity law for U 
=A-B by the set difference law. | 
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To many people an algebraic proof seems simpler than an element proof, but often 
an element proof is actually easier to understand. For instance, in Example 6.3.3 above, 
you could see immediately that A — (A N B) = A —B because for an element to be in 
A-— (A N B) means that it is in A and not in both A and B, and this is equivalent to saying 
that it is in A and not in B. 


eH aeeg Deriving a Generalized Associative Law 
Prove that for any sets A,, A>, A3, and Ag, 


AÀ ((A; U A2) U A3) U Ay = Ay U ((42 U A3) U Ay). 
Caution! When do- Cite a property from Theorem 6.2.2 for every step of the proof. 
ing problems similar to Solution Let A, A>, A3, and A, be any sets. Then 
Examples 6.3.2—6.3.4, be 
sure to use the set proper- 
ties exactly as they are 
stated in Theorem 6.2.2. 


((A; U Ax) U A3) U Ag = (Ay U (Az U A3)) U Ag by the associative law for U with A, 
taking the place of A, A, taking the place 
of B, and A; taking the place of C 


= Ay U ((A2 U A3) U A4) by the associative law for U with A, 
taking the place of A, Aj U A; taking the 

place of B, and A; taking the place of C. 
| 


TEST YOURSELF 


1. Given a proposed set identity involving set vari- 2. When using the algebraic method for proving a set 
ables A, B, and C, the most common way to show identity, it is important to for every step. 
that the equation does not hold in general is to find 
concrete sets A, B, and C that, when substituted for 
the set variables in the equation, ____ 


3. When applying a property from Theorem 6.2.2, it 
must be used as it is stated. 


EXERCISE SET 6.3 


For each of 1-4 find a counterexample to show that the 6. For all sets A and B, AM (A U B) =A. 
statement is false. Assume all sets are subsets of a univer- 
slet U. 7. For all sets A, B, and C, 
1. For all sets A, B, and C, (A=B) N (C=B)=A-(B U C). 
(AUB)NC=AU(BNC). 8. For all sets A and B, if A° C B then A UB = U. 
2. For all sets A and B, (A U BY = AS U BY. 9. For all sets A, B, and C,if A C C and B C C then 
AUBCC. 
3. For all sets A, B, and C, if A É B and B © C then > 
ACC. 10. For all sets A and B, if A C B then A AN Bo = Ø. 
4. For all sets A, B, and C, if BU C C A then H 11. For all sets A, B, and C, if A C B then 
AN(BNC)Y = Ø. 
(A-B)N(A-O)=@. aire 
For each of 5-21 prove each statement that is true and H 12. For all sets A, B, and C, 
find a counterexample for each statement that is false. AN(B-Q=(ANB)-(ANO. 


Assume all sets are subsets of a universal set U. 


13. For all sets A, B, and C, 
5. For all sets A, B, and C, 
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H 19. 


H23. 


24. 


H 25. 


H* 26 


. For all sets A, B, and C, if AM C = B N C and 
AUC=BUC,thenA=B. 


. For all sets A, B, and C,(A—B)UCCAU(C—B). 


. For all sets A and B, if AM B = Ø then 
AXB=©. 


. For all sets A and B, if A C B then P(A) C P(B). 
. For all sets A and B, P(A U B) C P(A) U P(B). 
For all sets A and B, P(A) U P(B) C P(A U B). 
. For all sets A and B, P(A N B) = P(A) N P(B). 
. For all sets A and B, P(A X B) = P(A) X P(B). 


. Write a negation for each of the following state- 
ments. Indicate which is true, the statement or its 
negation. Justify your answers. 

a. V sets S, Ja set T such that S N T = Ø. 
b. 3a set S such that Y sets T, S U T = Ø. 


Let S = {a, b, c}, and for each integer i = 0, 1, 

2, 3, let S; be the set of all subsets of S that have i 
elements. List the elements in So, S1, S2, and $3. Is 
{So, $1, S2, S3} a partition of P(S)? 


Let A = {t, u, v, w}, and let S; be the set of all 
subsets of A that do not contain w and S, the set of 
all subsets of A that contain w. 

. Find $. 

. Find $}. 

Are Sı and S, disjoint? 

. Compare the sizes of S$; and S}. 

. How many elements are in $, U S2? 

What is the relation between S; U S and 
P(A)? 


hoan gow 


Use mathematical induction to prove that for every 
integer n = 2, if a set S has n elements, then the 
number of subsets of S with an even number of 
elements equals the number of subsets of S with 
an odd number of elements. 


. The following problem, devised by Ginger Bolton, 
appeared in the January 1989 issue of the Col- 
lege Mathematics Journal (Vol. 20, No. 1, p. 68): 
Given a positive integer n = 2, let S be the set of 
all nonempty subsets of {2, 3,..., n}. For each 
S; E S, let P; be the product of the elements of S;. 
Prove or disprove that 


a n+1)! 
Eana 


> Pi 2 


i=1 
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In 27 and 28 supply a reason for each step in the derivation. 


27. For all sets A, B, and C, 


H 28. 


(AUB)NC=ANQU(BNC). 
Proof: Suppose A, B, and C are any sets. Then 
(AAUB)NC=CN(AUB) by (a) 
=(CNA)U(CNMB) by (b) 
=(ANC)U(BNC) by ©. 
For all sets A, B, and C, 
(A U B)—(C—A) =AU (B-C). 
Proof: Suppose A, B, and C are any sets. Then 


(A U B)—(C—A) = (AU B) N (C-A) by (a) 
=(AUB)N(CNAY by (by 
=(AUB)N(ASNCY by © 
=(AUB)N (AY UC’) by @ 
=(AUB)N(AUC) by © 
=AU(BNC by ©) 
=AU(B-C) by (g). 


H 29. 


Some steps are missing from the following 
proof that for all sets A and B, (A U B®) — B = 
(A — B) U B®. Indicate what they are, and then 
write the proof correctly. 


Proof: Let any sets A and B be given . Then 


(AU BS)-—B=(AUB)N BS by the set difference 

law 

= (BS N A) U (BS N B°) by the distributive 
law 

= (BSN A)U BS by the idempotent 
law for U 

= (A-B) UB“ by the set differ- 
ence law. 


In 30-40, construct an algebraic proof for the given state- 
ment. Cite a property from Theorem 6.2.2 for every step. 


30. 


31. 
32. 
33. 
34. 


35. 


For all sets A, B, and C, 
(ANB)UC=AUC)N(BUC). 
For all sets A and B, AU (B—A) =A UB. 
For all sets A and B, (A — B) U (A N B) =A. 
For all sets A and B, (A — B) N (A N B) = Ø. 
For all sets A, B, and C, 
(A-B)-C=A-(BUC). 
For all sets A and B, A — (A—- B) =A AB. 
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36. For all sets A and B, ((A® U B®) — AY = A. 
37. For all sets A and B, (B® U (B—A))* = B. 
38. For all sets A and B, (A N BSF NA=A-B. 
39. For all sets A and B, 

(A—B)U (B—A) = (AUB)—-(ANB). 


40. For all sets A, B, and C, 
(A—B)—-(B-C)=A-—B. 


In 41-43 simplify the given expression. Cite a property 
from Theorem 6.2.2 for every step. 


H 41. AN (BUA) NB) 
42. (A—(AN B) N(B-(ANB)) 
43. (AN (BU C) N (A-B) N (BUC 


44. Consider the following set property: For all sets A 
and B, A — B and B are disjoint. 
a. Use an element argument to derive the 
property. 
b. Use an algebraic argument to derive the property 
(by applying properties from Theorem 6.2.2). 
c. Comment on which method you found easier. 


45. Consider the following set prop- 

erty: For all sets A, B, and C, 

(4—B) U (B-C) = (A U B)- (BN C). 

a. Use an element argument to derive the 
property. 

b. Use an algebraic argument to derive the 
property (by applying properties from 
Theorem 6.2.2). 

c. Comment on which method you found easier. 


ANSWERS FOR TEST YOURSELF 


Definition: Given sets A and B, the symmetric differ- 
ence of A and B, denoted A A B, is 


A A B = (A —B) U (B —A). 


46. Let A = {1,2,3,4}, B = {3, 4, 5, 6}, and 
C = {5, 6, 7, 8}. Find each of the following sets: 
a. AAB 
b. BAC 
c. AAC 
d. (AAB)AC 
Refer to the definition of symmetric difference given 
above. Prove each of 47-52, assuming that A, B, and C are 
all subsets of a universal set U. 


47. AAB=BAA 


48. AAO=A 
49. AAAS =U 
50. AAA=@ 


H* 51. IfA A C= BAC, then A = B. 
H* 52. (AAB)AC=AA(BAC) 
H 53. Derive the set identity A U (A N B) = A from the 


properties listed in Theorem 6.2.2(1)—(9). Start by 
showing that for every subset B of a universal set 
U, U U B = U. Then intersect both sides with A 
and deduce the identity. 


54. Derive the set identity A M (A U B) = A from the 
properties listed in Theorem 6.2.2(1)—(9). Start by 
showing that for every subset B of a universal set 
U, Ø = Ø N B. Then take the union of both sides 
with A and deduce the identity. 


1. make the left-hand side unequal to the right-hand side 
(Or: result in different values on the two sides of the 


equation) 2. cite one of the properties from Theorem 6.2.2 
(Or: give a precise reason) 3. exactly 


KÆ] Boolean Algebras, Russell’s Paradox, 
and the Halting Problem 


From the paradise created for us by Cantor, no one will drive us out. 


—David Hilbert (1862-1943) 


Table 6.4.1 summarizes the main features of the logical equivalences from Theorem 2.1.1 
and the set properties from Theorem 6.2.2. Notice how similar the entries in the two 


columns are. 
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TABLE 6.4.1 


Logical Equivalences 


Set Properties 


For all statement variables p, q, and r: 


For all sets A, B, and C: 


a.pVqG=4\VP 
b. p^q=q4^p 


aAUB=BUA 
bANB=BNA 


apA(qAr)=(pAQgAr 
bpVaVvn=PVQvr 


aAN(BNC)=ANBYNC 
bAUBUC)=(AUB)UC 


apA(aVN=PAQVPADN 
bpVvViqAn=PVQAPVr) 


aAN(BUC)=(ANB)U(ANC) 
bAUBNC)=AUB)NAUC) 


b.~(pAq)=~PV ~q 


a.pVC=p aAUO=A 

b pAt=p bANU=A 
a.pVv~p=t a AUA =U 
b.p\~p=e b ANAS =Ø 
~(~p) =p (AY =A 
a.pV/p=p a.AUA=A 

b. pAp=p bANA=A 
apvt=t a AUU=U 

b. pA\c=ce bAND =O 

a. ~(pV q) = ~P\~q a. (A U B) = ASN BS 


b. (A N B} = ASU B° 


a. pV (p Aq) =p a AU(ANB)=A 


b. pA(pV 4) =P bAN(AUB)=A 
a.~t=c a. US =H 
b. ~c=t b @° =U 


If you let v (or) correspond to U (union), A (and) correspond to N (intersection), 
t (a tautology) correspond to U (a universal set), c (a contradiction) correspond to © (the 
empty set), and ~ (negation) correspond to ° (complementation), then you can see that 
the structure of the set of statement forms with operations v and ^ is essentially iden- 
tical to the structure of the set of subsets of a universal set with operations U and N. 
In fact, both are special cases of the same general structure, known as a Boolean 
algebra. The essential idea of a Boolean algebra was introduced by the self-taught 
English mathematician/logician George Boole in 1847 in a book entitled The Math- 
ematical Analysis of Logic. During the remainder of the nineteenth century, Boole 
and others amplified and clarified the concept until it reached the form in which we 
use it today. 

In this section we show how to derive the various properties associated with a Boolean 
algebra from a set of five axioms. 
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Definition and Axioms for a Boolean Algebra 


A Boolean algebra is a set B together with two operations, generally denoted + and ., 
such that for all a and b in B both a + b and a:b are in B and the following axioms 
are assumed to hold: 


1. Commutative Laws: For all a and b in B, 
(@a)a+tb=b+a and (b)a:b = b-a. 
2. Associative Laws: For all a, b, and c in B, 
(a) (a+b)+c=a+(b+c) and (b)(a-b)-c =a: (b-c). 
3. Distributive Laws: For all a, b, and c in B, 
(a)at+(b-c) =(at+b):(a+c) and (b)a:(b+c) = (a:b) +(a-c). 
4. Identity Laws: There exist distinct elements 0 and 1 in B such that for each a in B, 
@at0=a and (b)a-1=a. 


5. Complement Laws: For each a in B, there exists an element in B, denoted a and 
called the complement or negation of a, such that 


(ajata=1 and (b)a-a=0. 


In any Boolean algebra, the complement of each element is unique, the quantities 0 and 
1 are unique, and identities analogous to those in Theorem 2.1.1 and Theorem 6.2.2 can 
be deduced. 


Theorem 6.4.1 Properties of a Boolean Algebra 


Let B be any Boolean algebra. 

1. Uniqueness of the Complement Laws: For all a and x in B, if a+x = 1 and 
a-x = O then x = a. 

2. Uniqueness of 0 and 1: If there exists x in B such that a +x = a for every a in 
B, then x = 0, and if there exists y in B such that a'y = a for every a in B, then 


y=. 
3. Double Complement Law: For every a € B, (a) = a. 
4. Idempotent Laws: For every a € B, 


(ajata=a and (b)a‘a=a. 


5. Universal Bound Laws: For every a € B, 


(a)a+1=1 and (b)a-0=0. 


6. De Morgan’s Laws: For all a and b E B, 
@a+b=a-b and (b)a-b=atbD. 
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7. Absorption Laws: For all a and b E B, 


(a) (a+b)-a 


a and (b)(a:b)+a=<a. 


8. Complements of 0 and 1: 


(a)0=1 and (b)T=0. 


Proof: 


Part 1: Uniqueness of the Complement Law 
Suppose a and x are particular, but arbitrarily chosen, elements of B that satisfy the 
following hypothesis: a+ x = 1 and a-x = 0. Then 


x=x-l because | is an identity for- 
= x: (a+a) by the complement law for + 
=xa+xa by the distributive law for- over + 
=a'xt+x-a by the commutative law for- 
=O+x-a by hypothesis 
=a:at+x-a by the complement law for- 
= (aʻa) + (a: x) by the commutative law for: 
=a (a+x) by the distributive law forover + 
=a'l by hypothesis 
=a because | is an identity for-. 


Proofs of the other parts of the theorem are discussed in the examples that follow 
and in the exercises. 


You may notice that all parts of the definition of a Boolean algebra and most parts of 
Theorem 6.4.1 contain paired statements. For instance, the distributive laws state that for 
all a, b, and c in B, 


(a)a+(b:c)= (a+b) (a+c) and (b)a-(b+c) = (a:b) +(a-c), 
and the identity laws state that for every a in B, 
@)a+0=a and (b)a‘l =a. 


Each of the paired statements can be obtained from the other by interchanging all the + 
and: signs and interchanging | and 0. Such interchanges transform any Boolean identity 
into its dual identity. It can be proved that the dual of any Boolean identity is also an iden- 
tity. This fact is often called the duality principle for a Boolean algebra. 


Proof of the Double Complement Law 


Prove that for all elements a in a Boolean algebra B, (a) = a. 
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Solution Start by supposing that B is a Boolean algebra and a is any element of B. 
The basis for the proof is the uniqueness of the complement law: that each element 
in B has a unique complement, which satisfies certain equations with respect to it. 
So if a can be shown to satisfy those equations with respect to a, then a must be the 
complement of a. 


Theorem 6.4.1(3) Double Complement Law 


For every element a in a Boolean algebra B, @ =a. 


Proof: 
Suppose B is a Boolean algebra and a is any element of B. Then 


ata=ata by the commutative law for + 


=1 by the complement law for 1 
and 
aa=a-a by the commutative law for. 
=0 by the complement law for 0. 


Thus a satisfies the two equations with respect to a that are satisfied by the com- 
plement of a. From the fact that the complement of a is unique, we conclude that 
(@) =a. 


eM keeway Proof of an idempotent Law 


Fill in the blanks in the following proof that for all elements a in a Boolean algebra B, 
ata=a. 


Proof: 
Suppose B is a Boolean algebra and a is any element of B. Then 


a=a+0 (a) 
=a+(aʻa) (b) 
=(ata):(a+a) (c) 
=(ata):l (d) 
=ata ©. 


Solution 
(a) because 0 is an identity for + 
(b) by the complement law for - 
(c) by the distributive law for + over: 
(d) by the complement law for + 


(e) because | is an identity for- E 
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Russell’s Paradox 


By the beginning of the twentieth century, abstract set theory had gained such wide accep- 
tance that a number of mathematicians were working hard to show that all of mathematics 
could be built upon a foundation of set theory. In the midst of this activity, the English 
mathematician and philosopher Bertrand Russell discovered a “paradox” (really a genuine 
contradiction) that seemed to shake the very core of the foundation. The paradox assumes 
Cantor’s definition of set as “any collection into a whole of definite and separate objects of 
our intuition or our thought.” 


Russell’s Paradox: Most sets are not elements of themselves. For instance, the set of all 
integers is not an integer and the set of all horses is not a horse. However, we can imagine 
the possibility of a set’s being an element of itself. For instance, the set of all abstract ideas 
might be considered an abstract idea. If we are allowed to use any description of a property 
Bertrand Russell as the defining property of a set, we can let S be the set of all sets that are not elements of 
(1872-1970) themselves: 
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S = {A|A is a set and A € A}. 


Is S an element of itself? 

The answer is both yes and no. For suppose S € S. Then S satisfies the defining 
property for S, and hence S € S. This contradicts the supposition that $ E S and shows 
that S É S. Next suppose S € S. Then S is a set such that S$ € S and so S satisfies the 
defining property for S, which implies that S € S. This contradicts the supposition that 
S É S and shows that S E S. Thus both S$ E S and S Æ S, which is impossible because a 
statement is either true or false but not both. To help explain his discovery to laypeople, 
Russell devised a puzzle, the barber puzzle, whose solution exhibits the same logic as 
his paradox. 


eke The Barber Puzzle 


In a certain town there is a male barber who shaves all those men, and only those men, who 
do not shave themselves. Question: Does the barber shave himself? 


Solution The answer is both yes and no. If the barber shaves himself, he is a member 
of the class of men who shave themselves. But no member of this class is shaved by the 
barber, and so the barber does not shave himself. On the other hand, if the barber does not 
shave himself, he belongs to the class of men who do not shave themselves. But the barber 
shaves every man in this class, so the barber does shave himself. E 


How can the answer be both yes and no? Surely any barber either does or does not shave 
himself. You might try to think of circumstances that would make the paradox disappear. 
For instance, maybe the barber happens to have no beard and never shaves. But a condition 
of the puzzle is that the barber is a man who shaves all those men who do not shave them- 
selves. If he does not shave, then he does not shave himself, in which case he is shaved 
by the barber and the contradiction is as present as ever. Other attempts at resolving the 
paradox by considering details of the barber’s situation are similarly doomed to failure. 

So let’s accept the fact that the paradox has no easy resolution and see where that thought 
leads. Since the barber both shaves himself and doesn’t shave himself, the sentence “The 
barber shaves himself” is both true and false. Yet the sentence arose in a natural way from 
a description of a situation. If the situation actually existed, then the sentence would have 
to be either true or false but not both. Thus we are forced to conclude that the situation 
described in the puzzle simply cannot exist in the world as we know it. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


420 CHAPTER6 SET THEORY 


In a similar way, the conclusion to be drawn from Russell’s paradox itself is that the 
object S is not a set. Because if it actually were a set, in the sense of satisfying the gen- 
eral properties of sets that we have been assuming, then it either would be an element of 
itself or not. 

In the years following Russell’s discovery, several ways were found to define the basic 
concepts of set theory so as to avoid his contradiction. The way used in this text requires 
that, except for the power set whose existence is guaranteed by an axiom, whenever a set is 
defined using a predicate as a defining property, the stipulation must also be made that the 
set is a subset of a known set. This method does not allow us to talk about “the set of all 
sets that are not elements of themselves.” We can speak only of “the set of all sets that are 
subsets of some known set and that are not elements of themselves.” When this restriction 
is made, Russell’s paradox ceases to be contradictory. Here is what happens: 

Let U be a universal set and suppose that all sets under discussion are subsets of U. Let 


S = {A|A C UandA € A}. 
In Russell’s paradox, both implications 
SES—>SES and SES—HSES 
are proved, and the contradictory conclusion 
bothSES and SES 


is therefore deduced. In the situation in which all sets under discussion are subsets of U, 
the implication S E S$ — S ¢ S is proved in almost the same way as it is for Russell’s 
paradox: (Suppose $ € S. Then by definition of S, S C U and S € S. In particular, S € S.) 
On the other hand, from the supposition that S € S we can only deduce that the statement 
“S C U and S € S” is false. By one of De Morgan’s laws, this means that “S Z U or S E S?” 
Since S E S would contradict the supposition that $ € S, we eliminate it and conclude that 
S Œ U. In other words, the only conclusion we can draw is that the seeming “definition” of 
S is faulty—in other words, S is not a set in U. 

Russell’s discovery had a profound impact on mathematics because even though his 
contradiction could be made to disappear by more careful definitions, its existence caused 
people to wonder whether other contradictions remained. In 1931 Kurt Gödel showed that 
it is not possible to prove, in a mathematically rigorous way, that mathematics is free of 
contradictions. You might think that Gédel’s result would have caused mathematicians to 
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Kurt Gödel give up their work in despair, but that has not happened. On the contrary, there has been 
(1906-1978) more mathematical activity since 1931 than in any other period in history. 
The Halting Problem 


Well before the actual construction of an electronic computer, Alan M. Turing 
(1912-1954) deduced a profound theorem about how such computers would have to 
work. The argument he used is similar to that in Russell’s paradox. It is also related 
to those used by Gédel to prove his theorem and by Cantor to prove that it is impossible 
to write all the real numbers in an infinitely long list, even given an infinitely long pe- 
riod of time (see Section 7.4). 

If you have some experience programming computers, you know how badly an infi- 
nite loop can tie up a computer system. It would be useful to be able to preprocess a pro- 
gram and its data set by running it through a checking program that determines whether 
execution of the given program with the given data set would result in an infinite loop. 
Can an algorithm for such a program be written? In other words, can an algorithm be 
written that will accept any algorithm X and any data set D as input and will then print 
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“halts” or “loops forever” to indicate whether X terminates in a finite number of steps or 
loops forever when run with data set D? In the 1930s, Turing proved that the answer to 
this question is no. 


Theorem 6.4.2 


There is no computer algorithm that will accept any algorithm X and data set D as 
input and then will output “halts” or “loops forever” to indicate whether or not X 
terminates in a finite number of steps when X is run with data set D. 


Proof (by contradiction): 
Suppose there is an algorithm, CheckHalt, such that if an algorithm X and a data set 
D are input, then 


CheckHalt(X, D) prints 
“halts” if X terminates in a finite number of steps 
when run with data set D 
or 
“loops forever” if X does not terminate in a finite number of 


steps when run with data set D. 


[To show that no algorithm such as CheckHalt can exist, we will deduce a contradiction. ] 

Observe that the sequence of characters making up an algorithm X can be 
regarded as a data set itself. Thus it is possible to consider running CheckHalt 
with input (X, X). Define a new algorithm, Test, as follows: For any input algo- 
rithm X, 


Test(X) 

loops forever if CheckHalt(X, X) prints “halts” 
or 

stops if CheckHalt(X, X) prints “loops forever”. 


Now run algorithm Test with input Test. If Test(Test) terminates after a finite 
number of steps, then the value of CheckHalt(Test, Test) is “halts” and so Test(Test) 
loops forever. 

On the other hand, if Test(Test) does not terminate after a finite number of steps, 
then CheckHalt(Test, Test) prints “loops forever” and so Test(Test) terminates. 

The two paragraphs above show that Test(Test) loops forever and also that it ter- 
minates. This is a contradiction. But the existence of Test follows logically from the 
supposition of the existence of an algorithm CheckHalt that can check any algorithm 
and data set for termination. [Hence the supposition must be false, and there is no such 
algorithm.] 


The axioms introduced into set theory to avoid Russell’s paradox are not entirely ad- 
equate to deal with the full range of recursively defined objects in computer algorithms. 
One response has been to develop an extension of set theory that includes new objects 
called hypersets. In addition, the kinds of semantic issues raised by the barber paradox are 
related to problems involved in processing natural language by computers. 
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TEST YOURSELF 


1. In the comparison between the structure of the 
set of statement forms and the set of subsets of a 
universal set, the or operation V corresponds 
to _____., the and operation ^ corresponds 
to ____, a tautology t corresponds to ____, 
a contradiction ¢ corresponds to ______, and the 
negation operation, denoted ~, corresponds 
to 


EXERCISE SET 6.4 


2. The operations of + and-in a Boolean algebra are 
generalizations of the operations of ___ and 
in the set of all statement forms in a given 
finite number of variables and the operations of 
and in the set of all subsets of a 
given set. 


3. Russell showed that the following proposed “set 
definition” could not actually define a set: 


In 1-3 assume that B is a Boolean algebra with operations 
+ and-. Give the reasons needed to fill in the blanks in 
the proofs using only the axioms for a Boolean algebra. 


1. Idempotent law for: : For every ain B, aʻa = a. 
Proof: Let a be any element of B. Then 


a=a'l (a) 
=a-(ata) (b) 
= (a:a)+(a-a) (c) 
= (a-a) +0 (d) 
=a (e) 


2. Universal bound law for +: For every a in B, 
a+1=1. 
Proof: Let a be any element of B. Then 
a+l=a+(a+a) (a) 
= (a+a)+a (b) 


=ata by Example 6.4.2 
al (c) 
3. Absorption law for:over +: For all a and b in B, 
(a+b)a=a. 


Proof: Let a be any element of B. Then 
(a+b)-a =a: (a+b) (a) 
=a:ata:b (b) 
=ata-b by exercise 1 
=a'lt+a:b (c) 
=a: (1+b) (d) 
=a: (b+1) (e) 
=a-l by exercise 2 
=a (f) 


In 4-10 assume that B is a Boolean algebra with opera- 
tions + and-. Prove each statement using only the axioms 
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for a Boolean algebra and statements proved in the text 
or in lower-numbered exercises. 


4. Universal bound for 0: For every a in B, a:0 = 0. 


5. Complements of 0 and 1: 


a O= 1 b. 1 =0 


6. Uniqueness of 0: There is only one element of B 
that is an identity for +. 


H 7. Uniqueness of 1: There is only one element of B 


that is an identity for -. 


8. De Morgan’s law for: : For all a and b in B, 
a'b = a+b. (Hint: Prove that (a:b) + (a+b) = 1 
and that (a-b) + (a + b) = 0, and use the fact that 
a:b has a unique complement.) 


9. De Morgan’s law for +: For all a and b in B, 
at+b=a-b. 


. Cancellation law: For all x, y, and z in B, if 
x+y =x+zandx-y = x-z, then y = z. 


11. Let S = {0, 1}, and define operations + andon S 


by the following tables: 
+| 0 |o 1 
0; O 1 0 0 0 
1 1 1 1 0 1 


a. Show that the elements of S satisfy the follow- 
ing properties: 
(i) the commutative law for + 
(ii) the commutative law for - 
(iil) the associative law for + 
(iv) the associative law for - 
H (v) the distributive law for + over - 
(vi) the distributive law for-over + 
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H b. Show that 0 is an identity element for + and 
that | is an identity element for -. 

c. Define 0 = 1 and 1 = 0. Show that for every 
ain S,a+a=1anda-a = Q. It follows from 
parts (a)—(c) that S is a Boolean algebra with 
the operations + and -. 

Exercises 12-15 provide an outline for a proof that the asso- 
ciative laws, which were included as an axiom for a Boolean 
algebra, can be derived from the other four axioms. The 
outline is from Introduction to Boolean Algebra by S. Givant 
and P. Halmos, Springer, 2009. In order to avoid unneeded 
parentheses, assume that - takes precedence over +. 


12. The universal bound law for + states that for every 
element a in a Boolean algebra, a+ 1 = 1. The 
proof shown in exercise 2 used the associative law 
for +. Rederive the law without using the associa- 
tive law and using only the other four axioms for a 
Boolean algebra. 


H 13. The absorption law for + states that for all ele- 
ments a and b ina Boolean algebra, a:b +a = a 
. Prove this law without using the associative law 
and using only the other four axioms for a Boolean 
algebra plus the result of exercise 12. 


14. Test for equality law: For all elements a, b, and c 
in a Boolean algebra, 


If b-a = c'a and b'a = c'a, then b = c. 


Without using the associative law, derive this law 
from the other four laws in the axioms for a Bool- 
ean algebra plus the result of exercise 12. 


H 15. The associative law for + states that for all 
elements a, b, and c in a Boolean algebra, 
a+(b+c) = (a+ b)+ c. Show that this law, as 
well as the associative law for -, can be derived 
from the other four axioms in the definition and 
axioms for a Boolean algebra. Then explain how 
to use your work to obtain a derivation for the 
associative law for -. 


Hints: To prove this theorem, suppose a, b, and 
c are any elements in a Boolean algebra B, and 
divide the proof into three parts. Part 1: Prove 
that (a+ (b+ c))-a = ((a + b) + c) +a. Part 2: 
Prove that (a + (b + c)):a = (at+b)+c):a. 
Part 3: Use the results of parts 1 and 2 to prove 
that a + (b + c) = (a+ b)+ c. You may use the 
universal bound law for +, the absorption law 
for +, and the test for equality law from exer- 
cises 12, 13, and 14 because the associative laws 
were not used to derive these properties. 


In 16-21 determine whether each sentence is a statement. 
Explain your answers. 


16. 
17. 
18. 
19. 
20. 
21. 
22. 


H 23. 


24. 


25. 


26. 


27. 


This sentence is false. 


If 1+1 = 3, then 1 = 0. 


The sentence in this box is a lie. 


All positive integers with negative squares are prime. 
This sentence is false or 1 + 1 = 3. 
This sentence is false and 1 + 1 = 2. 


a. Assuming that the following sentence is a 
statement, prove that 1 + 1 = 3: 


If this sentence is true, then 1 + 1 = 3. 


b. What can you deduce from part (a) about the 
status of “This sentence is true”? Why? (This 
example is known as Löb’s paradox.) 


The following two sentences were devised by the 
logician Saul Kripke. While not intrinsically para- 
doxical, they could be paradoxical under certain 
circumstances. Describe such circumstances. 


(i) Most of Nixon’s assertions about Watergate 
are false. 
(ii) Everything Jones says about Watergate is 
true. 
(Hint: Suppose Nixon says (ii) and the only utter- 
ance Jones makes about Watergate is (i).) 


Can there exist a computer program that has as 
output a list of all the computer programs that do 
not list themselves in their output? Explain your 
answer. 


Can there exist a book that refers to all those 
books and only those books that do not refer to 
themselves? Explain your answer. 


Some English adjectives are descriptive of 
themselves (for instance, the word polysyllabic is 
polysyllabic) whereas others are not (for instance, 
the word monosyllabic is not monosyllabic). The 
word heterological refers to an adjective that does 
not describe itself. Is heterological heterological? 
Explain your answer. 


As strange as it may seem, it is possible to give a 
precise-looking verbal definition of an integer that, 
in fact, is not a definition at all. The following was 
devised by an English librarian, G. G. Berry, and 
reported by Bertrand Russell. Explain how it leads 
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to a contradiction. Let n be “the smallest integer determine whether X prints a when run with data 
not describable in fewer than 12 English words.” set D? Explain. (This problem is called the print- 
(Note that the total number of strings consisting of ing problem.) 


11 or fi English words is finite. : oe ; 
cee eos 29. Use a technique similar to that used to derive 


H 28. Is there an algorithm which, for a fixed quantity Russell’s paradox to prove that for any set A, 
a and any input algorithm X and data set D, can P(A) ZA. 


ANSWERS FOR TEST YOURSELF 


1. the operation of union U; the operation of intersection N; a universal set U; the empty set ©; the operation of 


complementation, denoted by using the superscriptc 2. V; ^; U; N 3. the set of all sets that are not elements of themselves 
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PROPERTIES OF FUNCTIONS 


The concept of function is essential in all areas of mathematics and computer science. Ear- 
lier in this book we discussed sequences (which are functions defined on sets of integers), 
mod and div (which are functions defined on Cartesian products of integers), floor and 
ceiling (which are functions from R to Z), and truth tables and input/output tables (which 
can be regarded as Boolean functions). 

In this chapter we consider an additional wide variety of functions, focusing on those 
defined on discrete sets (such as finite sets or sets of integers). We then look at properties of 
functions such as one-to-one and onto, existence of inverse functions, and the interaction 
of composition of functions and the properties of one-to-one and onto. We end the chapter 
with the surprising result that there are different sizes of infinite sets and give an applica- 
tion to computability. 


JAN Functions Defined on General Sets 


The theory that has had the greatest development in recent times is without any doubt 
the theory of functions. —Vito Volterra, 1888 


As used in ordinary language, the word function indicates dependence of one varying 
quantity on another. If your teacher tells you that your grade in a course will be a function 
of your performance on the exams, you interpret this to mean that the teacher has some 
rule for translating exam scores into grades. To each collection of exam scores there cor- 
responds a certain grade. 

In Section 1.3 we defined a function as a certain type of relation. In this chapter we 
focus on the more dynamic way functions are used in mathematics. The following is a 
restatement of the definition of function that includes additional terminology associated 
with the concept. 


425 
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Definition 


A function f from a set X to a set Y, denoted f: X — Y, is arelation from X, the domain 
of f, to Y, the co-domain of f, that satisfies two properties: (1) every element in X is relat- 
ed to some element in Y, and (2) no element in X is related to more than one element in Y. 
Thus, given any element x in X, there is a unique element in Y that is related to x 
by f. If we call this element y, then we say that “f sends x to y” or “f maps x to y” 
and write x > y or f: x — y. The unique element to which f sends x is denoted 


f(x) andiscalled fofx, or 
the output of f for the input x, or 
the value of f at x, or 
the image of x under f. 


The set of all values of f taken together is called the range of f or the image of X 
under f. Symbolically: 


range of f = image of X under f = {y E Y | y = f(x), for some x in X}. 


x is an element such that f(x) = y, then x is called a preimage of y or an inverse 
image of y. The set of all inverse images of y is called the inverse image of y. 
Symbolically: 


A Given an element y in Y, there may exist elements in X with y as their image. When 


Caution! Use f(x) to 
refer to the value of the 


function f at x. Generally 
avoid using f(x) to refer to the inverse image of y = {a EX | FQ) = y}. 
the function f itself. 


In some mathematical contexts, the notation f(x) is used to refer both to the value 
of f at x and to the function f itself. Because using the notation this way can lead to 
confusion, we avoid it whenever possible. In this book, unless explicitly stated oth- 
erwise, the symbol f(x) always refers to the value of the function f at x and not to the 
function f itself. 

The concept of function was developed over a period of centuries. A definition similar 
to that given above was first formulated for sets of numbers by the German mathematician 
Lejeune Dirichlet (DEER-ish-lay) in 1837. 


Arrow Diagrams 


INTERFOTO/Alamy Stock Photo 


Recall from Section 1.3 that if X and Y are finite sets, you can define a function f from X to Y 
by drawing an arrow diagram. You make a list of elements in X and a list of elements in Y, and 


Johann Peter Gustav 
Lejeune Dirichlet 


(1805-1859) draw an arrow from each element in X to the corresponding X f Y 
element in Y, as shown in Figure 7.1.1. = 
This arrow diagram does define a function because: = <] 
1. Every element of X has an arrow that points to an element i 


in Y. pd 


2. No element of X has two arrows that point to two differ- 
ent elements of Y. FIGURE7.1.1 
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Sen WARm Functions and Nonfunctions 


Which of the arrow diagrams in Figure 7.1.2 define functions from X = {a,b,c} to 
Y = {1, 2, 3,4}? 


(a) (b) (c) 
FIGURE 7.1.2 


Solution Only (c) defines a function. In (a) the element b in X is not related to any ele- 
ment of Y because there is no arrow that points from b to an element in Y. And in (b) the 
element c is not related to a unique element of Y because from c there are two arrows that 
point to two different elements of Y—one toward 2 and the other toward 3. m 


A Function Defined by an Arrow Diagram 


Let X = {a, b, c} and Y = {1, 2, 3, 4}. Define a function f from X to Y by the arrow dia- 
gram in Figure 7.1.3. 


a. Write the domain and co-domain of f. x f Y 
C 
b. Find f(a), f(b), and f (o). 
ind f(a), f(b), and f(c) [4 

c. What is the range of f? pe 
d. Is c an inverse image of 2? Is b an inverse image of 3? 
e. Find the inverse images of 2, 4, and 1. FIGURE 7.1.3 
f. Represent fas a set of ordered pairs. 
Solution 
a. domain of f= {a, b, c}, co-domain of f= {1, 2, 3, 4} 
b. f(a) = 2, f(b) = 4, flo) = 2 
c. range of f= {2, 4} 
d. yes, no 
e. inverse image of 2 = {a, c} 

inverse image of 4 = {b} 

inverse image of 1 = @ (since no arrows point to 1) 
f. {(a, 2), (b, 4), (c, 2)} a 


In Example 7.1.2 there are no arrows pointing from X to the | or the 3 in Y. This illus- 
trates the fact that although each element of the domain of a function must have an arrow 
pointing out from it, there can be elements of the co-domain to which no arrows point. 
Note also that there are two arrows pointing to the 2—one coming from a and the other 
from c. This does not violate the definition of function. 

In Section 1.3 we gave a test for determining whether two functions with the same do- 
main and co-domain are equal, saying that the test results from the definition of a function 
as a relation. We formalize this justification in Theorem 7.1.1. 
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Theorem 7.1.1 A Test for Function Equality 


If F: X — Y and G: X — Y are functions, then F = G if, and only if, F(x) = GŒ) 
FOr eveni NE N. 


Note Proof: Suppose F : X — Y and G : X — Y are functions; that is, F and G are relations 
(%y) EF y= FQ) from X to Y that satisfy the two additional function properties. Then F and G are 
@ y) € Gey = G(x). subsets of X X Y, and for (x, y) to be in F means that y is the unique element related 
to x by F, which we denote as F (x). Similarly, for (x, y) to be in G means that y is the 
unique element related to x by G, which we denote as G (x). 


Now suppose that F(x) = G(x) for every x E€ X. Then if x is any element of X, 


œ, y EF&y= Fase y= GaSe x,y) EG because F(x) = G(x). 
So F and G consist of exactly the same elements and hence F = G. 
Conversely, if F = G, then for every x € X, 
y= Fx e x,y EFS x,y) €Gsy= Gia) because F and G consist 
of exactly the same 


Thus, since both F(x) and G(x) equal y, we have that elements. 


F(x) = G(x). 


Equality of Functions 


a. Let J; = {0, 1, 2}, and define functions f and g from J; to J3 as follows: For every x in J3, 
f(x) = (°° +x4+1)mod3 and g(x) = (x +2) mod 3. 
Does f= g? 


b. Let F: RR and G: R — R be functions. Define new functions F + G : R —> R and 
G + F : R —> R as follows: For every x E R, 


(F+ G)(x) = F(x)+ G(x) and (G+FXx)= G(x) + F(x). 
Does F+G=G+F? 
Solution 


a. Yes, the table of values shows that f(x) = g(x) for every x in J3. 


xX+x+1 | f(x) =(@?4+x4+1)mod3 | (x+2) | g(x) = (x +2)? mod 3 


x 

0 1 1 mod3 = 1 4 4 mod3 = 1 
1 3 3 mod 3 = 0 9 9 mod3 =0 
2 7 7 mod3 = 1 16 16 mod 3 = 1 


b. Again the answer is yes. For every real number x, 


(F+ G)(x) = F(x) + G(x) by definition of F+ G 


= G(x) + F(x) by the commutative law for addition of real numbers 
= (G+ F)(x) by definition of G+ F. 
Hence F+G=G+F. E 
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Examples of Functions 


The following examples illustrate some of the wide variety of different types of functions. 


The Identity Function on a Set 


Given a set X, define a function Jy from X to X by 
Ixy (x) =x for each xin X. 


The function /y is called the identity function on X because it sends each element of X to 
the element that is identical to it. Thus the identity function can be pictured as a machine 
that sends each piece of input directly to the output chute without changing it in any way. 


Let X be any set, and suppose that aij and @(z) are elements of X. Find Ix (a£) and Iy((z)). 


Solution Whatever is input to the identity function comes out unchanged, so 
Ix(afj) = a; and OC) = ©). m 


ZCWUGYARE Sequences 


The formal definition of sequences specifies that an infinite sequence is a function defined 
on the set of integers that are greater than or equal to a particular integer. For example, the 
sequence denoted 


11 141 (-1)" 
3° aes; 0? er hae 


1, 


can be thought of as the function f from the nonnegative integers to the real numbers that 


i ‘ —1)" 
associates 0 > 1, 1 > L, 2 — i; 3 — L, 4—> A and, in general, n —> = In other 


words, f: Z"°""*S — R is the function defined as follows: 


—1 n 
Send each integer n = 0 to f(n) = na 
n+l 


In fact, there are many functions that can be used to define a given sequence. For in- 
stance, express the sequence above as a function from the set of positive integers to the set 
of real numbers. 

—qyn+1 
Solution Define g:Z* — Rby g(n) = CD foreachn € Z*. Then gl) = 1,22) = —4, 
g(3) = i, and, in general, 


(-1)"*? (-1)" 
n+1 ntl =O n 


g(n+1)= 


ZCUGYARJ A Function Defined on a Power Set 


Recall from Section 6.1 that P(A) denotes the set of all subsets of the set A. Define a func- 
tion F: PHa, b, c}) — Z"°""®s as follows: For each X E P({a, b, c}), 


F(X) = the number of elements in X. 


Draw an arrow diagram for F. 
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Example 7.1.7 


Note It is customary to 
omit one set of parenthe- 
ses for functions defined 
on Cartesian products. For 
example, we write M (a, b) 
rather than M (a, b)). 


Note It is not obvious, 
but it is true, that for 
any positive real num- 
ber x there is a unique 
real number y such that 
b” = x. Most calculus 
books contain a discus- 
sion of this result. 


CHAPTER7 PROPERTIES OF FUNCTIONS 


Solution 


Alka, b, c}) 


gronneg 


Functions Defined on a Cartesian Product 


Define functions M: R X R — R and R: R X R — R xX R as follows: For each ordered 
pair (a, b) of integers, 


M(a,b)=ab and R(a,b)= (—a, b). 


Then M is the multiplication function that sends each pair of real numbers to the product of 
the two, and R is the reflection function that sends each point in the plane that corresponds 
to a pair of real numbers to the mirror image of the point across the vertical axis. Find the 


following: 

a. M(-1, —1) b. M(5,5) c. M(V2, V2) 

d. R(2, 5) e. R(—2,5) f. RB, —4) 

Solution 

a. (-1)(-1) = 1 b. (1/2)(1/2) = 1/4 c. V2:V2=2 

d. (—2, 5) e. (~(=2), 5) = (2, 5) f. (-3, —4) fal 


Definition Logarithms and Logarithmic Functions 


Let b be a positive real number with b # 1. For each positive real number x, the 
logarithm with base b of x, written log , x, is the exponent to which b must be raised 
to obtain x. Symbolically: 


py & PY =x 


The logarithmic function with base b is the function from R” to R that takes each 
positive real number x to log, x. 


SCUPGYAR:S The Logarithmic Function with Base b 
Find the following: 


a. 


e. 


log; 9 b. log2(3) c. logo (1) d. log» (2™) (m is any real number) 


22e (m > 0) 
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Solution 
a. log; 9 = 2 because 3° = 9. b. log (5) = —] because 2 | = 5. 


c. logio (1) = 0 because 10° =1, 
d. log, (2”) = m because the exponent to which 2 must be raised to obtain 2” is m. 


e. 2280 = m because log, (m) is the exponent to which 2 must be raised to obtain m. E 


Recall that if S is a nonempty, finite set of characters, then a string over S can be regarded 
as a finite sequence of elements of S. The number of characters in a string is called the 
length of the string. The null string over S is the “string” with no characters. It is usually 
denoted A and is said to have length 0. 


ZEWJGAKEJ Encoding and Decoding Functions 


Digital messages consist of finite sequences of 0’s and 1’s. When they are communicated 
across a transmission channel, they are frequently coded in special ways to reduce the pos- 
sibility that they will be garbled by interfering noise in the transmission lines. For example, 
suppose a message consists of a sequence of 0’s and 1’s. A simple way to encode the mes- 
sage is to write each bit three times. Thus the message 


00101111 
would be encoded as 
000000111000111111111111. 


The receiver of the message decodes it by replacing each section of three identical bits by 
the one bit to which all three are equal. 

Let A be the set of all strings of 0’s and I’s, and let T be the set of all strings of 0’s and 1’s 
that consist of consecutive triples of identical bits. The encoding and decoding processes 
described above are actually functions from A to T and from T to A. The encoding function 
E is the function from A to T defined as follows: For each string s € A, 


E(s) = the string obtained from s by replacing each 


bit of s by the same bit written three times. 
The decoding function D is defined as follows: For each string t E T, 


D (t) = the string obtained from tf by replacing each consecutive 
triple of three identical bits of t by a single copy of that bit. 


The advantage of this particular coding scheme is that it makes it possible to do a cer- 
tain amount of error correction when interference in the transmission channels has intro- 
duced errors into the stream of bits. If the receiver of the coded message observes that one 
of the sections of three consecutive bits that should be identical does not consist of identical 
bits, then one bit differs from the other two. In this case, if errors are rare, it is likely that 
the single bit that is different is the one in error, and this bit is changed to agree with the 
other two before applying the decoding function. E 


The Hamming Distance Function 


The Hamming distance function, named after the computer scientist Richard W. Hamming, 
is very important in coding theory. It gives a measure of the “difference” between two 
strings of 0’s and ?’s that have the same length. Let S, be the set of all strings of 0’s and I’s 
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of length n. Define a function H: S, X S, —> Z"°"" as follows: For each pair of strings 
(s, D E Sp X Sw 
H (s, t) = the number of positions in which s and t have different values. 

Thus, letting n = 5, 

H (11111, 00000) = 5 
because 11111 and 00000 differ in all five positions, whereas 

H (11000, 00000) = 2 
because 11000 and 00000 differ only in the first two positions. 

a. Find H (00101, 01110). b. Find H (10001, 01111). 


Courtesy of U.S. Naval Academy 


: - Solution 
Richard Hamming 


(1915-1998) a. 3 b. 4 E 


Boolean Functions 


In Section 2.4 we showed how to find input/output tables for certain digital logic circuits. 
Any such input/output table defines a function in the following way: The elements in the 
input column can be regarded as ordered tuples of 0’s and 1’s; the set of all such ordered 
tuples is the domain of the function. The elements in the output column are all either 0 or 
1; thus {0, 1} is taken to be the co-domain of the function. The relation sends each input 
element to the output element in the same row. Thus, for instance, the input/output table 
of Figure 7.1.4(a) defines the function with the arrow diagram shown in Figure 7.1.4(b). 

More generally, the input/output table corresponding to a circuit with n input wires has 
n input columns. Such a table defines a function from the set of all n-tuples of 0’s and 1’s 
to the set {0, 1}. 


Input Output 
P Q R S 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 1 
0 0 1 0 
0 0 0 0 
(a) (b) 


FIGURE 7.1.4 Two Representations of a Boolean Function 


Definition 


An (n-place) Boolean function fis a function whose domain is the set of all ordered 
n-tuples of 0’s and 1’s and whose co-domain is the set {0, 1}. More formally, the 
domain of a Boolean function can be described as the Cartesian product of n copies 
of the set {0, 1}, which is denoted {0, 1}". Thus f: {0, 1}"— {0, 1}. 
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SCUACYARL A Boolean Function 


Consider the three-place Boolean function defined from the set of all 3-tuples of 0’s and I’s 
to {0, 1} as follows: For each triple (x1, x2, x3) of 0’s and 1’s, 


Sq, Xa, X3) = (x1 + x2 + x3) mod 2. 
Describe f using an input/output table. 
Solution fd, 1, 1) = A +1+1) mod2 =3mod2 = 1 
fC, 1,0) = (1 +1+0) mod 2 = 2 mod2 = 0 


The rest of the values of f can be calculated similarly to obtain the following table. 


Input Output 
Xy X2 X3 | (Ky 4X2 4.43) mod 2 
1 1 1 1 
1 1 0 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 1 
0 0 1 1 
0 0 0 0 


Checking Whether a Function Is Well Defined 


It can sometimes happen that what appears to be a function defined by a rule is not really 
a function at all. To give an example, suppose we wrote, “Define a function f: R > R by 
specifying that for each real number x, 


f(x) is the real number y such that r+y=1” 


There are two distinct reasons why this description does not define a function. For almost 
all values of x, either (1) there is no y that satisfies the given equation or (2) there are two 
different values of y that satisfy the equation. For instance, when x = 2, there is no real 
number y such that 27+ y = |, and when x = 0, both y = —1 and y = 1 satisfy the equa- 
tion 0° + y = 1. In general, we say that a “function” is not well defined if it fails to satisfy 
at least one of the requirements for being a function. 


ZCUIGYARVA A Function That Is Not Well Defined 
Recall that Q represents the set of all rational numbers. Suppose you read that a function 
f: Q — Zis to be defined by the formula 
m : : 
(2) =m for all integers m and n with n # 0. 


That is, the integer associated by f to the number 7 is m. Is f well defined? Why? 


Solution The function fis not well defined. The reason is that fractions have more than 
one representation as quotients of integers. For instance, 5 = z. Now if f were a function, 
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then the definition of a function would imply that f (5) = (2) since 5 = 2. But applying the 


formula for f, you find that 
py 1 d J= 3 
A2 am leas 


and so 


This contradiction shows that fis not well defined and, therefore, is not a function. E 


Note that the phrase well-defined function is actually redundant; for a function to be 
well defined really means that it is worthy of being called a function. 


Functions Acting on Sets 


Given a function from a set X to a set Y, you can consider the set of images in Y of all the el- 
ements in a subset of X and the set of inverse images in X of all the elements in a subset of Y. 


Definition 


Note For y E€ Y, If f: X — Y is a function and A C X and C C Y, then 


“ina =p iiie 
1 (=F a) f(A) = {y E Y | y = f(x) for some x in A} 


and f \(C) = {x EX | fo) EC}. 
f(A) is called the image of A, and f ~'(C) is called the inverse image of C. 


SCUPCGYARkS The Action of a Function on Subsets of a Set 


Let X = {1, 2,3, 4} and Y = {a, b, c, d, e}, and define F: X — Y by the following arrow 
diagram: 


Let A = {1,4}, C = {a, b}, and D = {c, e}. Find F(A), F(X), F /(C), and F '(D). 
Solution 


F(A) = {b} F(X) = {a,b,d} F(C) = {1,2,4} FD) =Ø 


Interaction of a Function with Union 


Let X and Y be sets, let F be a function from X to Y, and let A and B be any subsets of X. 
Prove that F(A U B) C F(A) U F(B). 
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Solution 


The fact that X, Y, F, A, and B were formally introduced prior to the word “Prove” allows 
you to regard their existence and relationships as part of your background knowledge. 
Thus to prove that F(A U B) C F(A) U F(B), you only need show that if y is any element 
in F(A U B), then y is an element of F(A) U F(B). 


Proof: 


Suppose y E F(A U B). [We must show that y E F(A) U F(B).] By definition of function, 
y = F(x) for some x E A U B. By definition of union, x E A or x E B. 


Case 1, x EA: In this case, y = F(x) for some x in A. Hence y E F(A), and so by definition 
of union, y E F(A) U F(B). 


Case 2, x E B: In this case, y = F(x) for some x in B. Hence y E F(B), and so by definition 
of union, y E F(A) U F(B). 


Thus in either case y € F(A) U F(B) [as was to be shown]. (| 


Exercise 40 asks you to prove the opposite containment from the one in Example 7.1.14. 
Taken together, the example and the solution to the exercise establish the full equality that 
F(A U B) = F(A) U F(B). 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. 


Given a function f from a set X to a set Y, f(x) 6. 


is 


. Given a function f from a set X to a set Y, if 7. 


or or 


f(x) = y then y is called 
` 8. 


. Given a function f from a set X to a set Y, the range 


of f (or the image of X under f) is ———. 


Given functions f and g from a set X to a set Y, 
f= gif, and only if, —— 


Given positive real numbers x and b with b # 1, 
log, (x) = —___. 


Given a function f from a set X to a set Yanda 
subset A of X, f(A) = ——__. 


9. Given a function f from a set X to a set Yanda 
4. Given a function f from a set X to a set Y, if subset C of Y, fO S~ 
f(x) = y then x is called or 
5. Given a function f from a set X to a set Y, if y E Y 


EXERCISE SET 7.1“ 


then f '(y) = and is called 


1. Let X = {1, 3,5} and Y = {s, t, u, v}. Define a. Write the domain of f and the co-domain of f. 
f: X — Y by the following arrow diagram. b. Find f(1), f(3), and f(5). 
x f c. What is the range of f? 
F d. Is 3 an inverse image of s? Is 1 an inverse im- 
age of u? 
> Sal e. What is the inverse image of s? of u? of v? 
n f. Represent f as a set of ordered pairs. 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 


given. The symbol * signals that an exercise is more challenging than usual. 
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2. 
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Let X = {1, 3,5} and Y = {a, b, c, d}. Define 
g : X — Y by the following arrow diagram. 
X Y 
& 
-~a 


. Write the domain of g and the co-domain of g. 

. Find g(1), g(3), and g (5). 

. What is the range of g? 

. Is 3 an inverse image of a? Is 1 an inverse im- 
age of b? 

e. What is the inverse image of b? of c? 

f. Represent g as a set of ordered pairs. 


ano 


. Indicate whether the statements in parts (a)—(d) 


are true or false for all functions. Justify your 

answers. 

a. If two elements in the domain of a function are 
equal, then their images in the co-domain are 
equal. 

b. If two elements in the co-domain of a function 
are equal, then their preimages in the domain 
are also equal. 

c. A function can have the same output for more 
than one input. 

d. A function can have the same input for more 
than one output. 


w 


b. Find all functions from X = {a, b, c} to Y = {u}. 


c. Find all functions from X = {a, b, c} to Y = {u, v}. 


. Let Iz be the identity function defined on the set 


of all integers, and suppose that e, bE, K, and Uj 
all represent integers. Find the following: 
a. Iz(e) b. LOŽ c IKO) d. Izu) 


. Find functions defined on the set of nonnegative 


integers that can be used to define the sequences 
whose first six terms are given below. 


E EET N l b. 0, —2,4, —6, 8, —10 
STE gE OP JI AR eet A 


. Let A = {1, 2, 3, 4, 5}, and define a function 


F: P(A) — Z as follows: For each set X in P(A), 
0 if X has an even 
number of elements 


1 if X has an odd 
number of elements. 


Y= 
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. Find all functions from X = {a, b} to Y = {u, v}. 


Tl. 


12. 


13. 


14. 


Find the following: 
a. F({1,3,4) b. F(@) 
c. F({2, 3}) d. F({2, 3, 4, 5}) 


. Let J; = {0, 1, 2, 3, 4}, and define a function 


F: Js — Js as follows: For each x E Js, F(x) = 
(+ 2x +4) mod 5. 
Find the following: 
a. FO) b. Fd) c. F(2) 


d. FG) e FA) 


. Define a function S: Z* — Z* as follows: For 


each positive integer n, 


S(n) = the sum of the positive divisors of n. 


Find the following: 
a. S(1) b. S(15) c. S(17) 
d. S(5) e. S(18) f. S(21) 


. Let D be the set of all finite subsets of positive 


integers. 

Define a function T: Z* — D as follows: For each 
positive integer n, T(n) = the set of positive divi- 
sors of n. 


Find the following: 
a. T(1) b. T(15) c. T(17) 
d. T(5) e. T(18) f. T21) 


Define F: Z X Z — Z X Z as follows: For every 
ordered pair (a, b) of integers, F(a, b) = (2a + 1, 
3b—2). 


Find the following: 
a. F(4,4) b. F(2,1) c. FB,2) d. FC, 5) 
Let J; = {0, 1, 2, 3, 4}, and define G: J; X J; —> 


J; X J; as follows: For each (a, b) E J; X Js, 
G(a, b) = ((2a + 1) mod 5, (3b — 2) mod 5). 
Find the following: 
a. G(4,4) b. G(2,1) « G(3,2) d. G(1,5) 


Let J; = {0, 1, 2, 3, 4}, and define functions 
f: Js — J; and g: J; — J; as follows: For each 
yE Js, 


f(x) = (x +4) mod 5 and 

g(x) = ec + 3x+ 1) mod 5. 
Is f = g? Explain. 
Define functions H and K from R to R by the 
following formulas: 
For every x E R, 

A(x) =|x]+1 

Does H = K? Explain. 


and K(x) = [xl]. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


23. 


Let F and G be functions from the set of all real 
numbers to itself. Define the product functions 
F-G: R — R and G- F: R — Ras follows: For 
every x E R, 
(F-G)(x) = Fœ: GA) 
(GEX) = G(x): F(x). 
Does F-G = G:F? Explain. 
Let F and G be functions from the set of all 
real numbers to itself. Define new functions 
F— G: R— R and G — F : R= Ras follows: For 
every x E R, 
(F— G)(x) = F@) —G@) 
(G—F)(x) = G(x) — F(x). 
Does F — G = G — F? Explain. 
Use the definition of logarithm to fill in the blanks 


below. 
a. log, 8 = 3 because 


. logs (45) = —2 because 


logy 4 = 1 because 
. log; (3”) = n because 
. log, 1 = 0 because 


pang 


Find exact values for each of the following quanti- 
ties without using a calculator. 


a. log; 81 b. log, 1024 c logs(35) 
d. log, 1 e. logol) f. log; 3 
g. log: (25 


Use the definition of logarithm to prove that for 
any positive real number b with b + 1, log, b = 1. 


Use the definition of logarithm to prove that for 
any positive real number b with b + 1, log, 1 = 0. 


If b is any positive real number with b # 1 and x 


is any real number, b “ is defined as follows: 
: 1 
—— y Use this definition and the definition of 
1 
logarithm to prove that log, Pig —log,(u) 


for all positive real numbers u and b, with b # 1. 


. Use the unique factorization for the integers theo- 


rem (Section 4.4) and the definition of logarithm 
to prove that log; (7) is irrational. 


If b and y are positive real numbers such that 
log, y = 3, what is logi; (y)? Explain. 


24. 


25. 


26. 


27. 


28. 


29. 
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If b and y are positive real numbers such that 
log, y = 2, what is log, (y)? Explain. 


Let A = {2, 3,5} and B = {x, y}. Let pı and p3 

be the projections of A x B onto the first and 

second coordinates. That is, for each pair 

(a, b) E A X B, pia, b) = a and pra, b) = b. 

a. Find p,(2, y) and p,(5, x). What is the range 
of pı? 

b. Find p,(2, y) and p2(5, x). What is the range 
of p2? 


Observe that mod and div can be defined as func- 
tions from Z"?""*S X Z* to Z. For each ordered 
pair (n, d) consisting of a nonnegative integer n 
and a positive integer d, let 


mod (n, d) = n mod d (the nonnegative remainder 
obtained when n is divided by d). 


div (n, d) = n div d (the integer quotient 
obtained when n is divided by d). 


Find each of the following: 

a. mod (67, 10) and div (67, 10) 
b. mod (59, 8) and div (59, 8) 
c. mod (30, 5) and div (30, 5) 


Let S be the set of all strings of a’s and b’s. 

a. Define f: S— Z as follows: For each string s 
in S 

the number of b’s to the left 

of the left-most a in s 


f(s) = 


0 if s contains no a’s. 


Find f(aba), f(bbab), and f(b). What is the 
range of f? 

b. Define g: S — S as follows: For each string s 
in S, 


g(s) = the string obtained by writing the 
characters of s in reverse order. 


Find g(aba), g(bbab), and g(b). What is the 
range of g? 


Consider the coding and decoding functions E and 
D defined in Example 7.1.9. 

a. Find E(0110) and D(111111000111). 

b. Find E(1010) and D(OO0000111111). 


Consider the Hamming distance function defined 
in Example 7.1.10. 

a. Find H(10101, 00011). 

b. Find H(00110, 10111). 
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30. Draw arrow diagrams for the Boolean functions 34. Student C tries to define a function h: Q — Q by 
defined by the following input/output tables. the rule 


2 
Output (2) = z for all integers m and n with n # 0. 


Student D claims that / is not well defined. Justify 
student D’s claim. 


function R: U — Z as follows: For each x € U, 


0 
1 
5 35. Let U = {1, 2, 3, 4}. Student A tries to define a 
1 


R(x) is the integer y so that (xy) mod 5 = 1. 


b. Taput faint Student B claims that R is not well defined. Who 
— TE is correct: student A or student B? Justify your 
S answer. 


1 36. Let V = {1, 2, 3}. Student C tries to define a func- 
0 tion S: V— V as follows: For each x € V, 
1 


S(x) is the integer y in V so that (xy) mod 4 = 1. 


Student D claims that $ is not well defined. Who is 
right: student C or student D? Justify your answer. 


37. On certain computers the integer data type goes 
from —2,147,483,648 through 2,147,483,647. Let 
S be the set of all integers from —2,147,483,648 
through 2,147,483,647. Try to define a function 
f: S—S by the rule f(n) = n’ for each n in S. Is f 
31. Fill in the following table to show the values of all well defined? Explain. 


possible two-place Boolean functions. 38. Let X = {a, b, c} and Y = {r, s, t, u, v, w}. Define 


Input | fil f2 f3 Sa Ss Sol Fai Ss fo Fio Su Siz | Sia Sia Sis Fi a follows: fla) = v, f(b) = vand 


1 1 a. Draw an arrow diagram for f. 


ol=lolejļ]olejojl=e|> 


= | O| S| Goj= 


1 0 b. Let A = {a, b}, C = {t}, D = {u, v}, and 
oi E = {r, s}. Find f (A), f (X), f (C), f(D), 
f'E), and fY). 
0 0 
39. Let X = {1,2,3,4} and Y = {a, b, c,d, e}. 
32. Consider the three-place Boolean function f defined Define g : X — Y as follows: g(1) = a, g(2) = a, 
by the following rule: For each triple (x4, X2, x3) of g(3) = a, and g(4) = d. 
0’s and 1’s, a. Draw an arrow diagram for g. 
b. Let A = {2,3}, C = {a}, and D = {b, c}. Find 
f, Xr, x3) = (4x, + 3X5 + 2x3) mod 2. g(A), g(X), g (©), g (D), and g (Y). 
a. Find f(1, L, 1 and f(O, 0, D. H 40. Let X and Y be sets, let A and B be any sub- 
b. Describe fusing an input/output table. sets of X, and let F be a function from X to Y. 
33. Student A tries to define a function g : Q — Z by Fill in the blanks in the following proof that 
the rule F(A) U F(B) C F(A U B). 
m ; . Proof: Let y be any element in F(A) U F(B). [We 
E n = m~n, for all integers m and n with n # 0. must show that y is in F(A U B).] By definition of 
: i 
union, 


Student B claims that g is not well defined. Justify 


student B’s claim. Case 1, y E F(A): In this case, by definition of 


F(A), y = F(x) for “)_ x € A. Since A C A U B, 
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it follows from the definition of union that 

xe Mii). Hence, y = F(x) for some x E A UB, 
and thus, by definition of F(A U B), y E ®©). 
Case 2, y E F(B): In this case, by definition of 

F(B), (vy) _ for some x E B. Since B CAUBit 
follows from the definition of union that _Y))_. Thus 
yE F(A UB). 


Therefore, regardless of whether y E F(A) or 
y E F(B), we have that y E F(A U B) [as was to 
be shown]. 


In 41-49 let X and Y be sets, let A and B be any subsets of 
X, and let C and D be any subsets of Y. Determine which 
of the properties are true for every function F from X to Y 
and which are false for at least one function F from X to Y. 
Justify your answers. 


41. If A C B then F(A) C F(B). 
42. F(A N B) C F(A) N F(B) 
43. F(A) N F(B) C F(A N B) 


44. For all subsets A and B of X, F(A — B) = 
F(A) — F(B). 


45. For all subsets C and D of Y, if C C D, then 
F (©) C F ©). 


H 46. For all subsets C and D of Y, 


F (CUD)=F (QOUF '(D). 
47. For all subsets C and D of Y, 
F(C A D) = FCO) N FD). 


ANSWERS FOR TEST YOURSELF 
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48. For all subsets C and D of Y, 
F`!(C— D) =F \(Q-F ‘(D). 
49. F(F (GC) CC 


50. Given a set S and a subset A, the characteristic 
function of A, denoted y4, is the function defined 
from S to Z with the property that for each u € S, 


a- [1 ifuea 
XAW TSO ifuga. 


Show that each of the following holds for all sub- 

sets A and B of S and every u E S. 

a. Xanglu) = xau): Xg) 

b. Xaugu) = xau) + xglu) — xau) Xg) 
Each of exercises 51-53 refers to the Euler phi function, 
denoted ¢, which is defined as follows: For each integer 
n=1, d(n) is the number of positive integers less than or 
equal to n that have no common factors with n except +1. 
For example, (10) = 4 because there are four positive 
integers less than or equal to 10 that have no common 
factors with 10 except +1—namely, 1, 3, 7, and 9. 


51. Find each of the following: 
a. (15) b. (2) c. (5) 
d. (12) e. dll) f. fd) 

52. Prove that if p is a prime number and n is an inte- 
ger with n = 1, then (p^) = p” — p"™!. 


H 53. Prove that there are infinitely many integers n for 


which (n) is a perfect square. 


1. the unique output element in Y that is related to x by f 

2. the value of f at x; the image of x under f; the output of f 
for the input x 3. the set of all y in Y such that f(x) = y 

4. an inverse image of y under f; a preimage of y 

5. {x E X| f(x) = y}; the inverse image of y 6. f(x) = g(x) 


for every x EX 7. the exponent to which b must be raised 
to obtain x (Or: the real number y such that x = b”) 

8. {y E Y| y = f(a) for some x E A} (Or: {f(x)|x € A}) 

9. {x EX| fx) EC) 


One-to-One, Onto, and Inverse Functions 


Don’t accept a statement just because it is printed. —Anna Pell Wheeler, 1883-1966 


In this section we discuss two important properties that functions may satisfy: the property 
of being one-to-one and the property of being onto. Functions that satisfy both properties 
are called one-to-one correspondences or one-to-one onto functions. When a function is a 
one-to-one correspondence, the elements of its domain and co-domain match up perfectly, 
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and we can define an inverse function from the co-domain to the domain that “undoes” 
the action of the function. 


One-to-One Functions 


In Section 7.1 we noted that a function may send several elements of its domain to the same 
element of its co-domain. In terms of arrow diagrams, this means that two or more arrows 
that start in the domain can point to the same element in the co-domain. On the other hand, 
if no two arrows that start in the domain point to the same element of the co-domain then 
the function is called one-to-one or injective. For a one-to-one function, each element of 
the co-domain is the image of at most one element of the domain. 


Let F be a function from a set X to a set Y. F is one-to-one (or injective) if, and only 
if, for all elements x, and x, in X, 


if F(x) = F(%), then X = Xo, 
or, equivalently, if x, # x, then F(x) # F (x2). 
Symbolically: 


F: X — Y is one-to-one = Vx, x, E X, if F(x,) = F (>) then x; = x2. 


To obtain a precise statement of what it means for a function not to be one-to-one, take 
the negation of one of the equivalent versions of the definition above. Thus: 


A function F: X — Y is not one-to-one < Jelements x, and x, in X with 
F(x) = F(x2) and x; # x2. 


In other words, if elements x, and x, can be found that have the same function value but 
are not equal, then F is not one-to-one. 

In terms of arrow diagrams, a one-to-one function can be thought of as a function that 
separates points. That is, it takes distinct points of the domain to distinct points of the co- 
domain. A function that is not one-to-one fails to separate points. In other words, at least 
two points of the domain are taken to the same point of the co-domain. This distinction is 
illustrated in Figures 7.2.1(a) and 7.2.1(b). 


X = domain of F F Y =co-domain of F 
—- 
Xe Any two distinct elements 
of X are sent to two 
mae distinct elements of Y. 


FIGURE 7.2.1(a) A One-to-One Function Separates Points 


X = domain of F F Y = co-domain of F 


a 


Two distinct elements 
of X are sent to 
the same element of Y. 


FIGURE 7.2.1(b) A Function That Is Not One-to-One Collapses Points Together 
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Se eeae Identifying One-to-One Functions Defined on Finite Sets 
a. Do either of the arrow diagrams in Figure 7.2.2 define one-to-one functions? 


Domain of F Co-domain of F Domain of G Co-domain of G 


FIGURE 7.2.2 


b. Let X = {1, 2,3} and Y = {a, b, c, d}. Define H: X — Y as follows: H(1) = c, 
H(2) = a, and H(3) = d. Define K: X — Y as follows: K(1) = d, K(2) = b, and 
K(3) = d. Is either H or K one-to-one? 


Solution 


a. F is one-to-one but G is not. F is one-to-one because no two different elements of X 
are sent by F to the same element of Y. G is not one-to-one because the elements a and 
c are both sent by G to the same element of Y: G(a) = G(c) = w buta # c. 


b. H is one-to-one but K is not. H is one-to-one because each of the three elements of 
the domain of H is sent by H to a different element of the co-domain: H(1) # H(2), 
H(1) + H(3), and H(2) + H(3). K, however, is not one-to-one because K(1) = K(3) = d 
but 1 Æ 3. E 


Consider the problem of writing a computer algorithm to check whether a function 
F is one-to-one. If F is defined on a finite set and there is an independent algorithm 
or a table of values for F, then an algorithm to check whether F is one-to-one can 
be written as follows: Represent the domain of F as a one-dimensional array a[1], 
a[2],..., a[n] and use a nested loop to examine all possible pairs (a[i], a[j]), where 
i < j. If there is a pair (aļi], a[j]) for which F(aļi]) = F(a[j]) and aļi] # alj], then F is 
not one-to-one. If, however, all pairs have been examined without finding such a pair, 
then F is one-to-one. You are asked to write such an algorithm in exercise 57 at the 
end of this section. 


One-to-One Functions on Infinite Sets 


Now suppose fis a function defined on an infinite set X. By definition, fis one-to-one if, 
and only if, the following universal statement is true: 


Vx, X2 © X, if f(x) = f(x) then x, = x. 
Thus, to prove fis one-to-one, you will generally use the method of direct proof: 


suppose x, and x, are elements of X such that f(x,) = f(%) 


and show that x; = x2. 


To show that fis not one-to-one, you will ordinarily 


find elements x, and x, in X so that f(x,) = f(x) but x, # x2. 
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ZEWG YA Proving or Disproving That Functions Are One-to-One 
Define f: R — Rand g: Z — Z by the rules 
fœ) =4x—-1 forallxER 


and g(n) =n’ forallnE Z. 


a. Is f one-to-one? Prove or give a counterexample. 


b. Is g one-to-one? Prove or give a counterexample. 


Solution It is usually best to start by taking a positive approach to answering ques- 
tions like these. Try to prove the given functions are one-to-one and see whether you run 
into difficulty. If you finish without running into any problems, then you have a proof. 
If you do encounter a problem, then analyzing the problem may lead you to discover a 
counterexample. 


a. The function f: R — R is defined by the rule 


f(x) =4x-1 foreach real number x. 


To prove that fis one-to-one, you need to prove that 
V real numbers x, and x, if f(x,) = f(x) then x; = x. 
Substituting the definition of f into the outline of a direct proof, you 
suppose x, and x, are any real numbers such that 4x, — 1 = 4x, —1, 
and show that x, = x». 


Can you reach what is to be shown from the supposition? Yes. Just add 1 to both sides 
of the equation in the supposition and then divide both sides by 4. 
This discussion is summarized in the following formal answer. 


Answer to (a): 


If the function f: R — R is defined by the rule f(x) = 4x — 1, for each real num- 
ber x, then fis one-to-one. 


Proof: Suppose x, and x, are real numbers such that f(x,) = f(x2). [We must show 
that x, = Xy.] By definition of f, 


4x,-1 = Ax, — 1. 
Adding 1 to both sides gives 
4x, = 4, 


and dividing both sides by 4 gives 
X% = X92, 


[as was to be shown]. 
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b. The function g: Z — Z is defined by the rule 


2 


g(n) =n’ foreach integer n. 


As above, you start as though you were going to prove that g is one-to-one. Substitut- 
ing the definition of g into the outline of a direct proof, you 


suppose n; and n, are integers such that n = n3, 
and try to show that n, = m. 


Can you reach what is to be shown from the supposition? No! It is quite possible for 
two numbers to have the same squares and yet be different. For example, 2? = (—2)} 
but 2 # —2. 

Thus, in trying to prove that g is one-to-one, you run into difficulty. But analyzing 
this difficulty leads to the discovery of a counterexample, which shows that g is not 
one-to-one. 

This discussion is summarized as follows: 


Answer to (b): 


If the function g: Z — Z is defined by the rule g(n) = n’, for all n € Z, then gis 
not one-to-one. 


Counterexample: 
Let n; = 2 and nm, = —2. Then by definition of g, 


g(nı) = g(2) = 2? = 4 and also 


gm) = g(-2) = (72° = 4. 
Hence g(n) = g(a) but ny Æ m, 


and so g is not one-to-one. 


Application: Hash Functions 


Imagine a university with 10,000 students each with a nine-digit ID number, which the 
university plans to link to student records. Placing the record with ID number n in position 
n of an array would waste computer memory space because only a small fraction of the 
billion possible nine-digit ID numbers are needed for the 10,000 students. 


Definition: Hash Function 


A hash function is a function defined from a larger, possibly infinite, set of data to 
a smaller fixed-size set of integers. 


To make it efficient for the university to store the records, a hash function is needed that 
(1) is one-to-one and (2) has a co-domain that is very much smaller than one billion. Most 
hash functions are modifications of mod functions and are defined using prime numbers 
to increase the chance that their values will be scattered rather than clustered together. In 
addition, making their co-domains 50% to 100% larger than their domains makes it more 
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likely that they will be one-to-one. Nonetheless, two input values may collide, that is, have 
the same output value, and various methods are used to avoid such a collision. One of these 
is illustrated in the following very much simplified example to address the university’s 
situation. 


Computing Values of a Hash Function 


Instead of 10,000 students, suppose there are only 6. Define a function H, from the set of 
student ID numbers to the set {0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10} as follows: 


H(n) = n mod 11 for each ID number n. 


To compute values of H either use a calculator or a computer with a built-in mod function 
or use the formula n mod 11 = n— (n div 11) from Section 4.5. In other words, divide n 
by 11, multiply the integer part of the result by 11, and subtract that number from n. As an 
example, since 328343419/11 = 29849401.73. 


(328343419) = 328343419 — (11-29849401) = 8 
To store the link to the record for the student with ID number n, start by computing H(n). 


For instance, if the ID numbers are 328343419, 356633102, 223799061, and 513408716, the 
corresponding H-values are as shown in Table 7.2.1. 


TABLE 7.2.1 
0 
356633102 1 
223799061 2 
3 
4 
5 
6 
513408716 7 
328343419 8 
9 
10 


As noted above, H may not be one-to-one: two different ID numbers could have the 
same H-value. But because it is important for each value in the table to link to a single 
student record, a collision resolution method is needed. One of the simplest, called a 
linear probe, works as follows: If H(n) is already occupied when a new student ID number 
is input, start from H(n) in the table and search downward to put a link for the student’s 
record in the first empty position that occurs; if no empty positions remain going down, go 
up to the beginning of the table and search from there. Because 11 is greater than 6, empty 
positions are guaranteed. 

Suppose the ID number for another student is 607275830. Find the position in the table 
for this number. 
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Solution When you compute H(607275830) you find that it equals 7, which is already 
occupied by the link to the record for ID number 513408716. Searching downward from 
position 7, you find that position 8 is also occupied but position 9 is free. 


607275830 > 7 — 8 => 9 
t t t 


occupied occupied free 


Therefore, you put the link for the record with ID number 607275830 in position 9. a 


A special class of hash functions, known as cryptographic hash functions, is used to 
secure digital data. A cryptographic hash function is designed to satisfy the following 
conditions: 


1. It is a function from bit strings to bit strings of a fixed length. 
2. It is close to being one-to-one: the probability of collisions is very small. 


3. It is close to being a one-way function: given any bit string in its range, finding the in- 
verse image of the string is computationally very difficult. 


4. Its values can be quickly computed. 
5. A very slight change in an input string results in an extensive change in the output string. 


One use of cryptographic hash functions is to provide password security. Passwords in a 
company’s user account file are almost never stored as “clear” text. A basic protection is to 
apply a cryptographic hash function to the passwords, or to a combination consisting of the 
passwords plus extra content provided by the company, and to store only the values of the hash 
function, called the hashes. To log in, a user keys in a password, which is immediately hashed 
(meaning that its value is input to the hash function), and the result is compared to the hash 
stored in the account file. In order to complete the login process, the two hashes have to agree. 

A somewhat similar kind of hashing is used for checking the integrity of files. When 
a file is intended to be copied, a cryptographic hash function is applied to it. The accu- 
racy of a copy is checked by applying the same hash function. If the two hashes agree, 
the copy is accepted. Similarly, when A needs to send a file through a possibly insecure 
network to B, A can first apply a cryptographic hash function to the file. Then A sends 
the hash separately to B through a secure network, and when B receives the file, B applies 
the same hash function to it that A used. B compares the result with the hash received 
from A, and if the two agree, B can have confidence that the file was unchanged during 
transmission. 

Cryptographic hash functions are also used in blockchain technology. A blockchain is a 
public register on the Internet made up of linked blocks consisting of records or programs. To 
make it impossible to change the data in any part of a block, each includes a time stamp plus a 
hash computed from all the previous parts of the blockchain. To keep the system operating at a 
reasonable pace and to validate additions, a time-consuming proof of work is required to add a 
block to the blockchain. A commonly used proof of work requires a programmer to use repeat- 
ed random trials, to discover the input that needs to be added to the data in the blockchain so that 
when the two are hashed together the output will contain a specified number of initial zeroes. 


Onto Functions 


It was noted in Section 7.1 that there may be an element of the co-domain of a function 
that is not the image of any element in the domain. On the other hand, it is possible for 
every element in a function’s co-domain to be the image of some element in its domain. 
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Example 7.2.4 


CHAPTER7 PROPERTIES OF FUNCTIONS 


Such a function is called onto or surjective. When a function is onto, its range is equal to 
its co-domain. 


Let F be a function from a set X to a set Y. F is onto (or surjective) if, and only if, 
given any element y in Y, it is possible to find an element x in X with the property 
that y = F(x). 

Symbolically: 


F:X—/Yisonto < Vy € Y, Jx E X such that F(x) = y. 


To obtain a precise statement of what it means for a function not to be onto, take the 
negation of the definition of onto: 


F: X — Yisnotonto < dAyin Y such that Vx E X, F(x) £ y. 


That is, there is some element in Y that is not the image of any element in X. 

In terms of arrow diagrams, a function is onto if each element of the co-domain has an 
arrow pointing to it from some element of the domain. A function is not onto if at least one 
element in its co-domain does not have an arrow pointing to it. This is illustrated in Figures 
7.2.3(a) and 7.2.3(b). 


X = domain of F F Y = co-domain of F 
— 
\— 2? 


Each element y in 


BEDES Y equals F(x) for 


at least one x in X. 


FIGURE 7.2.3(a) A Function That Is Onto 


X = domain of F F Y = co-domain of F 


— 


ae | At least one element in Y 
| does not equal F(x) 
for any x in X. 
FIGURE 7.2.3(b) A Function That Is Not Onto 


Identifying Onto Functions Defined on Finite Sets 


a. Do either of the arrow diagrams in Figure 7.2.4 define onto functions? 


Domain of F Co-domain of F Domain of G Co-domain of G 


FIGURE 7.2.4 
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b. Let X = {1, 2, 3,4} and Y = {a, b, c}. Define H: X — Y as follows: H(1) = c, 
A(2) = a, H(3) = c, H(4) = b. Define K: X — Y as follows: K(1) = c, K(2) = b, 
K(3) = b, and K(4) = c. Is either H or K onto? 


Solution 


a. Fis not onto because b # F(x) for any x in X. G is onto because each element of Y 
equals G(x) for some x in X: a = G(3), b = G(1), c = G(2) = G(4), and d = G(5). 


b. H is onto but K is not. H is onto because each of the three elements of the co-domain 
of H is the image of some element of the domain of H: a = H (2), b = H(A), and 
c = H(1) = H (3). K, however, is not onto because a # K(x) for any x in {1, 2, 3, 4}. 
E 


It is possible to write a computer algorithm to check whether a function F is onto, pro- 
vided F is defined from a finite set X to a finite set Y and there is an independent algorithm 
or table of values for F. Represent X and Y as one-dimensional arrays a[1], a[2], . . . , a[n] 
and b[1], b[2], . . . , b[m], respectively. Use a nested loop to pick each element y of Y in turn, 
and search through the elements of X to find an x such that y is the image of x. If any search 
is unsuccessful, then F is not onto. If each such search is successful, then F is onto. You are 
asked to write such an algorithm in exercise 58 at the end of this section. 

Onto Functions on Infinite Sets 


Now suppose F is a function from a set X to a set Y, and suppose Y is infinite. By definition, 
F is onto if, and only if, the following universal statement is true: 


Vy E Y, dx E X such that F(x) = y. 


Thus to prove F is onto, you will ordinarily use the method of generalizing from the ge- 
neric particular: 


suppose that y is any element of Y 
and show that there is an element x in X with F(x) = y. 
To prove F is not onto, you will usually 


find an element y of Y such that y # F(x) for any x in X. 


eee Proving or Disproving That Functions Are Onto 
Define f: R — R and A: Z — Z by the rules 
f(x) = 4x- 1 for each x E R 


and h(n) = 4n — 1 for each n E Z. 


a. Is fonto? Prove or give a counterexample. 
b. Is h onto? Prove or give a counterexample. 
Solution 


a. The best approach is to start trying to prove that fis onto and be alert for difficulties 
that might indicate that it is not. Now f: R — R is the function defined by the rule 


f(x) = 4x-1 foreach real number x. 
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A 


Caution! This scratch 
work only proves what x 
has to be if it exists. The 
scratch work does not 
prove that x exists. 
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To prove that fis onto, you must prove 
Vy E Y, Jx E X such that f(x) = y. 


Substituting the definition of finto the outline of a proof by the method of generalizing 
from the generic particular, you 


suppose y is a real number 


and show that there exists a real number x such that y = 4x — 1. 


Scratch Work: If such a real number x exists, then 


4x-l=y 
4x =y+1 by adding 1 to both sides 
y+1 
x= A by dividing both sides by 4. 


Thus if such a number x exists, it must equal (y + 1)/4. Does such a number exist? Yes. 
To show this, let x = (y + 1)/4, and then make sure that (1) x is a real number and that (2) 


really does send x to y. The following formal answer summarizes this process. 


Answer to (a): 


If f: R — R is the function defined by the rule f(x) = 4x — 1 for each real num- 
ber x, then fis onto. 


Proof: Let y E R. [We must show that3x in R such that f(x) = y.J Letx = (y + 1/4. 
Then x is a real number since sums and quotients (other than by 0) of real num- 
bers are real numbers. It follows that 


y+1 
fo =f “4 by substitution 
y+1 
=4-(- 4 —1 by definition of f 


=(y+1)—1 =y _ by basic algebra, 


[as was to be shown]. 


b. The function h: Z — Z is defined by the rule 


h(n) = 4n—1 foreach integer n. 


To prove that h is onto, you must prove that 
V integer m, J an integer n such that h(n) = m. 


Substituting the definition of A into the outline of a proof by the method of generaliz- 
ing from the generic particular shows that you need to 


suppose m is any integer 


and show that there is an integer n with 4n — 1 = m. 


Note That the quantity 
b* is a real number for 
any real number x follows 
from the least-upper- 
bound property of the 
real number system. (See 
Appendix A.) 
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Can you reach what is to be shown from the supposition? No! If 4n — 1 = m, 


m+1 
n= = by adding 1 and dividing by 4. 
But n must be an integer. And when, for example, m = 0, 
O+1 1 
n==— = 
4 4 


which is not an integer. 

Thus, in trying to prove that / is onto, you run into difficulty, and this difficulty 
reveals a counterexample that shows A is not onto. 

This discussion is summarized in the following formal answer. 


Answer to (b): 


If the function h: Z — Z is defined by the rule h(n) = 4n — 1 for each integer n, 
then / is not onto. 


Counterexample: The co-domain of h is Z and 0 € Z. But h(n) # 0 for any 
integer n. For if h(n) = 0, then 


4n-—1=0 by definition of h 
which implies that 
4n = 1 by adding | to both sides 
and so 
1 
n= A by dividing both sides by 4. 


1 
But ri is not an integer. Hence there is no integer n for which f(n) = 0, and thus 


fis not onto. 


Relations between Exponential and Logarithmic Functions 
For positive numbers b # 1, the exponential function with base b, denoted exp,, is the 
function from R to R* defined as follows: For each real number X, 
expa) = b*, 
where b° = 1 and b™ = 1/b*. 


When working with the exponential function, it is useful to recall the laws of exponents 
from elementary algebra. 


Laws of Exponents 


If b and c are any positive real numbers and u and v are any real numbers, the fol- 
lowing laws of exponents hold true: 


b" b = ptt” 721 
bY = b” 727 
b" u—v 
mig b 7.2.3 
(bo)! = b"c" 724 
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In Section 7.1 the logarithmic function with base b was defined for any positive number 
b # 1 tobe the function from R” to R with the property that for each positive real number x, 


log, (x) = the exponent to which b must be raised to obtain x. 
Or, equivalently, for each positive real number x and real number y, 
log,x=y & bD=x. 


It can be shown using calculus that both the exponential and logarithmic functions are one- 
to-one and onto. Therefore, by definition of one-to-one, the following properties hold true: 


For any positive real number b with b # 1, 
if b“ = b” then u = v for all real numbers u and v, 125 
and 
if log,u = log,v then u = v for all positive real numbers u and v. 126 


These properties are used to derive many additional facts about exponents and logarithms. 
In particular we have the following properties of logarithms. 


Theorem 7.2.1 Properties of Logarithms 


For any positive real numbers b, c, x, and y with b # 1 and c # 1 and for every real 
number a: 


a. log, (xy) = log, x + log, y 


b. roe(?) = log, x — log, y 


c. log, (x") = alog,x 
log, x 


Theorem 7.2.1(d) is proved in the next example. You are asked to prove the remainder 
of the theorem in exercises 33-35 at the end of this section. 


Using the One-to-Oneness of the Exponential Function 


Use the definition of logarithm, the laws of exponents, and the one-to-oneness of the ex- 
ponential function (property 7.2.5) to prove part (d) of Theorem 7.2.1: For any positive real 
numbers b, c, and x, with b # 1 andc # 1, 


log, x 


log.x = : 
logyc 


Solution Suppose positive real numbers b, c, and x are given with b # 1 and c # 1. Let 


(1) u = log,c (2) v = log.x (3) w = logpx. 
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Then, by definition of logarithm, 
a^) c=b" Q)x=c B) x= b". 
Substituting (1’) into (2’) and using one of the laws of exponents gives 
x= = (b =b"  py7.2.2. 
Now by (3), x = b” also. Hence 
bi’ = b”, 
and so by the one-to-oneness of the exponential function (property 7.2.5), 
uv = w. 
Substituting from (1), (2), and (3) gives that 
(logpc)(log. x) = logy x. 


And dividing both sides by log,c (which is nonzero because c # 1) results in 


Computing Logarithms with Base 2 on a Calculator 


In computer science it is often necessary to compute logarithms with base 2. Most calcu- 
lators do not have keys to compute logarithms with base 2 but do have keys to compute 
logarithms with base 10 (called common logarithms and often denoted simply log) and 
logarithms with base e (called natural logarithms and usually denoted In). Suppose your 
calculator shows that In 5 = 1.609437912 and In 2 = 0.6931471806. Use Theorem 7.2.1(d) 
to find an approximate value for log, 5. 


Solution By Theorem 7.2.1(d), 


In5 _ 1.609437912 
In2  0.6931471806 


log) 5 = = 2,321928095. m 


One-to-One Correspondences 


Consider a function F : X — Y that is both one-to-one and onto. Given any element x in X, 
there is a unique corresponding element y = F(x) in Y (since F is a function). Also given 
any element y in Y, there is an element x in X such that F(x) = y (since F is onto) and there 
is only one such x (since F is one-to-one). Thus, a function that is one-to-one and onto sets 
up a pairing between the elements of X and the elements of Y that matches each element of 
X with exactly one element of Y and each element of Y with exactly one element of X. Such 
a pairing is called a one-to-one correspondence or bijection and is illustrated by the arrow 
diagram in Figure 7.2.5. One-to-one correspondences are often used as aids to counting. 
The pairing of Figure 7.2.5, for example, shows that there are five elements in the set X. 


X = domain of F F Y = co-domain of F 


FIGURE 7.2.5 An Arrow Diagram for a One-to-One Correspondence 
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Definition 


A one-to-one correspondence (or bijection) from a set X to a set Y is a function 
F: X — Y that is both one-to-one and onto. 


A Function from a Power Set to a Set of Strings 


Let P({a, b}) be the set of all subsets of {a, b} and let S be the set of all strings of length 
2 made up of 0’s and 1’s. Then P({a, b}) = {Ø, {a}, {b}, {a, b} and S = {00, 01, 10, 11}. 
Define a function h from P({a, b}) to S as follows: Given any subset A of {a, b}, a is either 
in A or not in A, and b is either in A or not in A. If a is in A, write a 1 in the first position 
of the string A(A); otherwise write a 0 there. Similarly, if b is in A, write a 1 in the second 
position of the string h(A); otherwise write a 0 there. This definition is summarized in the 
following table. 


h 
anne 
Subset A of {a, b} Status of a in A Status of b in A String h(A) in S 
Ø not in not in 00 
{a} in not in 10 
{b} not in in 01 
{a, b} in in 11 


Is A a one-to-one correspondence? 


Solution The arrow diagram shown in Figure 7.2.6 shows clearly that h is a one-to-one 
correspondence. It is onto because each element of S has an arrow pointing to it. It is one- 
to-one because each element of S has no more than one arrow pointing to it. 


A({a, b}) S 
— 


h 


FIGURE 7.2.6 


ZEWG AString-Reversing Function 


Let T be the set of all finite strings of x’s and y’s. Define g: T — T by the following rule: 
For each string s € T, 


g(s) = the string obtained by writing the 
characters of s in reverse order. 


Is g a one-to-one correspondence from T to itself? 


Solution The answer is yes. To show that g is a one-to-one correspondence, it is neces- 
sary to show that g is one-to-one and onto. 

To see that g is one-to-one, suppose that for some strings s and sy in T, g(s|) = g(s2). [We 
must show that sı = s2.] Now to say that g(s1) = g(s) is the same as saying that the string 
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obtained by writing the characters of s; in reverse order equals the string obtained by writ- 
ing the characters of s, in reverse order. But if s; and s, are equal when written in reverse 
order, then they must be equal to start with. In other words, sı = sz [as was to be shown]. 

To show that g is onto, suppose t is any string in T. [We must find a string s in T such that 
g(s) = t.] Let s = g(t). By definition of g, s = g(t) is the string in T obtained by writing the 
characters of t in reverse order. But when the order of the characters of a string is reversed 
once and then reversed again, the original string is recovered. Thus 


g(s) = g(g(t)) = the string obtained by writing the characters 
of t in reverse order and then writing those 
characters in reverse order again 


=t, 


[as was to be shown]. E 


A Function of Two Variables 
Define a function F: R X R — R X R as follows: For every (x, y) E R X R, 
F(x, y) = Œœ +y, x= y). 
Is F a one-to-one correspondence from R X R to itself? 


Solution The answer is yes. Showing that F is a one-to-one correspondence requires 
showing both that F is one-to-one and that F is onto. 


Proof that F is one-to-one: Suppose that (x1, yı) and (x2, y2) are any ordered pairs in 
R X R such that 


F(x, y1) = FO, y2). 


[We must show that (x1, y1) = (%2, y2).] By definition of F, 


(x) + yr x1 7 y1) = Oy + Ya, X2 — Yo). 


For two ordered pairs to be equal, both the first and second components must be equal. 
Thus x, y1, X2, and y, satisfy the following system of equations: 


Xp Fy = XT Yo (1) 
X1— Yi = X2— V2. (2) 
Adding equations (1) and (2) gives that 
2x, = 2x, andso x) = xX. 
Substituting x, = x2 into equation (1) yields 
xı tyi =x, +y2, andso y, = yo. 
A Thus, by definition of equality of ordered pairs, (x1, y1) = (Xo, y2) [as was to be shown]. 


Scratch work for the proof that F is onto: To prove that F is onto, suppose that you have 
any ordered pair—say (u, v)—in the co-domain R X R and then show that there is an 
ordered pair in the domain that is sent to (u, v) by F. To do this, suppose provisionally that 
you have found such an ordered pair, say (7, s). Then, on the one hand, 


Caution! This scratch 

work only shows what 

(r, s) has to be if it exists. 

The scratch work does not 

prove that (r, s) exists. F(r, s) = (u, v) because you are supposing 
that F sends (r, s) to (u, v) 
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and, on the other hand, 
Fr, s) = (rt+s,r—s) by definition of F. 
Equating the right-hand sides of these two equations gives 
(r+s,r—s) = (u, v). 
By definition of equality of ordered pairs this means that 


r+s=u (1) 
r—-s=v. (2) 


To solve for r and s in terms of u and v, first add equations (1) and (2) to get 


uty 
= 


2r=ut+y, andso r= 


Then subtract equation (2) from equation (1) to obtain 


u—v 


2 


2s=u-—v, andso s= 


Thus, if F sends (r, s) to (u, v), then r = & x D and s = a 


. To turn this scratch work into 


a proof, you need to make sure that (1) (# 7 — 7 ») is in the domain of F, and (2) that F re- 


ally does send (=, eal to (u, v). 


Proof that F is onto: Suppose (u, v) is any ordered pair in the co-domain of F. [We will 
show that there is an ordered pair in the domain of F that is sent to (u, v) by F.] Let 


uty u-v 
and s= 
2 2 


Then (r, s) is an ordered pair of real numbers, and so it is in the domain of F. In addition: 


r= 


utv u-v a 
by substitution 


Fons) =F 2° 72 


UTE AN utv u-v 
2 2°” 2 2 


) by definition of F 


utvt+u-v u+tv—-uty 
2 Í 2 


= (u, v) by algebra 


[as was to be shown]. E 


Inverse Functions 


If F is a one-to-one correspondence from a set X to a set Y, then there is a function from Y 
to X that “undoes” the action of F; that is, it sends each element of Y back to the element of 
X that it came from. This function is called the inverse function for F. 
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Theorem 7.2.2 


Suppose F: X — Y is a one-to-one correspondence; in other words, suppose F is 
one-to-one and onto. Then there is a function F ~': Y — X that is defined as follows: 
Given any element y in Y, 


F‘(y) = that unique element x in X such that F(x) equals y. 


Or, equivalently, 


The proof of Theorem 7.2.2 follows immediately from the definition of one-to-one and 
onto. Given any element y in Y, there is an element x in X with F(x) = y because F is onto; 
x is unique because F is one-to-one. 


Definition 


The function F`! of Theorem 7.2.2 is called the inverse function for F. 


Note that according to this definition, the logarithmic function with base b > 0 and 
b # 1 is the inverse of the exponential function with base b. 

The diagram that follows illustrates the fact that an inverse function sends each element 
back to where it came from. 


X = domain of F Y = co-domain of F 


ZEWG ALE Finding an Inverse Function for a Function Given by an Arrow Diagram 
Define the inverse function for the one-to-one correspondence h given in Example 7.2.8. 


Solution The arrow diagram for h`! is obtained by tracing the h-arrows back from S to 
P({a, b}) as shown below. 


P({a, b}) ho! S 


h00) =Ø h!(10)= {a} 
h!(01) = {b} h-!(11) = {a, b} 


ZEWG AVA Finding an Inverse Function for a Function Given in Words 


Define the inverse function for the one-to-one correspondence g given in Example 7.2.9. 
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Solution The function g: T — T is defined by the following rule: 
For all strings t in T, 


g(t) = the string obtained by writing the 
characters of ¢ in reverse order. 


Now if the characters of t are written in reverse order and then written in reverse order 
again, the original string is recovered. Thus given any string t in T, 


g A = the unique string that, when written 
in reverse order, equals t 

the string obtained by writing the 
characters of t in reverse order 

= g(t). 


Hence gt: T — T is the same as g, or, in other words, ¢* =g. E 


SEWCIO AFA Finding an Inverse Function for a Function Given by a Formula 
The function f: R — R defined by the formula 
f(x) = 4x-1 foreach real number x 


was shown to be one-to-one in Example 7.2.2 and onto in Example 7.2.5. Find its inverse 
function. 


Solution For any [particular but arbitrarily chosen] y in R, by definition of f~', 


f TIo) = that unique real number x such that f(x) = y. 


But 
fœ) =y 
< 4x-l=y by definition of f 
y+1 
> x= a” by algebra. 
= yt+1 
Hence f '(y) = a. m 


The following theorem follows easily from the definitions. 


Theorem 7.2.3 


If X and Y are sets and F: X — Y is one-to-one and onto, then F` !: YX is also 
one-to-one and onto. 


Proof: F7! is one-to-one: Suppose y; and y are elements of Y such that 
F` 'y1) = F'O). [We must show that y, = y>.] Let x = F_'(y,) = F~ 'y2). Then 
x E X, and by definition of a 

F(x) =y; since x = F!) 
and F(x) =y since x = F7 !). 
Consequently, yı = yz because each is equal to F(x). [This is what was to be shown.] 


F™ is onto: Suppose x € X. [We must show that there exists an element y in Y such 
that F7!(y) = x.] Let y = F(x). Then y € Y, and by definition of F~', F7!) = x 
[as was to be shown]. 
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TEST YOURSELF 


1. 


. The following two statements are 


Example 7.2.14 


Define the inverse function F 


given in Example 7.2.10. 


Solution The solution to Example 7.2.10 shows that F eae 7 


one-to-one, this means that 


(E u-v 
2 + 2 


7.2 ONE-TO-ONE, ONTO, AND INVERSE FUNCTIONS 
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Finding an Inverse Function for a Function of Two Variables 


: R X R—R X R for the one-to-one correspondence 


utv u-v : 
us) = (u, v). Because F is 


) is the unique ordered pair in the domain of F that is sent to (u, v) by F. 


Thus, F Zl is defined as follows: For each ordered pair (u, v) E R X R, 


Fu, v) = ( 


utv u-v 
2 OJ 


If F is a function from a set X to a set Y, then F is 
one-to-one if, and only if, 


. If F is a function from a set X to a set Y, then F is 


not one-to-one if, and only if, 


. If F is a function from a set X to a set Y, then F is 


onto if, and only if, 


. If F is a function from a set X to a set Y, then F is 


not onto if, and only if, 


Vu, v E U, if H(u) = H(v) then u = v. 
Vu, v E U, if u # v then H(u) # H(v). 


EXERCISE SET 7.2 


1. The definition of one-to-one is stated in two ways: 


2. 


H 3. When asked to state the definition of one-to-one, a 


Vx, X2 © X, if F(x) = F(x) then x, = x, 
and Vx1, x. © X, if x, # x» then F(x) # F(x»). 
Why are these two statements logically equivalent? 


Fill in each blank with the word most or least. 

a. A function F is one-to-one if, and only if, each 
element in the co-domain of F is the image of 
at one element in the domain of F. 

b. A function F is onto if, and only if, each ele- 
ment in the co-domain of F is the image of at 
—— one element in the domain of F. 


student replies, “A function fis one-to-one if, and 


10. 


Tl. 


. Given a function F: X — Y where X is an infinite 


set, to prove that F is one-to-one, you suppose 
that and then you show that 


Given a function F: X — Y where X is an infinite 
set, to prove that F is onto, you suppose that 
and then you show that 


. Given a function F: X — Y, to prove that F is not 


one-to-one, you 


. Given a function F: X — Y, to prove that F is not 


onto, you 


A one-to-one correspondence from a set X to a set 
Yisa that is ; 


If F is a one-to-one correspondence from a set X 
to a set Y and y is in Y, then F '(y) is___. 


only if, every element of X is sent by f to exactly 
one element of Y.” Give a counterexample to show 
that the student’s reply is incorrect. 


. Let f: X — Y bea function. True or false? A suf- 


ficient condition for f to be one-to-one is that for 
every element y in Y, there is at most one x in X 
with f(x) = y. Explain your answer. 


. All but two of the following statements are correct 


ways to express the fact that a function fis onto. 

Find the two that are incorrect. 

a. fis onto + every element in its co-domain is 
the image of some element in its domain. 

b. fis onto + every element in its domain has a 
corresponding image in its co-domain. 

c. fis onto & Vy E Y, Jx E X such that f(x) = y. 
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d. fis onto & Vx E X, dy E Y such that f(x) = y. 


e. fis onto + the range of fis the same as the 
co-domain of f. 
6. Let X = {1,5,9} and Y = {3,4, 7}. 
a. Define f: X — Y by specifying that 
fHD=4 fO=7, fY) =4. 
Is f one-to-one? Is f onto? Explain your 


answers. 
b. Define g: X — Y by specifying that 


8(1)=7, 8(5)=3, 20=4 


Is g one-to-one? Is g onto? Explain your 
answers. 


7. Let X = {a, b, c, d} and Y = {e, f, g}. Define 
functions F and G by the arrow diagrams below. 


Domain of F Co-domain of F 


Domain of G Co-domain of G 


a. Is F one-to-one? Why or why not? Is it onto? 
Why or why not? 

b. Is G one-to-one? Why or why not? Is it onto? 
Why or why not? 


8. Let X = {a,b,c} and Y = {d, e, f, g}. Define 
functions H and K by the arrow diagrams below. 


Domain of H Co-domain of H 
X Y 


E 
— 


Domain of K Co-domain of K 
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a. Is H one-to-one? Why or why not? Is it onto? 
Why or why not? 

b. Is K one-to-one? Why or why not? Is it onto? 
Why or why not? 


9. Let X = {1, 2,3}, Y = {1, 2, 3, 4}, and Z = {1, 2}. 

a. Define a function f: X — Y that is one-to-one 
but not onto. 

b. Define a function g : X — Z that is onto but 
not one-to-one. 

c. Define a function h: X — X that is neither 
one-to-one nor onto. 

d. Define a function k: X — X that is one-to-one 
and onto but is not the identity function on X. 


10. a. Define f: Z — Z by the rule f(n) = 2n, for 
every integer n. 
(i) Is f one-to-one? Prove or give a 
counterexample. 
(ii) Is f onto? Prove or give a counterexample. 


b. Let 2Z denote the set of all even integers. That 
is, 2Z = {n E€ Z |n = 2k, for some integer k}. 
Define h: Z — 2Z by the rule h(n) = 2n, for 
each integer n. Is h onto? Prove or give a coun- 
terexample. 


H 11. a. Define g: Z — Z by the rule g(n) = 4n — 5, 


for each integer n. 
(i) Is g one-to-one? Prove or give a 
counterexample. 
(ii) Is g onto? Prove or give a counterexample. 
b. Define G: R — R by the rule G(x) = 4x- 5 
for every real number x. Is G onto? Prove or 
give a counterexample. 


12. a. Define F: Z — Z by the rule F(n) = 2 — 3n, 
for each integer n. 


(i) Is F one-to-one? Prove or give a 
counterexample. 
(ii) Is F onto? Prove or give a counterexample. 
b. Define G: R — R by the rule G(x) = 2 — 3x 
for each real number x. Is G onto? Prove or 
give a counterexample. 


13. a. Define H: R— R by the rule A(x) = x, for 
each real number x. 
(i) Is H one-to-one? Prove or give a 
counterexample. 
(ii) Is H onto? Prove or give a counterexample. 
b. Define K: R” — R"""s by the rule 
K(x) = x, for each nonnegative real number x. 
Is K onto? Prove or give a counterexample. 


14. 


Explain the mistake in the following “proof.” 
Theorem: The function f: Z — Z defined by the 
formula f(n) = 4n + 3, for each integer n, is one- 
to-one. 

“Proof: Suppose any integer n is given. Then by 
definition of f, there is only one possible value 
for f(n)—namely, 4n + 3. Hence fis one-to-one.” 


In each of 15-18 a function f is defined on a set of real 
numbers. Determine whether or not f is one-to-one and 
justify your answer. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


x+1 
f(x) = ——,, for each number x # 0 
x 


f@ = —— for each real number x 
xt) 


3x—-1 


Xx 


, for each real number x # 0 


f(x) = 
xt+1 

f@ = a, for each real number x # 1 
= 


Referring to Example 7.2.3, assume that records 
with the following ID numbers are to be placed in 
sequence into Table 7.2.1. Find the position into 
which each record is placed. 

a. 417302072 

b. 364981703 

c. 283090787 


Define Floor: R — Z by the formula 

Floor(x) = |x], for every real number x. 

a. Is Floor one-to-one? Prove or give a 
counterexample. 

b. Is Floor onto? Prove or give a counterexample. 


Let S be the set of all strings of 0’s and 1’s, and 


define tie S —4 gronneg by 
L(s) = the length of s, for every string s in S. 


a. Is L one-to-one? Prove or give a counterex- 
ample. 
b. Is L onto? Prove or give a counterexample. 


Let S be the set of all strings of 0’s and 1’s, and 
define D: S — Z as follows: For every s E S, 


D(s) = the number of 1’s in s minus the 
number of 0’s in s. 


a. Is D one-to-one? Prove or give a counterexample. 
b. Is D onto? Prove or give a counterexample. 


Define F: P({a, b, c}) — Z as follows: For every 
A in P({a, b, c}), 


F(A) = the number of elements in A. 


7.2 


24. 


25. 


26. 


H 27. 


28. 


29. 


H*30. 


* 31. 
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a. Is F one-to-one? Prove or give a counterexample. 
b. Is F onto? Prove or give a counterexample. 


Les S be the set of all strings of a’s and b’s, and 
define N: S — Z by 


Ns) = the number of a’s ins, for each s E S. 


a. Is N one-to-one? Prove or give a counterexample. 
b. Is N onto? Prove or give a counterexample. 


Let S be the set of all strings in a’s and b’s, and 
define C: S — S by 


C(s) = as, foreach s E S. 


(C is called concatenation by a on the left.) 

a. Is C one-to-one? Prove or give a counterex- 
ample. 

b. Is C onto? Prove or give a counterexample. 


Define S: Z* —> Z* by the rule: For each integer 
n, S(n) = the sum of the positive divisors of n. 
a. Is S one-to-one? Prove or give a counterexample. 
b. Is S onto? Prove or give a counterexample. 


Let D be the set of all finite subsets of positive 

integers, and define T: Z* — D by the following 

rule: For every integer n, T(n) = the set of all of 

the positive divisors of n. 

a. Is T one-to-one? Prove or give a counterex- 
ample. 

b. Is T onto? Prove or give a counterexample. 


Define G: R X R —> R x R as follows: 

G(x, y) = (2y, —x) for every (x, y) E R X R. 

a. Is G one-to-one? Prove or give a counterex- 
ample. 

b. Is G onto? Prove or give a counterexample. 


Define H: R X R — R x R as follows: 

H (x, y) = (x+ 1, 2 — y) for every (x, y) E R X R. 

a. Is H one-to-one? Prove or give a counterex- 
ample. 

b. Is H onto? Prove or give a counterexample. 


Define J: Q X Q —> R by the rule 

J (r,s) = r+ V2s for each (r, s) E Q X Q. 

a. Is J one-to-one? Prove or give a counterex- 
ample. 

b. Is J onto? Prove or give a counterexample. 


Define F: Z* X Z* — Z* and G: Z* X Z* > Z* 
as follows: For each (n, m) E Z* X Z+, 


F(n, m) = 3"5" and G(n, m) = 3"6”. 
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H a. Is F one-to-one? Prove or give a counterex- 
ample. 
b. Is G one-to-one? Prove or give a counterexample. 


32. a. Is logg27 = log23? Why or why not? 
b. Is logig9 = log43? Why or why not? 


The properties of logarithm established in 33-35 are used 
in Sections 11.4 and 11.5. 


33. Prove that for all positive real numbers b, x, and y 
with b # 1, 


ion (?) = log, x — log, y. 


34. Prove that for all positive real numbers b, x, and y 
with b # 1, 


log, (xy) = log, x + log, y. 


H 35. Prove that for all real numbers a, b, and x with b 


and x positive and b # 1, 
log, (x^) = a log, x. 


Exercises 36 and 37 use the following definition: If 

f:R — Rand g: R — Rare functions, then the function 
(f +g): R — Ris defined by the formula (f+ g)(x) = 
f(x) + g(x) for every real number x. 


36. Iff: R — Rand g: R — R are both one-to-one, is 
f+ g also one-to-one? Justify your answer. 


37. Iff: R— Rand g: R > R are both onto, is f+ g 
also onto? Justify your answer. 


Exercises 38 and 39 use the following definition: If 
f:R — Risa function and c is a nonzero real number, 
the function (c-f): R — R is defined by the formula 
(c f)(x) = c (f(x)) for every real number x. 


38. Let f: R — R be a function and c a nonzero real 
number. If fis one-to-one, is c-f also one-to-one? 
Justify your answer. 


39. Let f: R — R be a function and c a nonzero real 
number. If fis onto, is c-f also onto? Justify your 
answer. 


H 40. Suppose F: X — Y is one-to-one. 


a. Prove that for every subset A C X, 
F_\(F(A)) = A. 

b. Prove that for all subsets A; and A, in X, 
F(A, N Az) = F(A;) A F(A3). 
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41. Suppose F: X — Y is onto. Prove that for every 
subset B C Y, F(F ~! (B)) = B. 


Let X = {a, b, c, d, e} and Y = {s, t, u, v, w}. In each of 42 
and 43 a one-to-one correspondence F: X — Y is defined 
by an arrow diagram. In each case draw an arrow diagram 


for F™!. 


In 44-55 indicate which of the functions in the referenced 
exercise are one-to-one correspondences. For each func- 
tion that is a one-to-one correspondence, find the inverse 
function. 


44. Exercise 10a 45. Exercise 10b 


46. Exercise lla 47. Exercise 11b 


48. Exercise 12a 49. Exercise 12b 


50. Exercise 21 51. Exercise 22 


52. Exercise 15 with the co-domain taken to be the set 
of all real numbers not equal to | 


53. Exercise 16 with the co-domain taken to be the set 
of all real numbers 


54. Exercise 17 with the co-domain taken to be the set 
of all real numbers not equal to 3 


55. Exercise 18 with the co-domain taken to be the set 
of all real numbers not equal to 1 


56. In Example 7.2.8 a one-to-one correspondence 
was defined from the power set of {a, b} to the set 
of all strings of 0’s and I’s that have length 2. Thus 
the elements of these two sets can be matched up 
exactly, and so the two sets have the same number 
of elements. 

a. Let X = {x,, X2, . . . , Xp} be a set with 
n elements. Use Example 7.2.8 as a model to 


define a one-to-one correspondence from 
P(X), the set of all subsets of X, to the set of 
all strings of 0’s and 1’s that have length n. 
b. In Section 9.2 we show that there are 2” 
strings of 0’s and 1’s that have length n. What 
does this allow you to conclude about the 
number of subsets of P(X)? (This provides an 
alternative proof of Theorem 6.3.1.) 


ANSWERS FOR TEST YOURSELF 


H 


H 
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57. Write a computer algorithm to check whether a 
function from one finite set to another is one- 
to-one. Assume the existence of an independent 
algorithm to compute values of the function. 


58. Write a computer algorithm to check whether a 
function from one finite set to another is onto. As- 
sume the existence of an independent algorithm to 
compute values of the function. 


1. for all x; and x, in X, if F(x) = F(x.) then x, = x, 

2. there exist elements x, and x, in X such that F(x,) = FQ) 
and x; # Xx, 3. for every element y in Y, there exists 

at least one element x in X such that f(x) =y 4. there 
exists an element y in Y such that for every element x in X, 
f(x) +y 5. logically equivalent ways of expressing what it 
means for a function H to be one-to-one (The second is the 
contrapositive of the first.) 6. xı and x, are any /particular 


but arbitrarily chosen] elements in X with the property that 


7.3 


F(x) = Fœ); x1 = X2 
chosen] element in Y; there exists at least one element x in X 


7. y is any [particular but arbitrarily 


such that F(x) = y 8. show that there are concrete elements 
x, and x, in X with the property that F(x,) = F(x) and 

xı #X, 9. show that there is a concrete element y in Y 
with the property that F(x) # y for any element x in X 

10. function from X to Y; both one-to-one and onto 11. the 
unique element x in X such that F(x) = y (in other words, 


F '(y) is the unique preimage of y in X) 


Composition of Functions 


It is no paradox to say that in our most theoretical moods we may be nearest to our 


most practical applications. —Alfred North Whitehead 


Consider two functions, the successor function and the squaring function, both defined 
from Z to Z, and imagine that each is represented by a machine. If the two machines are 
hooked up so that the output from the successor function is used as input to the squaring 
function, then they work together to operate as one larger machine. In this larger machine, 
an integer n is first increased by 1 to obtain n + 1; then the quantity n + 1 is squared to 
obtain (n + 1). This is illustrated in the following drawing. 


eo 


n 


successor function > 


ie function 


n+l 


2 


(n+ 1} 


Combining functions in this way is called composing them; the resulting function is 
called the composition of the two functions. Note that the composition can be formed only 
if the output of the first function is acceptable input to the second function. That is, the 
range of the first function must be contained in the domain of the second function. 
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Note Even though we 
write g ° f, we put f first 
when we say “the compo- 
sition of fand g” because 
an element x is acted upon 
first by f and then by g. 


Example 7.3.1 


A 


Caution! Be careful 
not to confuse g ° f and 
e(fFœ@): g ° f is the name 
of the function whereas 
g(f(a)) is the value of the 
function at x. 


Note For general func- 
tions F and G, F ° Gneed 
not necessarily equal 

G ° F (although the two 
may be equal). 
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Definition 


Let f: X — Y and g: Y' — Z be functions with the property that the range of fis a 
subset of the domain of g. Define a new function g ° f: X — Z as follows: 


(ee PA = g(f(x)) for each x E X, 


where g ° f is read “g circle f” and g(f(x)) is read “g of f of x.” The function g ° f is 
called the composition of f and g. 


This definition is shown schematically below. 


0 (FQ) = 


Bef) 


Composition of Functions Defined by Formulas 


Let f: Z — Z be the successor function and let g: Z — Z be the squaring function. Then 
f(n) = n+ 1 foreach n E Z and g(n) = n for eachn E Z. 


a. Find the compositions g ° f and f° g. 
b. Isg ° f= f° g? Explain. 


Solution 


a. Because f sends each integer to that integer plus 1 and g sends each integer to the 
square of that integer, it can be helpful to think of the action of f and g as follows: 


f(any integer) = that integer +1 [even if the integer is g(n)]. 
g(any integer) = (that integer)” [even if the integer is f(n)]. 


Thus the functions g ° f and f° g are defined as follows: 
(g °f)(n) = g(fin)) = g(n+1) = (n +1)" foreachn E Z, 
and 
(f° g)(n) = f(g(n)) = f(r’) = n° +1 foreachn E Z. 
b. Two functions from one set to another are equal if, and only if, they always take the 
same values. In this case, 
(g of (1) = (1+ 1) = 4, whereas (f° g)(1) = 17 +1=2. 


Thus the two functions g ° f and f° g are not equal: 
Sof FJ & E 


Example 7.3.1 illustrates the important fact that composition of functions is not a com- 
mutative operation. 
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Composition of Functions Defined on Finite Sets 


Let X = {1,2,3}, Y = {a, b, c, d}, Y = {a, b, c,d, e}, and Z = {x, y, z}. Define func- 
tions f: X — Y' and g : Y — Z by the arrow diagrams below. 


Draw the arrow diagram for g ° f. What is the range of g ° f? 


Solution To find the arrow diagram for g ° f, just trace the arrows all the way across 
from X to Z through Y. The result is shown below. 


X gof Z: 
— 


(8 e P0) = a 70) = ge) =z 


IS (g °f)(2) = g(f(2)) = gb) = y 
Ba (e °f)(3) = (FG) = gla) = y 


The range of g ° f is {y, z}. a 
Recall that the identity function on a set X, Iy, is the function from X to X defined by 
the formula 
x(x) =x forevery x E X. 


That is, the identity function on X sends each element of X to itself. What happens when an 
identity function is composed with another function? 


Composition with the Identity Function 


Let X = {a, b, c, d} and Y = {u, v, w}, and suppose f: X — Y is given by the arrow dia- 
gram shown below. 


Find f° ly and Iy° f. 


Solution The values of f ° Iy are obtained by tracing through the arrow diagram shown below. 


(fe a) = fUx(a)) = fla) = u 
(fe Lyd) = fUx)) =f) = v 
(fe Inte) = fax) = fe) = v 
(F° ld = fUxld)) = fd) = u 
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Thus, for every element x in X, 


(fe Dw = fa). 


By definition of equality of functions, this means that fo Iy = f. 
Similarly, the equality Iy ° f = f can be verified by tracing through the arrow diagram 
below for each x in X and noting that in each case, (Iy ° f)(x) = f(x). 


More generally, the composition of any function with an identity function equals the 
function. 


Theorem 7.3.1 Composition with an Identity Function 


If fis a function from a set X to a set Y, and Zy is the identity function on X, and Iy is 
the identity function on Y, then 


(a) foly=f and (b)lyof=f. 


Proof: 
Part (a): Suppose fis a function from a set X to a set Y and Zy is the identity function 
on X. Then, for each x in X, 


(fe DWA) = fUx@)) = FO). 
Hence, by definition of equality of functions, f° Jy = f, as was to be shown. 


Part (b): This is exercise 16 at the end of this section. 


Now let f be a function from a set X to a set Y, and suppose f has an inverse function 
f -l Recall that f ~ is the function from Y to X with the property that 


f'M=x e f@=y. 


What happens when fis composed with f ~!? Or when f T! is composed with f? 


ZEWJGÆZI Composing a Function with Its Inverse 
Let X = {a, b, c} and Y = {x, y, z}. Define f: X — Y by the following arrow diagram. 
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Note Recall that if b is 
any element of Y, then 

f \(b) = that element a 
of X such that f(a) = x. 
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You can see from the diagram that f is one-to-one and onto. Thus f T! exists and is found 
by tracing the arrows backwards, as shown below. 


Now f | o f is found by following the arrows from X to Y by fand back to X by f~ '. If you 
do this, you will see that 

y aor F@) =f '@=a 

(Ft fb) =f "FO) =f '@ = b 


and 


¢ ef =f 'FO) =f 'O) =e. 


Thus the composition of f and f ~! sends each element to itself. So by definition of the 
identity function, 


fl of=Iy. 


In a similar way, you can see that 


fsf =a. E 


More generally, the composition of any function with its inverse (if it has one) is an 
identity function. Intuitively, the function sends an element in its domain to an element in 
its co-domain and the inverse function sends it back again, so the composition of the two 
sends each element to itself. This reasoning is formalized in Theorem 7.3.2. 


Theorem 7.3.2 Composition of a Function with Its Inverse 


If f: X — Y is a one-to-one and onto function with inverse function f 1 y—X, then 


(a) f r= and (b) feof E lip 


Proof: 

Part (a): Suppose f: X — Y is a one-to-one and onto function with inverse function 
f: Y > X. [To show that f '°f= Iy, we must show that for each x€ X, 
f lo J )(x) = x.] Let x be any element in X. Then, by definition of composition of 
functions, 


¢' efi =f 'F) 
Let 


z =f SO). 
By definition of inverse function, 


f@) =f), 


(continued on page 466) 
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and, because fis one-to-one, this implies that 
Z=xX. 
Now z=f NF (x)) also, and so, by substitution, 
Fosa 
Or, equivalently, 
ET PA =x 


[as was to be shown]. 


Since x is any element of X and since Jy(x) = x, this proves that f Tlo f= Iy. 
Part (b): This is exercise 17 at the end of this section. 


Composition of One-to-One Functions 


The composition of functions interacts in interesting ways with the properties of be- 
ing one-to-one and onto. What happens, for instance, when two one-to-one functions 
are composed? Must their composition be one-to-one? For example, let X = {a, b, c}, 
Y = {w, x, y, z}, and Z = {1, 2, 3, 4, 5}, and define one-to-one functions f: X — Y and 
g: Y — Z as shown in the arrow diagrams of Figure 7.3.1. 


FIGURE 7.3.1 


Then g ° f is the function with the arrow diagram shown in Figure 7.3.2. 


X Z 


FIGURE 7.3.2 


From the diagram it is clear that for these particular functions, the composition is one-to- 
one. This result is no accident. It turns out that the composition of two one-to-one functions 
is always one-to-one. 
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Theorem 7.3.3 


Iff: X — Y and g: Y— Z are both one-to-one functions, then g ° f is one-to-one. 


By the method of direct proof, the proof of Theorem 7.3.3 has the following starting point 
and conclusion to be shown. 


Starting Point: Suppose fis a one-to-one function from X to Y and g is a one-to-one func- 
tion from Y to Z. 


To Show: g ° f is a one-to-one function from X to Z. 
The conclusion to be shown says that a certain function is one-to-one. How do you show 


that? The crucial step is to realize that if you substitute g ° f into the definition of one-to- 
one, you see that 


g ° fis one-to-one <= Vx, X E X, if (e ° f Xx) = (g ° f)(x) then x, = x. 


By the method of direct proof, then, to show g ° f is one-to-one, you 
suppose x, and x, are any elements of X such that (g ° f Xx) = (g ° f)(2), 


and you 
show that x; = x. 


Now the heart of the proof begins. To show that x; = x2, you work forward from the sup- 
position that (g ° f)(x) = (g ° f)(x), using the fact that f and g are both one-to-one. By 
definition of composition, 


(g ef) = sf) and (g ° f)(%2) = gf). 
Since the left-hand sides of the equations are equal, so are the right-hand sides. Thus 
gf) = gf). 
Now just stare at the above equation for a moment. It says that 
g(something) = g(something else). 


Because g is a one-to-one function, any time g of one thing equals g of another thing, those 
two things are equal. Hence 


fœ) = fF). 


But fis also a one-to-one function. Any time f of one thing equals f of another thing, those 
two things are equal. Therefore, 


xX, = X92. 


This is what was to be shown! 
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This discussion is summarized in the following formal proof. 


Proof of Theorem 7.3.3: 


Suppose f: X — Y and g: Y — Z are both one-to-one functions. [We must show that 
g ° f is one-to-one.] Suppose x, and x, are elements of X such that 


(g ° Pa) = 8 ° f). 
[We must show that x, = x>.] By definition of composition of functions, 
8 FQ) = 8( FO). 
Since g is one-to-one, fœ) = fo). 
And since fis one-to-one, X =X 


[as was to be shown]. Hence g ° f is one-to-one. 


Composition of Onto Functions 


Now consider what happens when two onto functions are composed. For example, let 
X = {a,b,c,d,e}, Y = {w, x, y, z}, and Z = {1, 2, 3}. Define onto functions f and g by 
the following arrow diagrams. 


Then g ° f is the function with the arrow diagram shown below. 
It is clear from the diagram that g ° f is onto. 


X Z 


It turns out that the composition of any two onto functions (that can be composed) is onto. 


Theorem 7.3.4 
Iff: X— Y and g: Y — Z are both onto functions, then g ° f is onto. 


A direct proof of Theorem 7.3.4 has the following starting point and conclusion to be 
shown: 


Starting Point: Suppose fis an onto function from X to Y, and g is an onto function from 
Y to Z. 
To Show: g ° f is an onto function from X to Z. 
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The conclusion to be shown says that a certain function is onto. How do you show that? The 
crucial step is to realize that if you substitute g ° f into the definition of onto, you see that 


gof:X—Zisonto << given any element z of Z, it is possible to find an element 
x of X such that (g ° f)(x) = z. 


Since this statement is universal, to prove it you 


A suppose z is a [particular but arbitrarily chosen] element of Z 


and show that there is an element x in X such that (g ° f )(x) = z. 
Caution! To show that a 


function is onto, you must Hence you must start the proof by supposing you are given a particular but arbitrari- 


start with an arbitrary ly chosen element in Z. Let us call it z. Your job is to find an element x in X such that 
element of the co-domain (g ° f)(x) = z. 
and deduce that it is the To find x, reason from the supposition that z is in Z, using the fact that both g and f are 
image of some element in onto. Imagine arrow diagrams for the functions f and g. 
the domain. 
X Y Z 
f 8 
~ — 
gof 


You have a particular element z in Z, and you need to find an element x in X such that when 
x is sent over to Z by g ° f, its image will be z. Now since g is onto, z is at the tip of some 
arrow coming from Y. That is, there is an element y in Y such that 


gly) =z. 7.3.1 


This means that the arrow diagrams can be drawn as follows: 


X Y Z 
(C) C l = 
-~a C “S 
gof 


But f also is onto, and so every element in Y is at the tip of an arrow coming from X. In 
particular, y is at the tip of some arrow. That is, there is an element x in X such that 


f(x) = y. 7.3.2 


The diagram, therefore, can be drawn as shown below. 


X Y Z 
f g 
eS a a 
x 7 z 
e >e >e 
gof 
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Now just substitute equation (7.3.2) into equation (7.3.1) to obtain 


g(f(x)) =z. 


And by definition of g ° f, this can be rewritten as 
BF) = (g e PA). 
Hence 
(Be Pa) = z. 


Thus x is an element of X that is sent by g ° f to z, and so x is the element you were sup- 
posed to find. 
This discussion is summarized in the following formal proof. 


Proof of Theorem 7.3.4: 


Suppose f: X — Y and g: Y— Z are both onto functions. [We must show that g ° f is 
onto.] Let z be any [particular but arbitrarily chosen] element of Z. [We must show the 
existence of an element in X such that g ° f of that element equals z.] Since g is onto, 
there is an element, say y, in Y such that g(y) = z. And since f is onto, there is an 
element, say x, in X such that f(x) = y. Hence there is an element x in X such that 


BeH = sf @)) = 80) =z 


[as was to be shown]. It follows that g ° f is onto. 


Seueeee An Incorrect “Proof” That a Function Is Onto 
To prove that a composition of onto functions is onto, a student wrote: 


1. “Suppose f: X — Y and g: Y— Z are both onto. Then 
2 Vy E Y, 3x E X such that f(x) = y, 

3. and 

4. Vz E Z, dy E Y such that f(y) = z. 

5 

6 


. So 
(g °f)@) = (Fa) = 80) =z 
7. and thus g ° f is onto.” 
Explain the mistakes in this “proof.” 


Solution To show that g ° f is onto, you have to meet the following challenge: If some- 
one gives you an element z in Z (over which you have no control), you must be able to ex- 
plain how to find an element x in X such that (g ° f)(x) = z. Thus a proof that g ° f is onto 
must start with the assumption that you have been given a particular but arbitrarily chosen 
element of Z. This proof does not do that. 

In fact the statements in lines 2 and 4 simply restate the hypothesis that fand g are 
functions that are onto. The x, y, and z in these lines are local variables, with no mean- 
ing outside the quantified statements that contain them. In particular, the variable y in 
line 2 is unrelated to the variable y in line 4. So in line 6 it is wrong to assume that the 
two y’s refer to the same object, which removes the justification for concluding that 
g ° f is onto. E 
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TEST YOURSELF 


1. 


EXERCISE SET 7.3 


7.3 COMPOSITION OF FUNCTIONS 471 


If fis a function from X to Y’, g is a function 
from Y to Z, and Y’ C Y, then g ° f is a function 
from to , and (g ° f)(x) = for 
every x in X. 


. If fis a function from X to Y and 7, and J, are the 


identity functions from X to X and Y to Y, respec- 
tively, then f° I, = and I, ° f = ; 


. If fis a one-to-one correspondence from X to Y, 


then f !of=____and fof }=___. 


4. If fis a one-to-one function from X to Y and g is 
a one-to-one function from Y to Z, you prove that 
g ° f is one-to-one by supposing that and 
then showing that ____. 


5. If fis an onto function from X to Y and g is an 
onto function from Y to Z, you prove that g ° f is 
onto by supposing that and then showing 
that ; 


In each of 1 and 2, functions f and g are defined by arrow 


diagrams. Find g ° fand f ° g and determine whether g ° f 


equals f ° g. 


1. 


X X X X 
f g 
— Ca S 
(Ee) =] (of) a. 


For all strings s € S, 
L(s) = the number of characters in s. 
Let T: Z — {0, 1, 2} be the mod 3 function: 
For every integer n, T(n) =n mod 3. 


a. (T° L)(abaa) =? 
b. (T° L)(baaab) =? 
c. (T ° L)(aaa) =? 


2 X X X X , 
f £ 9. Define F: R — R and G: R — Z by the follow- 
— ee. a e F(x) = ey 3 and G(x) = |x] for every 
\ a. (Go F)\(2) =? 
b. (Go F)(—3) =? 


In 3 and 4, functions F and G are defined by formulas. 


Find G ° F and F ° G and determine whether G © F equals 


FoG. 


3. 
4. 
5. 


F(x) = x and G(x) = x— 1, for each real number x. 


F(x) = x and G(x) = x!” for each real number x. 


Define f: R — R by the rule f(x) = —x for every 
real number x. Find (f° f)(). 


. Define F: Z — Z and G: Z — Z by the rules 


F(a) = 7a and G(a) = a mod 5 for each integer a. 
Find (G ° F)(0), (G ° F)(1), (G ° F)(2), (G ° F)(3), 
and (G ° F)(4). 


. Define L: Z — Z and M: Z — Z by the rules 


L(a) = a’ and M(a) = a mod 5 for each integer a. 


Ha. Find (L ° M)(12), (M ° D(12), (L ° M)(9), and 


(M ° L)(9). 
b. ILe M=M°L? 


Let S be the set of all strings in a’s and b’s and let 
L: S— Z be the length function: 


c. (Go F)(5) =? 


10. Define F: Z — Z and G: Z — Z by the rules 
F(n) = 2n and G(n) = |n/2] for every integer n. 
a. Find (G ° F)(8), (F ° G)(8), (G ° F)(3), and 
(F ° G)(3). 
b. Is G° F = F ° G? Explain. 


Tl. Define F: R — R and G: R— R by the rules 
F(n) = 3x and G(n) = [x/3] for every real number x. 
a. Find (G ° F)(6), (F ° G)(6), (G ° F)(1), and 
(F ° G)(1). 
b. Is Go F = F ° G? Explain. 


The functions of each pair in 12-14 are inverse to each 
other. For each pair, check that both compositions give 
the identity function. 


12. F: R — R and F 'R —R are defined by 


=f y=2 
F@)=3x+2 and FQ) =" 


for every y E R. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


472 CHAPTER7 PROPERTIES OF FUNCTIONS 


13. G: R* — R* andG ': R” — R* are defined by H 22. Iff: X — Y and g: Y — Z are functions and 
o i 9 : 
G@=x2 and GQ) = Vx g ° f is onto, must g be onto? Prove or give a 
counterexample. 


+ 
sa E Re 23. Let f: W— X, g: X — Y, and h: Y — Z be func- 


14. H and H™! are both defined from R — {1} to tions. Must h ° (g ° f) = (h ° g) ° f? Prove or give 
R-— {1} by the formula a counterexample. 


Ca AL f h R- 24. True or False? Given any set X and given any 
Hu) =H Q= r or each x E R~ {1}. functions f: X > X, g: X > X, and h: X > X, if h 
is one-to-one and h ° f = h ° g, then f= g. Justify 


15. Explain how it follows from the definition of 
your answer. 


logarithm that 
a. log,(b*) = x, for every real number x. 25. True or False? Given any set X and given any 

b. b!°%* = x, for every positive real number x. functions f: X > X, g : X — X, and h: X — X, if h 
is one-to-one and f° h = g oh, then f= g. Justify 


H 16. Prove Theorem 7.3.1(b): If fis any function from 
your answer. 


a set X to a set Y, then Iy ° f = f, where Iy is the 
identity function on Y. In 26 and 27 find (g ° f) ',g', f ', and f" ° g’, and state 


h of) 'andf og” lated. 
17. Prove Theorem 7.3.2(b): If f: X — Y is a one-to- es a i 


one and onto function with inverse function 26. Let X = {a,b,c}, Y = {x, y, z}, and Z = {u, v, w}. 
f~: YX, then fof! = Iy, where Iy is the Define f: X — Y and g : Y — Z by the arrow diagrams 
identity function on Y. below. 

18. Suppose Y and Z are sets and g : Y — Z is a one- X Jy Y a a 
to-one function. This means that if g takes the 
same value on any two elements of Y, then those \ =a i, A 
elements are equal. Thus, for example, if a and b o > ae 


are elements of Y and g(a) = g(b), then it can be 
inferred that a = b. What can be inferred in the 


followine situations? 27. Define f: R — R and g: R —> R by the formulas 
ollowing situations? 


a. s,and s,, are elements of Y and g(s;,) = 8(Sm)- f@®) =x+3 and g(x)= -~x foreachxER. 
b. z/2 and t/2 are elements of Y and 
g(z/2) = g(t/2). 28. Prove or give a counterexample: If f: X — Y and 
c. f(x) and f(x) are elements of Y and g: Y — X are functions such that g ° f = Iy and 
efa) = g(f(%)). f° g = Iy, then fand g are both one-to-one and 


. onto and g =f. 
19. Iff: X— Y and g : Y — Z are functions and g ° f 


is one-to-one, must g be one-to-one? Prove or give H 29. Suppose f: X — Y and g : Y — Z are both one-to- 
a counterexample. one and onto. Prove that (g ° f) | exists and that 


of)! — f-lo ow! 
20. Iff: X— Y and g: Y— Z are functions and g ° f is (os) f = 
onto, must f be onto? Prove or give a counterexample. 30. Let f: X > Y and g: Y— Z. Is the following 
property true or false? For every subset C in Z, 


a RecA Beaman a d (g°f) (©) =f \g '(O). Justify your answer. 


is one-to-one, must f be one-to-one? Prove or give 
a counterexample. 


ANSWERS FOR TEST YOURSELF 


1. X; Z; 8f) 2.ff 3.Ix;Iy 4.x, and x are any 5. z is any [particular but arbitrarily chosen] element 
[particular but arbitrarily chosen] elements in X with the in Z; there exists at least one element x in X such that 
property that (g ° f)Q1) = (8 ° f)Q2); x1 = X2 (g°f\@) =z 
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ÆJ Cardinality with Applications to Computability 


There are as many squares as there are numbers because they are just as numerous 
as their roots. —Galileo Galilei, 1632 


The term cardinal number refers to the size of a set (“This set has eight elements”), where- 
as the term ordinal number refers to the order of an element in a sequence (“This is the 
eighth element in the row”). The mathematical definition of cardinal number is based on 
the use of fingers or tally marks to represent numbers. For instance, small children often 
indicate their age by holding up the same number of fingers as the years of their life, and 
adults frequently use one tally mark to correspond to each vote received by a candidate in 
an election. As was discussed in Section 7.2, a pairing of the elements of two sets is called 
a one-to-one correspondence. We say that two finite sets whose elements can be paired 
Galileo Galilei by a one-to-one correspondence have the same size. This is illustrated by the following 
(1564-1642) diagram. 


wynnter/Getty Images 


A B 


A 


T. The elements of set A can 

o be put into one-to-one 
correspondence with the 

oo elements of B. 


Now a finite set is one that has no elements at all or that can be put into one-to-one cor- 
respondence with a set of the form {1, 2,..., n} for some positive integer n. By contrast, 
an infinite set is a nonempty set that cannot be put into one-to-one correspondence with 
{1, 2,..., n} for any positive integer n. Suppose that, as suggested by the quote from Gali- 
leo at the beginning of this section, we extend the concept of size to infinite sets by saying 
that one infinite set has the same size as another if, and only if, the first set can be put into 
one-to-one correspondence with the second. What consequences follow from such a defi- 
nition? Do all infinite sets have the same size, or are some infinite sets larger than others? 
These are the questions we address in this section. The answers are sometimes surprising 
and have the interesting consequence that there are functions defined on the set of integers 
whose values cannot be computed on a computer. 


Definition 


Let A and B be any sets. A has the same cardinality as B if, and only if, there is a 
one-to-one correspondence from A to B. In other words, A has the same cardinality 
as B if, and only if, there is a function f from A to B that is one-to-one and onto. 


The following theorem gives some basic properties of cardinality, most of which follow 
from statements proved earlier about one-to-one and onto functions. 


Theorem 7.4.1 Properties of Cardinality 
For all sets A, B, and C: 


a. Reflexive property of cardinality: A has the same cardinality as A. 


b. Symmetric property of cardinality: If A has the same cardinality as B, then B 
has the same cardinality as A. 


(continued on page 474) 
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c. Transitive property of cardinality: If A has the same cardinality as B and B has 
the same cardinality as C, then A has the same cardinality as C. 


Proof: 

Part (a), Reflexivity: Suppose A is any set. [To show that A has the same cardinality 
as A, we must show there is a one-to-one correspondence from A to A.] Consider the 
identity function J, from A to A. This function is one-to-one because if xı and x, are 
any elements in A with J,(x,) = (%2), then, by definition of I4, x; = x2. The identity 
function is also onto because if y is any element of A, then y = J,(y) by definition of 
I,. Hence J, is a one-to-one correspondence from A to A. [So there exists a one-to-one 
correspondence from A to A, as was to be shown.] 


Part (b), Symmetry: Suppose A and B are any sets and A has the same cardinality as 
B. [We must show that B has the same cardinality as A.] Since A has the same cardi- 
nality as B, there is a function f from A to B that is one-to-one and onto. But then, by 
Theorems 7.2.2 and 7.2.3, there is a function f ~! from B to A that is also one-to-one 
and onto. Hence B has the same cardinality as A [as was to be shown]. 


Part (c), Transitivity: Suppose A, B, and C are any sets and A has the same cardi- 
nality as B and B has the same cardinality as C. [We must show that A has the same 
cardinality as C.] Since A has the same cardinality as B, there is a function f from A 
to B that is one-to-one and onto, and since B has the same cardinality as C, there is 
a function g from B to C that is one-to-one and onto. But then, by Theorems 7.3.3 
and 7.3.4, g ° f is a function from A to C that is one-to-one and onto. Hence A has the 
same cardinality as C [as was to be shown]. 


Note that Theorem 7.4.1(b) makes it possible to say simply that two sets have the same 
cardinality instead of always having to say that one set has the same cardinality as another. 
That is, the following definition can be made. 


Definition 


A and B have the same cardinality if, and only if, A has the same cardinality as B 
or B has the same cardinality as A. 


The following example illustrates a very important property of infinite sets—namely, 
that an infinite set can have the same cardinality as a proper subset of itself. This property 
is sometimes taken as the definition of infinite set. The example shows that even though it 
may seem reasonable to say that there are twice as many integers as there are even integers, 
the elements of the two sets can be matched up exactly, and so, according to the definition, 
the two sets have the same cardinality. 


An Infinite Set and a Proper Subset Can Have the Same Cardinality 
Let 2Z be the set of all even integers. Prove that 2Z and Z have the same cardinality. 
Solution Consider the function H from Z to 2Z defined as follows: 


H(n) = 2n_ for eachn EZ. 
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A (partial) arrow diagram for H is shown below. 


To show that H is one-to-one, suppose H(n,) = H(nz) for some integers n; and nz. 
Then 2n; = 2m by definition of H, and dividing both sides by 2 gives nı = m. Hence h is 
one-to-one. 
To show that H is onto, suppose m is any element of 2Z. Then m is an even integer, and 
so m = 2k for some integer k. It follows that H(k) = 2k = m. Thus there exists k in Z with 
H(k) = m, and hence H is onto. 
Note So there are “as Therefore, by definition of cardinality, Z and 2Z have the same cardinality. a 
many” even integers as 


there are integers! . : . _ 
In Section 9.4 we will show that a function from one finite set to another set of the 


same size is one-to-one if, and only if, it is onto. This result does not hold for infinite 
sets. Although it is true that for two infinite sets to have the same cardinality there must 
exist a function from one to the other that is both one-to-one and onto, it is also always 
the case that: 


If A and B are infinite sets with the same cardinality, then there exist 
functions from A to B that are one-to-one but not onto and functions 
from A to B that are onto but not one-to-one. 


For instance, since the function H in Example 7.4.1 is one-to-one and onto, Z and 2Z have 
the same cardinality. But the “inclusion function” I from 2Z to Z, given by (n) = n for all 
even integers n, is one-to-one but not onto. And the function J from Z to 2Z defined by 
J(n) = 2|n/2], for each integer n, is onto but not one-to-one. (See exercise 6 at the end of 
this section.) 


Countable Sets 


The most basic of all infinite sets is Z* the set of counting numbers {1, 2, 3, 4,...}. A set A 
having the same cardinality as this set is called countably infinite. The reason is that the 
one-to-one correspondence between the two sets can be used to “count” the elements of 
A: If F is a one-to-one and onto function from Z to A, then F(1) can be designated as the 
first element of A, F(2) as the second element of A, F(3) as the third element of A, and so 
forth. This is illustrated graphically in Figure 7.4.1 on the next page. Because F is one-to- 
one, no element is ever counted twice, and because it is onto, every element of A is counted 
eventually. 
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e “First” element of A 
e “Second” element of A 


e “Third” element of A 


FIGURE 7.4.1 “Counting” a Countably Infinite Set 


Definition 


A set is finite if, and only if, it is the empty set or can be put into one-to-one cor- 
respondence with a set of the form {1, 2,..., n} for some positive integer n. A set is 


countably infinite if, and only if, it has the same cardinality as the set of positive 
integers Z*. A set is countable if, and only if, it is finite or countably infinite. A set 
that is not countable is called uncountable. 


ZEWG AYA 4 Countability of Z, the Set of All Integers 
Show that Z, the set of all integers, is countable. 


Solution The set Z is certainly not finite, so if it is countable, it must be because it is 
countably infinite. To show that Z is countably infinite, find a function from the positive 
integers Z* to Z that is one-to-one and onto. This seems to contradict common sense 
because judging from the diagram below, there appear to be more than twice as many 
integers than there are positive integers. 


positive integers 
oo , 
5 4 3 2 1 0 1 2 3 4 5. 


EEE- 
all integers 


But you were alerted that results in this section might be surprising. Try to think how 
you might “count” the set of all integers. 

One way is to start in the middle and work outward systematically. Let the first integer be 
0, the second 1, the third —1, the fourth 2, the fifth —2, and so forth, as shown in Figure 7.4.2, 
starting at 0 and swinging outward in back-and-forth arcs from positive to negative integers 
and back again, picking up one additional integer at each swing. 


Integers: --- _5 4 3 2 1 0 
The “count” of each integer: 1 9 7 5 3 1 2 4 6 8 10 


d 
w 
iN 
n 


FIGURE7.4.2 “Counting” the Set of All Integers 


It is clear from the diagram that no integer is counted twice (so the function is one-to- 
one) and every integer is counted eventually (so the function is onto). Consequently, this 
diagram defines a function from Z” to Z that is one-to-one and onto. Even though in one 
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sense there seem to be more integers than positive integers, the elements of the two sets 
can be paired up one for one. It follows by definition of cardinality that Z* has the same 
cardinality as Z. Thus Z is countably infinite and hence countable. 

The diagrammatic description of the previous function is acceptable as given. You can 
check, however, that the function can also be described by the explicit formula 


if is an even positive integer 


if n is an odd positive integer. 


SEUMAS] Countability of 2Z, the Set of All Even Integers 
Show that 2Z, the set of all even integers, is countable. 


Solution Example 7.4.2 showed that Z” has the same cardinality as Z, and Example 7.4.1 
showed that Z has the same cardinality as 2Z. Thus, by the transitive property of cardinality, 
Z* has the same cardinality as 2Z. It follows by definition of countably infinite that 2Z is 
countably infinite and hence countable. E 


The Search for Larger Infinities: 
The Cantor Diagonalization Process 


Every infinite set we have discussed so far has been countably infinite. Do any larger in- 
finities exist? Are there uncountable sets? Here is one candidate. 
Imagine the number line as shown below. 


4 3 2 1 0 1 2 3 4- 


As noted in Section 1.2, the integers are spread along the number line at discrete intervals. 
On the other hand, the rational numbers are dense along the number line. Between any 
two distinct rational numbers is another rational number, which implies that there are in- 
finitely many rational numbers between any two distinct rational numbers no matter how 
close they are to each other. (See exercise 17 at the end of this section or exercise 20 in 
Section 4.3.) So, because there seem to be vastly more rational numbers than integers, it 
would be natural to conjecture that the infinity of the set of rational numbers is larger than 
the infinity of the set of integers. 

Amazingly, this conjecture is false. Despite the fact that the rational numbers are 
crowded onto the number line whereas the integers are quite separated, the set of all 
rational numbers can be put into one-to-one correspondence with the set of integers. The 
next example gives part of a proof of this fact. It shows that the set of all positive rational 
numbers can be put into one-to-one correspondence with the set of all positive integers. 
In exercise 16 at the end of this section you are asked to use this result, together with a 
technique similar to that of Example 7.4.2, to show that the set of all rational numbers 
is countable. 


The Set of All Positive Rational Numbers Is Countable 
Show that QF, the set of all positive rational numbers, is countable. 


Solution Display the elements of the set Q* in a grid as shown in Figure 7.4.3 on the 
next page. 
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1 1 1 1 1 1 
>, > >, os 

1 72 F 5 76 
2 2 2/ 2 
1 3 3 6 
J 

3 3/ 3 3 
1 4 5 6 
4 4 4 4 
1 4 5 6 
l 

5% sf 3 BD 5 3 
1 /2 3 4 5 6 
6 6 6 6 6 6 
1 2 3 4 5 6 
{ 

FIGURE 7.4.3 


Define a function F from Z* to Q* by starting to count at 1 and following the arrows as 
indicated, skipping over any number that has already been counted. 

To be specific: Set F(1) = 1, F(2) = 4, FG) = 7, and F(4) =}. Then skip 5 since 
Z = L which was counted already. After that, set F(5) = i, F(6) = L F(T) = Z, F(8) = 3 


FQ) = t and F(10) = > Then skip pa 3 and f (since 5 = z ; = L and $ = 5), and set 


3 
1 


F(11) = Continue in this way, defining F(n) for each positive integer n. 

Note that every positive rational number appears somewhere in the grid, and the counting 
procedure is set up so that every point in the grid is reached eventually. Thus the function 
F is onto. Also, skipping numbers that have already been counted ensures that no number 
is counted twice. Thus F is one-to-one. Consequently, F is a function from Z* to Q* that is 


one-to-one and onto, and so Q* is countably infinite and hence countable. E 
al-Kashi In 1874 the German mathematician Georg Cantor achieved success in the search for a 
(1380-1429) larger infinity by showing that the set of all real numbers is uncountable. His method of 


proof was somewhat complicated, however. We give a proof of the uncountability of the 
set of all real numbers between 0 and 1 using a simpler technique introduced by Cantor in 
1891 and now called the Cantor diagonalization process. Over the intervening years, this 
technique and variations on it have been used to establish a number of important results in 
logic and the theory of computation. 

Before stating and proving Cantor’s theorem, we note that every real number is a measure 
of location on a number line. Each can be represented by a decimal expansion of the form 


4o-41đ243..-, 


where ag is an integer (positive, negative, or zero) and for each į = 1, a; is an integer from 
0 through 9. This way of thinking about numbers was developed over several centuries 
by mathematicians in the Chinese, Hindu, and Islamic worlds, culminating in the work 
of Ghiyath al-Din Jamshid al-Kashi in 1427. In Europe it was first clearly formulated and 
successfully promoted by the Flemish mathematician Simon Stevin in 1585. We illustrate 


Bettmann/Getty Images 


Simon Stevin à 
(1548-1620) the concept with an example. 
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Consider the point P in Figure 7.4.4. Figure 7.4.4(a) shows P located between | and 2. 
When the interval from 1 to 2 is divided into ten equal subintervals (see Figure 7.4.4(b)), 
P is seen to lie between 1.6 and 1.7. If the interval from 1.6 to 1.7 is itself divided into ten 
equal subintervals (see Figure 7.4.4(c)), the P is seen to lie between 1.62 and 1.63 but closer 
to 1.62 than to 1.63. So the first three digits of the decimal expansion for P are 1.62. 


(a) 
3 
(b) t+—+—+ 
2.0 
P 
(c) -——_+-e+—_+—_+—_+—+—_ ++ 
1.60 1.62 1.63 1.65 1.70 
FIGURE 7.4.4 


Assuming that any interval of real numbers, no matter how small, can be divided into 
ten equal subintervals, the process of obtaining additional digits in the decimal expansion 
for P can, in theory, be repeated indefinitely. At any stage if P is seen to be a subdivision 
point, then all further digits in the expansion may be taken to be 0. If not, then the process 
gives an expansion with an infinite number of digits. 

The resulting decimal representation for P is unique except for numbers that end in 
infinitely repeating 9’s or infinitely repeating 0’s. For example (see exercise 25 at the end 
of this section), it can be proved that 


0.199999... = 0.200000... 


Let us agree to express any such decimal in the form that ends in all 0’s so that we will have 
a unique representation for every real number. 


Theorem 7.4.2 (Cantor) 


The set of all real numbers between 0 and 1 is uncountable. 


Proof (by contradiction): Suppose the set of all real numbers between 0 and 1 is 
countable. Then the decimal representations of these numbers can be written in a 
list as follows: 


0.411412413 °°" ain't 
0.a21422023 °** Ant: 
0.a31432433 ` ` © 3, °°" 


0.4n14n24n3 "Ann" 


[We will derive a contradiction by showing that there is a number between 0 and 1 that 
does not appear on this list.] 


(continued on page 480) 
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For each pair of positive integers i and j, the jth decimal digit of the ith number 
on the list is a. In particular, the first decimal digit of the first number on the list is 
ay, the second decimal digit of the second number on the list is a22, and so forth. As 
an example, suppose the list of real numbers between 0 and | starts out as follows: 


0.@014 8 8 02 
O..1.4 66 6 0 24 
0-0 365 332-0 
o 96706809 
o 00030002 


The diagonal elements are circled: a1, is 2, aņ2 is 1, a33 18 3, a44 is 7, ass is 1, and so forth. 
Construct a new decimal number d = 0.dıdzd3' '* dp’ +> as follows: 


1 ifay, #1 
d = nn 
" F ifa,, = 1. 
In the previous example, 


dı is | because a,, = 2 # 1, 
d is 2 because an = 1, 

d3 is 1 because a33 = 3 # 1, 
d4is 1 because a44 = 7 # 1, 


ds is 2 because ass = 1, 


and so forth. Hence d would equal 0.12112... 

The crucial observation is that for each integer n, d differs in the nth decimal 
position from the nth number on the list. But this implies that d is not on the list! 
In other words, d is a real number between 0 and 1 that is not on the list of all real 
numbers between 0 and 1. This contradiction shows the falseness of the supposition 
that the set of all numbers between 0 and | is countable. Hence the set of all real 
numbers between 0 and 1 is uncountable /as was to be shown]. 


Along with demonstrating the existence of an uncountable set, Cantor developed a 
whole arithmetic theory of infinite sets of various sizes. One of the most basic theorems of 
the theory states that any subset of a countable set is countable. 


Theorem 7.4.3 


Any subset of any countable set is countable. 


Proof: Let A be a particular but arbitrarily chosen countable set and let B be any subset 
of A. [We must show that B is countable.] Either B is finite or it is infinite. If B is finite, 
then B is countable by definition of countable, and we are done. So suppose B is infi- 
nite. Since A is countable, the distinct elements of A can be represented as a sequence 


Qi, A2, A3z,.... 
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Note If g(k—1) = a; 
then g(k) could also be 
defined by applying the 
well-ordering principle 
for the integers to the 
set {n E Z|n > i and 
a; E B}. 
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Define a function g: Z" — B inductively as follows: 


1. Search sequentially through elements of aj, a>, a3, ... until an element of B 
is found. [This must happen eventually since B C A and B + Ø.] Call that ele- 
ment g(1). 


2. For each integer k = 2, suppose g(k— 1) has been defined. Then g(k— 1) = a; 
for some a; in {dy, a2, a3,...}. Starting with a;+ 1, search sequentially through 
a;+1,a;+2,a;+3,... trying to find an element of B. One must be found even- 
tually because B is infinite, and {g(1), g(2),..., g(k — 1)} is a finite set. When 
an element of B is found, define it to be g(k). 


By (1) and (2) above, the function g is defined for each positive integer. 

Since the elements of a), a2, a3,... are all distinct, g is one-to-one. Further- 
more, the searches for elements of B are sequential: Each picks up where the 
previous one left off. Thus every element of A is reached during some search. 
Moreover, all the elements of B are located somewhere in the sequence a}, az, 
a3,..., and so every element of B is eventually found and made the image of 
some integer. Hence g is onto. These remarks show that g is a one-to-one cor- 
respondence from Z* to B. So B is countably infinite and thus countable /as was 
to be shown]. 


An immediate consequence of Theorem 7.4.3 is the following corollary. 


Corollary 7.4.4 


Any set with an uncountable subset is uncountable. 


Proof: Consider the following equivalent phrasing of Theorem 7.4.3: For every set S 
and for every subset A of S, if S is countable, then A is countable. The contrapositive 
of this statement is logically equivalent to it and states: For every set S and for every 
subset A of S, if A is uncountable then S is uncountable. Since this is an equivalent 
phrasing for the corollary, the corollary is proved. 


Corollary 7.4.4 implies that the set of all real numbers is uncountable because the subset 
of numbers between 0 and | is uncountable. In fact, as Example 7.4.5 shows, the set of all 
real numbers has the same cardinality as the set of all real numbers between 0 and 1! This 
fact is further explored in exercises 13 and 14 at the end of this section. 


The Cardinality of the Set of All Real Numbers 


Show that the set of all real numbers has the same cardinality as the set of real numbers 
between 0 and 1. 


Solution Let S be the open interval of real numbers between 0 and 1: 


S = {xER|O<x< 1}. 
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Imagine picking up S and bending it into a circle as shown below. Since S does not include 
either endpoint 0 or 1, the top-most point of the circle is omitted from the drawing. 


Define a function F: S$ — R as follows: 
Draw a number line and place the interval, S, somewhat enlarged and bent into a circle, 
tangent to the number line at the point 0. This is shown below. 


Number line F(x) 
< } } } } } } > 
=3 2 =l 0 1 2. 3 


For each point x on the circle representing S, draw a straight line L through the top-most 
point of the circle and x, and let F(x) be the point of intersection of L and the number line. 
(F(x) is called the projection of x onto the number line.) 

It is clear from the geometry of the situation that distinct points on the circle go to 
distinct points on the number line, so F is one-to-one. In addition, given any point y 
on the number line, a line can be drawn through y and the top-most point of the circle. 
This line must intersect the circle at some point x, and, by definition, y = F(x). Thus F 
is onto. Hence F is a one-to-one correspondence from S to R, and so S and R have the 
same cardinality. E 


The combination of Example 7.4.5 and Theorem 7.4.2 shows that the set of all real 
numbers is uncountable, which implies that there is an infinite set whose cardinality is 
“greater” than the infinity of the set of positive integers. In exercise 35, you are asked to 
prove that any set and its power set have different cardinalities. And because there is a one- 
to-one function from any set to its power set (the function that takes each element a to the 
singleton set {a}), this implies that the cardinality of any set is “less than” the cardinality of 
its power set. As a result, you can create an infinite sequence of larger and larger infinities! 
For example, you could begin with Z, the set of all integers, and take Z, P(Z), P(P(Z)), 
P(P(P(Z))), and so forth. 


Application: Cardinality and Computability 

Knowledge of the countability and uncountability of certain sets can be used to answer a 

question of computability. We begin by showing that a certain set is countable. 
ZEWG Countability of the Set of Computer Programs in a Computer Language 

Show that the set of all computer programs in a given computer language is countable. 


Solution This result is a consequence of the fact that any computer program in any lan- 
guage can be regarded as a finite string of symbols in the (finite) alphabet of the language. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


7.4 CARDINALITY WITH APPLICATIONS TO COMPUTABILITY 483 


Given any computer language, let P be the set of all computer programs in the language. 
Either P is finite or P is infinite. If P is finite, then P is countable and we are done. If P is 
infinite, set up a binary code to translate the symbols of the alphabet of the language into 
strings of 0’s and 1’s. (For instance, either the seven-bit American Standard Code for In- 
formation Interchange, known as ASCII, or the eight-bit Extended Binary-Coded Decimal 
Interchange Code, known as EBCDIC, might be used.) 

For each program in P, use the code to translate all the symbols in the program into 0’s and 
l’s. Order these strings by length, putting shorter before longer, and order all strings of a given 
length by regarding each string as a binary number and writing the numbers in ascending order. 

Define a function F: Z* — P by specifying that 


F(n) = the nth program in the list for each n € Z*. 


By construction, F is one-to-one and onto, and so P is countably infinite and hence 
countable. 

As a simple example, suppose the following are all the programs in P that translate into 
bit strings of length less than or equal to 5: 


10111, 11, 0010, 1011, 01, 00100, 1010, 00010. 
Ordering these by length gives 
length 2: 11,01 


length 4: 0010, 1011, 1010 
length 5: 10111, 00100, 00010 


And ordering those of each given length by the size of the binary number they represent gives 


Ol =F) 
11 =F) 
0010 = F(3) 
1010 = F(4) 
1011 = F(5) 
00010 = F(6) 
00100 = F(7) 
10111 = F(8) 


Note that when viewed purely as numbers and ignoring leading zeros, 0010 = 00010. 
This illustrates why it is important to order the strings by length before arranging them 
in ascending numeric order because otherwise the values of F would not be uniquely 
determined. | 


The final example of this section shows that a certain set is uncountable and hence that 
there must exist a noncomputable function. 


The Cardinality of a Set of Functions and Computability 


a. Let T be the set of all functions from the positive integers to the set {0, 1, 2, 3, 4, 5, 6, 
7, 8, 9}. Show that T is uncountable. 


b. Derive the consequence that there are noncomputable functions. Specifically, show 
that for any computer language there must be a function F from Z” to {0, 1, 2, 3, 4, 5, 
6, 7, 8, 9} with the property that no computer program can be written in the language 
to take arbitrary values as input and output the corresponding function values. 
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Solution 


a. Let S be the set of all real numbers between 0 and 1. As noted before, any number in S 
can be represented in the form 


0.14203... ap.. -; 
where each q; is an integer from 0 to 9. This representation is unique if decimals that 


end in all 9’s are omitted. 
Define a function F from S to a subset of T as follows: 


F(0.a\a7a3...d,...) = the function that sends each 
positive integer n to a,. 


Note As an example, 

let G = F(0.2901...). 
Then G(1) = 2, G(2) = 9, 
G(3) = 0, G(4) = 1, and 
so forth. 


Choose the co-domain of F to be exactly that subset of T that makes F onto, recalling 
that T is the set of all functions from Z* to {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}. In other words, 
define the co-domain of F to equal the image of F. Now F is one-to-one because in 
order for the functions F'(x,) and F (x2) to be equal, they must have the same value for 
each positive integer, and so each decimal digit of x; must equal the corresponding 
decimal digit of x2, which implies that x; = x2. Thus F is a one-to-one correspon- 
dence from S to a subset of 7: But S is uncountable by Theorem 7.4.2. Hence T has an 
uncountable subset, and so, by Corollary 7.4.4, T is uncountable. 


b. Part (a) shows that the set T of all functions from Z” to {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} 
is uncountable. But, by Example 7.4.6, given any computer language, the set of all 
programs in that language is countable. Consequently, in any computer language there 
are not enough programs to compute values of every function in T. There must exist 
functions that are not computable! E 


TEST YOURSELF 


1. A set is finite if, and only if, 


7. A set is called countable if, and only if, 


2. To prove that a set A has the same cardinality as a 
set B you must 


3. The reflexive property of cardinality says that @) The set of all integers is 
given any set A, (b) The set of all rational numbers is 


4. The symmetric property of cardinality says that (©) The set of all real numbers between 0 and 1 
given any sets A and B, Is 


. In each of the following, fill in the blank with the 
word countable or the word uncountable. 


(d) The set of all real numbers is 


5. The transitive property of cardinality says that 
given any sets A, B, and C, : 9. The Cantor diagonalization process is used to 
prove that 


6. A set is called countably infinite if, and only 
if, i 


EXERCISE SET 7.4 


1. When asked what it means to say that set A has 
the same cardinality as set B, a student replies, “A 
and B are one-to-one and onto.” What should the 
student have replied? Why? 
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. Show that “there are as many squares as there are 


numbers” by exhibiting a one-to-one correspon- 
dence from the positive integers, Z*, to the set S 
of all squares of positive integers: 


S={nEZ |n= K, for some positive integer k}. 


H 6. 


7.4 CARDINALITY WITH APPLICATIONS TO COMPUTABILITY 


. Let 3Z = {n E Z|n = 3k, for some integer k}. 


Prove that Z and 3Z have the same cardinality. 


Let O be the set of all odd integers. Prove that O 
has the same cardinality as 2Z, the set of all even 
integers. 


Let 25Z be the set of all integers that are multiples 
of 25. Prove that 25Z has the same cardinality as 
2Z, the set of all even integers. 


Use the functions J and J defined in the paragraph 
following Example 7.4.1 to show that even though 
there is a one-to-one correspondence, H, from 2Z 
to Z, there is also a function from 2Z to Z that is 
one-to-one but not onto and a function from Z to 
2Z that is onto but not one-to-one. In other words, 
show that / is one-to-one but not onto, and show 
that J is onto but not one-to-one. 


a. Check that the formula for F given at the end 
of Example 7.4.2 produces the correct values 
for n = 1, 2, 3, and 4. 

b. Use the floor function to write a formula for F 
as a single algebraic expression for each posi- 
tive integer n. 


Use the result of exercise 3 to prove that 3Z is 
countable. 


Show that the set of all nonnegative integers is 
countable by exhibiting a one-to-one correspon- 
dence between Z* and Z"0""®, 


In 10-14 S denotes the set of real numbers strictly 
between O and 1. That is, S = {x E R|O<x < 1}. 


10. 


H 11. 


12. 


13. 


* 14. 


Let U = {x E R|0 < x < 2}. Prove that S$ and U 
have the same cardinality. 


Let V = {x E R|2 < x < 5}. Prove that S and V 
have the same cardinality. 


Let a and b be real numbers with a < b, and sup- 
pose that W = {x E R|a < x < b}. Prove that S 
and W have the same cardinality. 


Draw the graph of the function f defined by the 
following formula: 
For each real number x with 0 < x < 1, 


fœ) = n(m = z), 


Use the graph to explain why S and R have the 
same cardinality. 


Define a function g from the set of real numbers 
to S by the following formula: 


15. 


16. 


H 17. 


H*19. 


20. 


21. 


H 22. 


23. 


485 


For each real number x, 


=l x yl 
ex a E 2 


Prove that g is a one-to-one correspondence. (It is 
possible to prove this statement either with calcu- 
lus or without it.) What conclusion can you draw 
from this fact? 


Show that the set of all bit strings (strings of 0’s 
and 1’s) is countable. 


Show that Q, the set of all rational numbers, is 
countable. 


Show that Q, the set of all rational numbers, is dense 
along the number line by showing that given any 
two rational numbers r; and r, with r; < r2, there 
exists a rational number x such that r; < x < r2. 


. Must the average of two irrational numbers always 


be irrational? Prove or give a counterexample. 


Show that the set of all irrational numbers is dense 
along the number line by showing that given any 
two real numbers, there is an irrational number in 
between. 


Give two examples of functions from Z to Z that 
are one-to-one but not onto. 


Give two examples of functions from Z to Z that 
are onto but not one-to-one. 


Define a function g: Z% X Z* — Z* by the 
formula g(m, n) = 2'"3" for all (m, n) E Z? X Z*. 
Show that g is one-to-one and use this result to 
prove that Z* X Z” is countable. 


a. Explain how to use the following diagram to 
show that Z Onneg: x ONNES, and OOTES. have 


the same cardinality. 


H* b. Define a function H: Z"0""°8 X Z" > 


ze""’s by the formula 


Hon, n) = n+ SANS 
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* 24. 


H 25. 
26. 


27. 


28. 


H 29. 


H 30. 


H 31. 


H 32. 


33. 


ANSWERS FOR TEST YOURSELF 


CHAPTER7 PROPERTIES OF FUNCTIONS 


for all nonnegative integers m and n. Interpret the ac- 
tion of H geometrically using the diagram of part (a). 


Prove that the function H defined analytically in 
exercise 23b is a one-to-one correspondence. 


Prove that 0.1999... = 0.2. 


Prove that any infinite set contains a countably 
infinite subset. 


Prove that if A is any countably infinite set, B is 
any set, and g: A — B is onto, then B is countable. 


Prove that a disjoint union of any finite set and 
any countably infinite set is countably infinite. 


Prove that a union of any two countably infinite 
sets is countably infinite. 


Use the result of exercise 29 to prove that the set 
of all irrational numbers is uncountable. 


Use the results of exercises 28 and 29 to prove that 
a union of any two countable sets is countable. 


Prove that Z X Z, the Cartesian product of the set 
of integers with itself, is countably infinite. 


Use the results of exercises 27, 31, and 32 to prove 
the following: If R is the set of all solutions to all 
equations of the form x’ +bx+c = 0, where b 
and c are integers, then R is countable. 


H 34. Let P(S) be the set of all subsets of set S, and let 
T be the set of all functions from S to {0, 1}. Show 
that P(S) and T have the same cardinality. 


Let S be a set and let P(S) be the set of all subsets 
of S. Show that S is “smaller than” P(S) in the 

sense that there is a one-to-one function from S to 
P(S) but there is no onto function from S to P(S). 


H 35. 


The Schroeder—Bernstein theorem states the fol- 
lowing: If A and B are any sets with the property 
that there is a one-to-one function from A to B 
and a one-to-one function from B to A, then A and 
B have the same cardinality. Use this theorem to 
prove that there are as many functions from Z* 

to {0, 1, 2, 3, 4, 5, 6, 7, 8, 9} as there are functions 
from Z* to {0, 1}. 


* 36. 


H 37. Prove that if A and B are any countably infinite 


sets, then A X B is countably infinite. 


* 38. Suppose A, A>, A3,...is an infinite sequence of 


countable sets. Recall that 


JA; = {x | x E A; for some positive integer i}. 
i=l 
Prove that ie A; is countable. (In other words, 
prove that a countably infinite union of countable 
sets is countable.) 


1. it is the empty set or there is a one-to-one correspondence 


from {1, 2,..., n} to it, where n is a positive integer 


2. show that there exists a function from A to B that is one- 


to-one and onto (Or: show that there exists a one-to-one 


correspondence from A to B) 


as A 


3. A has the same cardinality 
4. if A has the same cardinality as B, then B has the 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


same cardinality as A 5. if A has the same cardinality as 
B and B has the same cardinality as C, then A has the same 
cardinality as C 6. it has the same cardinality as the set of 
all positive integers 7. it is finite or countably infinite 

8. countable; countable; uncountable; uncountable 9. the 


set of all real numbers between 0 and | is uncountable 


PROPERTIES OF RELATIONS 


In this chapter we discuss the mathematics of relations defined on sets, focusing on ways to 
represent relations and exploring various properties they may have. The concept of equiva- 
lence relation is introduced in Section 8.3 and applied in Section 8.4 to modular arithmetic 
and cryptography. Partial order relations are discussed in Section 8.5, and an application 
is given showing how to use these relations to help coordinate and guide the flow of indi- 
vidual tasks that must be performed to accomplish a complex, large-scale project. 


EA] Relations on Sets 


Strange as it may sound, the power of mathematics rests on its evasion of all unnecessary 
thought and on its wonderful saving of mental operations. —Ernst Mach, 1838-1916 


A more formal way to refer to the kind of relation defined in Section 1.3 is to call it a 
binary relation because it is a subset of a Cartesian product of two sets. At the end of this 
section we define an n-ary relation to be a subset of a Cartesian product of n sets, where 
n is any integer greater than or equal to two. Such a relation is the fundamental structure 
used in relational databases. However, because we focus on binary relations in this text, 
when we use the term relation by itself, we will mean binary relation. 


The Less-than Relation for Real Numbers 
Define a relation L from R to R as follows: For all real numbers x and y, 
xLy&x<y. 
a. Is 57 L 53? b. Is (—17) L (-14)? c. Is 143 L 143? d. Is (—35) L 1? 
e. Draw the graph of L as a subset of the Cartesian plane R X R. 


Solution 


a. No, 57 > 53. b. Yes, —17 < —14. c. No, 143 = 143. d. Yes, —35 < 1. 


e. For each value of x, all the points (x, y) with y > x are on the graph. So the graph 
consists of all the points above the line x = y. 
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ZEWJGEAPA The Congruence Modulo 2 Relation 
Define a relation E from Z to Z as follows: For every (m, n) E Z X Z, 
mEn <= m-~—niseven. 


a. Is4 E0? Is 2 E6? Is 3 E(—3)? 1s 5 E 2? 
b. List five integers that are related by E to 1. 


c. Prove that if n is any odd integer, then n E 1. 
Solution 


a. Yes, 4 E 0 because 4 — 0 = 4 and 4 is even. 
Yes, 2 E 6 because 2 — 6 = —4 and —4 is even. 
Yes, 3 E (—3) because 3 — (—3) = 6 and 6 is even. 
No, SE 2 because 5 — 2 = 3 and 3 is not even. 


b. There are many such lists. One is 


1 because | — 1 = O is even. 
because 3 — | = 2 is even. 
because 5 — | = 4 is even. 

—1 because —1—1 = —2 is even. 
—3 because —3— 1 = —4 is even. 


c. Proof: Suppose n is any odd integer. Then n = 2k + 1 for some integer k. Now by 
definition of E, n E 1 if, and only if, n — 1 is even. But by substitution, 


n—1=(2k+1)—1 = 2k, 


and since k is an integer, 2k is even. Hence n E 1 [as was to be shown]. 


It can be shown (see exercise 2 at the end of this section) that integers m and n are related 
by E if, and only if, m mod 2 = n mod 2 (that is, both are even or both are odd). When this 
occurs m and n are said to be congruent modulo 2. E 


SCUMGEARJ A Relation on a Power Set 


Let X = {a, b, c}. Then P(X) = {Ø, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c}}. Define a 
relation S from P(X) to P(X) as follows: For all sets A and B in P(X) (that is, for all subsets 
A and B of X), 


A S B & A has at least as many elements as B. 
a. Is {a, b} S {b, c}? b. Is {a} S @? c. Is {b, c} S {a, b, c}? d. Is {c} S {a}? 
Solution 


a. Yes, both sets have two elements. 
b. Yes, {a} has one element and Ø has zero elements, and 1 = 0. 


c. No, {b, c} has two elements and {a, b, c} has three elements and 2 < 3. 


d. Yes, both sets have one element. E 


The Inverse of a Relation 


If R is a relation from A to B, then a relation R`! from B to A can be defined by interchang- 
ing the elements of all the ordered pairs of R. 
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Definition 


Let R be a relation from A to B. Define the inverse relation R`! from B to A as 
follows: 


R`! = {(y,x) E B X A | (x, y) E R}. 


This definition can be written operationally as follows: 


For all x E A and y E B, 0, xX ER! < ay ER. 


The Inverse of a Finite Relation 


Let A = {2, 3, 4} and B = {2, 6, 8}, and let R be the “divides” relation from A to B: For 
every ordered pair (x, y) E A X B, 


xRy <+> xļ|y xdividesy. 
a. State explicitly which ordered pairs are in R and R`', and draw arrow diagrams for R 
and R™!. 
b. Describe R`! in words. 
Solution 
a. R = {(2, 2), (2, 6), (2, 8), (3, 6), (4, 8)} 
R`! = {(2, 2), (6, 2), (8, 2), (6, 3), (8, 4)} 


To draw the arrow diagram for R€, you can copy the arrow diagram for R but reverse 
the directions of the arrows. 


b. R`! can be described in words as follows: For every ordered pair (y, x) € B X A, 


yR''x < yisamultiple of x. a 
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The Inverse of an Infinite Relation 
Define a relation R from R to R as follows: For every ordered pair (x, y) E R X R, 
xRy = y=2J|z|. 
Draw the graphs of R and R™! in the Cartesian plane. Is R7! a function? 


Solution A point (v, u) is on the graph of R`! if, and only if, (u, v) is on the graph of R. 
Note that if x = 0, then the graph of y = 2|x| = 2x is a straight line with slope 2. And if 
x < 0, then the graph of y = 2|x| = 2(—x) = —2x is a straight line with slope —2. Some 
sample values are tabulated and the graphs are shown below. 


R= {œ y)|y = 2|x]} R™ = (0, |y = 2| x1} 

y x 

0 0 
2 1 
2 | =1 
4 2 
4 |-2 

Ist coordinate 2nd coordinate Ist coordinate 2nd coordinate 


Graph of R 


Graph of R7! 


> 


R`! is not a function because, for instance, both (2, 1) and (2, — 1) are in R` !. E 


Directed Graph of a Relation 


In the remaining sections of this chapter, we discuss important properties of relations that 
are defined from a set to itself. 


Note Be careful to dis- 
tinguish clearly between 


a relation and the set on A relation on a set A is a relation from A to A. 
which it is defined. 


Definition 


When a relation R is defined on a set A, the arrow diagram of the relation can be modi- 
fied so that it becomes a directed graph. Instead of representing A as two separate sets 
of points, represent A only once, and draw an arrow from each point of A to each related 
point. As with an ordinary arrow diagram, 
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For all points x and y in A, 


there is an arrow from xtoy + xRy & (@,yeER. 


If a point is related to itself, a loop is drawn that extends out from the point and goes back 


to it. 
ZCLUJGEARKA Directed Graph of a Relation 
Note Read “For every Let A = {3, 4, 5, 6, 7, 8} and define a relation R on A as follows: For every x, y E A, 
x, y CA” as “For every x 
and y in A.” xRy = 2|@—y). 


Draw the directed graph of R. 


Solution Note that 3 R 3 because 3 — 3 = 0 and 2 | 0 since 0 = 2-0. Thus there is a loop 
from 3 to itself. Similarly, there is a loop from 4 to itself, from 5 to itself, and so forth, since 
the difference of each integer with itself is 0, and 2 | 0. 

Note also that 3 R 5 because 3 —5 = —2 = 2:(—1).And5 R3 because 5 — 3 = 2 = 2-1. 
Hence there is an arrow from 3 to 5 and also an arrow from 5 to 3. The other arrows in the 
directed graph, as shown below, are obtained by similar reasoning. 


N-ary Relations and Relational Databases 


A special group of relations, called n-ary relations, form the mathematical foundation for 
relational database theory. Just as a binary relation is a subset of a Cartesian product of two 
sets, an n-ary relation is a subset of a Cartesian product of n sets. 


Definition 


Given sets Aj, A2, . . . , An, an n-ary relation R on A, X A, X ++- X A, is a subset of 
A, X Ay X ++: X A,. The special cases of 2-ary, 3-ary, and 4-ary relations are called 
binary, ternary, and quaternary relations, respectively. 


A Simple Database 


The following is a radically simplified version of a database that might be used in a hospi- 
tal. Let A, be a set of positive integers, A, a set of alphabetic character strings, A3 a set of 
numeric character strings, and A, a set of alphabetic character strings. Define a quaternary 
relation R on A, X A, X A, X A; as follows: 


(ai, dy, 43,44} ER < a patient with patient ID number a,, named a, was 
admitted on date a3, with primary diagnosis a4. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


492 CHAPTER8 PROPERTIES OF RELATIONS 


At a particular hospital, this relation might contain the following 4-tuples: 


(011985, John Schmidt, 020719, asthma) 
(574329, Tak Kurosawa, 011419, pneumonia) 
(466581, Mary Lazars, 010319, appendicitis) 
(008352, Joan Kaplan, 112419, gastritis) 
(011985, John Schmidt, 021719, pneumonia) 
(244388, Sarah Wu, 010319, broken leg) 
(778400, Jamal Baskers, 122719, appendicitis) 


In discussions of relational databases, the n-tuples are normally thought of as being 
written in tables. Each row of the table corresponds to one n-tuple, and the header for each 
column gives the descriptive attribute for the elements in the column. 

Operations within a database allow the data to be manipulated in many different ways. 
For example, in the database language SQL, if the above database is denoted S, the result 
of the query 


SELECT Patient_ID#, Name FROM S WHERE 
Admission_Date = 010319 
would be a list of the ID numbers and names of all patients admitted on 01-03-19: 


466581 Mary Lazars 
244388 Sarah Wu 


This is obtained by taking the intersection of the set A, X A, X {010319} X A4 with the 
database and then projecting onto the first two coordinates. (See exercise 25 of Section 7.1.) 
Similarly, SELECT can be used to obtain a list of all admission dates of a given patient. 
For John Schmidt this list is 

02-07-19 

02-17-19 
Individual entries in a database can be added, deleted, or updated, and most databases 


can sort data entries in various ways. In addition, entire databases can be merged, and the 
entries common to two databases can be moved to a new database. E 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. If Ris a relation from A to B, x E A, and y E B, 4. A relation on a set A is a relation from ——— to 
the notation x R y means that 


2. If Ris a relation from A to B, x E A, and y E B, 5. If Ris a relation on a set A, the directed graph of R 
the notation x & y means that has an arrow from x to y if, and only if, 


3. If Ris a relation from A to B, x E A, and y E B, 
then (y, x) E R`" if, and only if, 
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1. As in Example 8.1.2, the congruence modulo 2 
relation E is defined from Z to Z as follows: For 
every ordered pair (m, n) E Z X Z, 


mEn < m-—niseven. 


a. Is 0 E0?Is 5 E 271s (6,6) E E? 
Is (—1,7) € E? 
b. Prove that for any even integer n, n E 0. 


H 2. Prove that for all integers m and n, m — n is even 
if, and only if, both m and n are even or both m 
and n are odd. 


3. The congruence modulo 3 relation, T, is defined 
from Z to Z as follows: For all integers m and n, 


mTn = 3|(m—n). 


a. Is 107 1? Is 1 T10? Is (2,2) E T? 

Is (8,1) E T? 

List five integers n such that n T 0. 

List five integers n such that n T 1. 

List five integers n such that n T 2. 

Make and prove a conjecture about which 
integers are related by T to 0, which integers 
are related by T to 1, and which integers are 
related by T to 2. 


pans 


4. Define a relation P on Z as follows: For every 
ordered pair (m, n) E Z X Z, 


mPn <> m andn have a common prime factor. 


a. Is 15 P25? b: Is 22 P27? 
c. Is0P5? d. Is8 P8? 


5. Let X = {a, b, c}. Recall that P(X) is the power 
set of X. Define a relation S on P(X) as follows: 
For all sets A and B in P(X), 


ASB = Ahas the same number of elements as B. 


a. Is {a, b} S {b, c}? b. Is {a} S {a, b}? 
c. Is {c} S {b}? 


6. Let X = {a, b, c}. Define a relation J on P(X) as 
follows: For all sets A and B in P(X), 


AJB = ANBŁØ. 


a. Is {a} J {c}? b. Is {a, b} J {b, c}? 
c. Is {a, b} J {a, b, c}? 


7. Define a relation R on Z as follows: For all inte- 


10. 


Tl. 


12. 


gers m and n, 
mRn = 5 | m-n’). 


a. Is 1 R(—9)? b. Is 2R 13? 
c. Is 2R (-8)? d. Is (—8) R 2? 


. Let A be the set of all strings of a’s and b’s of 


length 4. Define a relation R on A as follows: For 
every s, t E A, 


sRt < s has the same first two characters as t. 


a. Is abaa R abba? b. Is aabb R bbaa? 
c. Is aaaa R aaab? d. Is baaa R abaa? 


Let A be the set of all strings of 0’s, 1’s, and 2’s of 
length 4. Define a relation R on A as follows: For 
every s, t E A, 


sRt <> the sum of the characters in s equals 
the sum of the characters in t. 


a. Is 0121 R 2200? b. Is 1011 R 2101? 
c. Is 2212 R 2121? d. Is 1220 R 2111? 


Let A = {3, 4,5} and B = {4, 5, 6} and let R be 
the “less than” relation. That is, for every ordered 
pair (x, y) EA X B, 


xRy & x<y. 


State explicitly which ordered pairs are in R and R t 


Let A = {3, 4, 5} and B = {4, 5, 6} and let S be 
the “divides” relation. That is, for every ordered 
pair (x, y) E A X B, 


xSy $ aly 


State explicitly which ordered pairs are in Sand S K 


a. Suppose a function F: X — Y is one-to-one 
but not onto. Is F~! (the inverse relation for F) 
a function? Explain your answer. 

b. Suppose a function F: X — Y is onto but not 
one-to-one. Is F~! (the inverse relation for F) a 
function? Explain your answer. 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 


given. The symbol * signals that an exercise is more challenging than usual. 
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Draw the directed graphs of the relations defined 
in 13-18. 


13. Define a relation R on A = {0, 1, 2, 3} by 
R = {(0, 0), C1, 2), (2, 2)}. 

14. Define a relation S on B = {a, b, c, d} by 
S = {(a, b), (a, c), (b, ©), (d, d)}. 

15. Let A = {2, 3, 4, 5, 6, 7, 8} and define a relation R 
on A as follows: For every x, y E A, 


xRy = xly. 


16. Let A = {5, 6, 7, 8, 9, 10} and define a relation S 
on A as follows: For every x, y € A, 


xSy <= 2|(-Yy). 


17. Let A = {2, 3, 4, 5, 6, 7, 8} and define a relation T 
on A as follows: For every x, y € A, 


xTy = 3|/G@=y). 


18. Let A = {0, 1, 3, 4, 5, 6} and define a relation V 
on A as follows: For every x, y € A, 


xVy =} 5|Q?-y”). 


Exercises 19-20 refer to unions and intersections of rela- 
tions. Since relations are subsets of Cartesian products, 
their unions and intersections can be calculated as for any 
subsets. Given two relations R and S from A to B, 
RUS= {(x,y) EA X Bl G@, y) E Ror (x, y) E S} 
RAS = {(x,y) EA X B| (x,y) E Rand (x, y) E S}. 


19. Let A = {2, 4} and B = {6, 8, 10} and define 
relations R and S from A to B as follows: For every 
(x,y) EA XB, 


xRy < x|y and 
xSy < y-4=x. 


State explicitly which ordered pairs are in A X B, 
R, S, R U S, and R N S. 


ANSWERS FOR TEST YOURSELF 


20. Let A = {-1, 1, 2, 4} and B = {1, 2} and define 
relations R and S from A to B as follows: For every 
(x,y) EA XB, 


xRy < |x| = |y] and 


xSy << x-—yiseven. 


State explicitly which ordered pairs are in A X B, 
R,S,RUS, and RN S. 


21. Define relations R and S on R as follows: 


R={x, y) ERXR]|x<y} and 
S={@, y) ERXR|x= y}. 
That is, R is the “less than” relation and S is the 


“equals” relation on R. Graph R, S, R U S, and 
R N S in the Cartesian plane. 


22. Define relations R and S on R as follows: 


R={&, V ERXR|x +y =4} and 
S={x, y) ERXR|x= y}. 


Graph R, S, RU S, and RMS in the Cartesian 
plane. 


23. Define relations R and S on R as follows: 


R={(x,y)€RXR|y= |x|} and 
S = {(, y) ER XR|y = I}. 


Graph R, S, RUS, and RMS in the Cartesian 
plane. 


24. In Example 8.1.7 consider the query SELECT 
Patient_ID#, Name FROM S WHERE Primary_ 
Diagnosis = X. The response to the query is 
the projection onto the first two coordinates 
of the intersection of the database with the set 
A, X Ay X Az X {X}. 

a. Find the result of the query SELECT Patient_ 
ID#, Name FROM S WHERE Primary_ 
Diagnosis = pneumonia. 

b. Find the result of the query SELECT Patient_ 
ID#, Name FROM S WHERE Primary_ 
Diagnosis = appendicitis. 


1. xis related to yby R 2. xis not related to yby R 3. @œ,y) ER 4.A;A 5.xis related to y by R 
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EFJ Reflexivity, Symmetry, and Transitivity 


Mathematics is the tool specially suited for dealing with abstract concepts of any kind 
and there is no limit to its power in this field. —P. A. M. Dirac, 1902-1984 


Let A = {2, 3, 4, 6, 7, 9} and define a relation R on A as follows: For every x, y € A, 
xRy <= 3|(@-y). 


Then 2 R 2 because 2 — 2 = 0, and 3 | 0. Similarly, 3 R 3, 4 R 4, 6 R 6,7 R 7, and 9 R 9. 
Also 6 R 3 because 6 — 3 = 3, and 3 | 3. And 3 R 6 because 3 — 6 = —(6—3) = —3, and 
3 | (—3). Similarly, 3 R 9, 9 R 3, 6 R 9, 9 R 6, 4 R 7, and 7 R 4. Thus the directed graph for 
R has the appearance shown below. 


This graph has three important properties: 
1. Each point of the graph has an arrow looping around from it and going back to it. 


2. In each case where there is an arrow going from one point to a second, there is an arrow 
going from the second point back to the first. 


3. In each case where there is an arrow going from one point to a second and from the 
second point to a third, there is an arrow going from the first point to the third. That is, 
there are no “incomplete directed triangles” in the graph. 


Properties (1), (2), and (3) correspond to properties of general relations called reflexiv- 
ity, symmetry, and transitivity. 


A Definition 
Let R be a relation on a set A. 


Caution! The definition 
of symmetric does not 
say that x is related to y 
by R; rather, it states only 3. R is transitive if, and only if, for every x, y, z E A, if x R y and y R z then x R z. 
that if it happens that x is 
related to y, then y must 
be related to x. Because of the equivalence of the expressions x R y and (x, y) E R for every x and y in 
A, the reflexive, symmetric, and transitive properties can also be written as follows: 


1. Ris reflexive if, and only if, for every x € A, x R x. 
2. Ris symmetric if, and only if, for every x, y E A, if x R y then y R x. 


1. Ris reflexive < for every xin A, (x, x) E R. 
2. Ris symmetric <> for every x and y in A, if (x, y) E R then (y, x) E R. 


3. Ris transitive <> for every x, y, and z in A, if (x, y) E R and (y, z) E R then 
(x, z) ER. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


496 CHAPTER8 PROPERTIES OF RELATIONS 


A In informal terms, properties (1)—(3) say the following: 
1. Reflexive: Each element is related to itself. 
Caution! The “first,” , 7 
“second,” and “third” 2. Symmetric: If any one element is related to any other element, then the second element 
elements in the informal is related to the first. 


versions need not all be 
distinct. This is a disadvan- 
tage of informality: It may 


3. Transitive: If any one element is related to a second and that second element is related 
to a third, then the first element is related to the third. 


mask nuances that a formal Note that the definitions of reflexivity, symmetry, and transitivity are universal state- 
definition makes clear. ments. This means that to prove a relation has one of the properties, you use either the 
method of exhaustion or the method of generalizing from the generic particular. 
Now consider what it means for a relation not to have one of the properties defined 
previously. Recall that the negation of a universal statement is existential. Hence if R is a 
relation on a set A, then 


1. R is not reflexive <> there is an element x in A such that x K x [that is, such that 


(x, x) € R]. 


2. Ris not symmetric <> there are elements x and y in A such that x R y but y K x [that 
is, such that (x, y) E R but (y, x) É R]. 


3. Ris not transitive < there are elements x, y, and z in A such that x R y andy R z 
but x K z [that is, such that (x, y) E R and (y, 2) E R but 
(x, z) É R]. 


It follows that you can show that a relation does not have one of the properties by finding 
a counterexample. 
Cea Properties of Relations on Finite Sets 
Let A = {0, 1, 2, 3} and define relations R, S, and T on A as follows: 
R = {(0, 0), (0, 1), ©, 3), G, 0), G, 1), (2, 2), 3, 0), G, 3)}, 


S = {(0, 0), (0, 2), (0, 3), (2, 3)}, 
T = {(0, 1), (2, 3)}. 


a. Is R reflexive? symmetric? transitive? 
b. Is S reflexive? symmetric? transitive? 


c. Is T reflexive? symmetric? transitive? 
Solution 


a. The directed graph of R has the appearance shown below. 


R is reflexive: There is a loop at each point of the directed graph. This means that 
each element of A is related to itself, so R is reflexive. 
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R is symmetric: In each case where there is an arrow going from one point of the 
graph to a second, there is an arrow going from the second point back to the first. This 
means that whenever one element of A is related by R to a second, then the second is 
related to the first. Hence R is symmetric. 


R is not transitive: There is an arrow going from | to 0 and an arrow going from 0 to 
3, but there is no arrow going from 1 to 3. This means that there are elements of A—O, 
1, and 3—such that 1 R 0 and 0 R 3 but 1 K 3. Hence R is not transitive. 


b. The directed graph of S has the appearance shown below. 


3e<—_____+2 


S is not reflexive: There is no loop at 1, for example. Thus (1, 1) € S, and so S is not 
reflexive. 


S is not symmetric: There is an arrow from 0 to 2 but not from 2 to 0. Hence 
(0, 2) E S but (2, 0) € S, and so S is not symmetric. 


S is transitive: There are three cases for which there is an arrow going from one point 
of the graph to a second and from the second point to a third. In particular, there are 
arrows going from 0 to 2 and from 2 to 3; there are arrows going from 0 to 0 and from 
0 to 2; and there are arrows going from 0 to 0 and from 0 to 3. In each case there is 

an arrow going from the first point to the third. (Note again that the “first,” “second,” 
and “third” points need not be distinct.) This means that whenever (x, y) E S and 

(y, z) E S, then (x, z) E S, for every x, y, z E {0, 1, 2, 3}, and so S is transitive. 


c. The directed graph of T has the appearance shown below. 


0 e———e | 


3 e<—____—_—_—e2 
T is not reflexive: There is no loop at 0, for example. Thus (0, 0) € T, so Tis not reflexive. 


T is not symmetric: There is an arrow from 0 to 1 but not from 1 to 0. Thus (0, 1) © T 
but (1, 0) € T, and so T is not symmetric. 


Note 7 is transitive by T is transitive: The transitivity condition is vacuously true for T. To see this, observe 
default because it is not that the transitivity condition says that 


not transitive! s 
For every x, y,z E A, if (x, y) E T and (y, z) E T then (x, z) E T. 


The only way for this to be false would be for there to exist elements of A that make 
the hypothesis true and the conclusion false. That is, there would have to be elements 
x, y, and z in A such that 


&,y ET and ,)JET and &,z ÉT. 


In other words, there would have to be two ordered pairs in T that have the potential to 
“link up” by having the second element of one pair be the first element of the other pair. 
But the only elements in T are (0, 1) and (2, 3), and these do not have the potential to 
link up. Hence the hypothesis is never true. It follows that it is impossible for T not to be 
transitive, and thus T is transitive. E 
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When a relation R is defined on a finite set A, it is possible to write computer algo- 
rithms to check whether R is reflexive, symmetric, and transitive. One way to do this 
is to represent A as a one-dimensional array, (a[1], a[2],..., a[n]) and use a modifica- 
tion of the algorithm of exercise 38 in Section 6.1 to check whether an ordered pair in 
A X A is in R. Checking whether R is reflexive can be done with a loop that examines 
each element a[i] of A in turn. If, for some i, (a[i], a[i]) € R, then R is not reflexive. 
Otherwise, R is reflexive. Checking for symmetry can be done with a nested loop that 
examines each pair (a[i], a[j]) of A X A in turn. If, for some i and j, (a[i], a[j]) E R and 
(aLj], ali]) É R, then R is not symmetric. Otherwise, R is symmetric. Checking whether 
R is transitive can be done with a triply nested loop that examines each triple (a[i], a[j], 
a[k]) of A X A X A in turn. If, for some triple, (a[i], a[j]) E R, (a[j], a[k]) E R, and (a[i], 
a[k]) É R, then R is not transitive. Otherwise, R is transitive. In the exercises for this 
section, you are asked to formalize these algorithms. 


Properties of Relations on Infinite Sets 


Suppose a relation R is defined on an infinite set A. To prove the relation is reflexive, sym- 
metric, or transitive, first write down what is to be proved. For instance, for symmetry you 
need to prove that 


Vx, y E A, if x R y then y R x. 


Then use the definitions of A and R to rewrite the statement for the particular case in 
question. For instance, for the “equality” relation on the set of real numbers, the rewritten 
statement is 


Vx, y E R, if x = y then y = x. 


Sometimes the truth of the rewritten statement will be immediately obvious (as it is here). 
At other times you will need to prove it using the method of generalizing from the generic 
particular. We give examples of both cases in this section. We begin with the relation of 
equality, one of the simplest and yet most important relations. 


Properties of Equality 


Define a relation R on R as follows: For all real numbers x and y, 


xRy & x=y. 


a. Is R reflexive? b. Is R symmetric? c. Is R transitive? 


Solution 
a. R is reflexive: R is reflexive if, and only if, the following statement is true: 


For every x E R, xRx. 
Since x R x just means that x = x, this is the same as saying 
For every x E R, x=x. 


But this statement is certainly true; every real number is equal to itself. 


b. Ris symmetric: R is symmetric if, and only if, the following statement is true: 


For every x, y E R, ifxRytheny Rx. 
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By definition of R, x R y means that x = y and y R x means that y = x. Hence R is 
symmetric if, and only if, 
For every x, y E R, ifx = y then y = x. 


But this statement is certainly true; if one number is equal to a second, then the sec- 
ond is equal to the first. 


c. R is transitive: R is transitive if, and only if, the following statement is true: 
For every x, y,z E R, ifx R yand y R z then x R z. 


By definition of R, x R y means that x = y, y R z means that y = z, and x R z means 
that x = z. Hence R is transitive if, and only if, the following statement is true: 


For every x, y,z E R, ifx = yand y = z then x = z. 


But this statement is certainly true: If one real number equals a second and the second 
equals a third, then the first equals the third. E 


SCWGEY ÆA Properties of “Less Than” 


Define a relation R on R as follows: For all real numbers x and y, 


xRy & x<y. 


a. Is R reflexive? b. Is R symmetric? c. Is R transitive? 


Solution 

a. R is not reflexive: R is reflexive if, and only if, Vx € R, x R x. By definition of R, this 
means that Yx € R, x < x. But this is false: 4x € R such that x £ x. As a counterex- 
ample, let x = 0 and note that 0 £ 0. Hence R is not reflexive. 


b. Ris not symmetric: R is symmetric if, and only if, Vx, y E R, if x R y then y Rx. 
By definition of R, this means that Vx, y E R, if x < y then y < x. But this is false: 
dx, y E R such that x < y and y £ x. As a counterexample, let x = 0 and y = 1 and 
note that 0 < 1 but 1 £ 0. Hence R is not symmetric. 


c. Ris transitive: R is transitive if, and only if, Vx, y, z © R, if x R y and y R z then 
x R z. By definition of R, this means that Vx, y, z E R, if x < y and y < z, then x < z. 
But this statement is true by the transitive law of order for real numbers (Appendix A, 
T18). Hence R is transitive. a 


Sometimes a property is “universally false” in the sense that it is false for every element 
of its domain. It follows immediately, of course, that the property is false for each particular 
element of the domain and hence counterexamples abound. In such a case, it may seem more 
natural to prove the universal falseness of the property rather than to give a single counter- 
example. In the example above, for instance, you might find it natural to answer (a) and (b) 
as follows: 


Alternative Answer to (a): R is not reflexive because x £ x for every real number x (by 
the trichotomy law—Appendix A, T17). 


Alternative Answer to (b): R is not symmetric because for all real numbers x and y in A, 
if x < y then y £ x (by the trichotomy law). 
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Seu weg Properties of Congruence Modulo 3 


Define a relation T on Z (the set of all integers) as follows: For all integers m and n, 


mTn = 3 | (m—n). 


This relation is called congruence modulo 3. 


a. Is T reflexive? b. Is T symmetric? c. Is T transitive? 
Solution 
a. Tis reflexive: To show that T is reflexive, it is necessary to show that 
For everym E Z, mTm. 
By definition of T, this means that 
For every m E Z, 3|(m—m), 


which is true because m — m = 0 and 3 | 0 (since 0 = 3-0). Hence T is reflexive. This 
reasoning is formalized in the following proof. 


Proof of Reflexivity: Suppose m is a particular but arbitrarily chosen integer. 
[We must show that m T m.] Now m—m = 0. But 3 | 0 since 0 = 3-0. Hence 
3 | (m — m). Thus, by definition of T, m T m [as was to be shown]. 


b. Tis symmetric: To show that T is symmetric, it is necessary to show that 
For every m n E Z, ifmTnthenn Tm. 
By definition of T this means that 
For every m,n E Z, if 3|(m—n) then 3 | (n — m). 


Is this true? Suppose m and n are particular but arbitrarily chosen integers such that 
3 | (m—n). Must it follow that 3 | (n— m)? [In other words, can we find an integer so 
that n—m = 3- (that integer)?] By definition of “divides,” since 


3|(m—n), 
then 
m—n = 3k for some integer k. 
The crucial observation is that n —m = —(m — n). Hence, you can multiply both sides 


of this equation by —1 to obtain 
—(m—n) = —3k, 
which is equivalent to 
n—m = 3(—k). 
[Thus we have found an integer, —k, so that n—m = 3- (that integer).] 
Since —k is an integer, this equation shows that 
3 | (n-m). 


It follows that T is symmetric. 
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The reasoning above is formalized in the following proof. 


Proof of Symmetry: Suppose m and n are particular but arbitrarily chosen inte- 
gers that satisfy the condition m T n. [We must show that n T m.] By definition of T, 
since m Tn then 3 | (m — n). By definition of “divides,” this means that m — n = 3k, 
for some integer k. Multiplying both sides by — 1 gives n — m = 3(—k). Since —k 
is an integer, this equation shows that 3 | (n — m). Hence, by definition of T, n T m 
[as was to be shown]. 


c. T is transitive: To show that T is transitive, it is necessary to show that 
For every m,n, p E Z, ifmTnandnTpthenmT p. 
By definition of T this means that 
For every m,n E Z, if 3| (m—n) and 3 | (n — p) then 3 | (m — p). 


Is this true? Suppose m, n, and p are particular but arbitrarily chosen integers such that 
3 | (m—n) and 3 | (n — p). Must it follow that 3 | (m — p)? [In other words, can we find 
an integer so that m— p = 3- (that integer)?] By definition of “divides,” since 


3 | (m—n) and 3 | (n—p), 
then 
m—n=3r _ for some integer r, 
and 


n—p = 3s for some integer s. 


The crucial observation is that (m — n) + (n — p) = m — p. Add these two equations 
together to obtain 


(m—n)+(n-p) = 3r+3s, 
which is equivalent to 
m—p =3(rts). 


[Thus we have found an integer so that m— p = 3- (that integer).] Since r and s are inte- 
gers, r + is an integer. So this equation shows that 


3 | (m—p). 


It follows that T is transitive. 
The reasoning above is formalized in the following proof. 


Proof of Transitivity: Suppose m, n, and p are particular but arbitrarily chosen 
integers that satisfy the condition m T n and n T p. [We must show that m T p.] By 
definition of T, since m T n and n T p, then 3 | (m — n) and 3 | (n—p). By defini- 
tion of “divides,” this means that m—n = 3r and n— p = 3s, for some integers r 
and s. Adding the two equations gives (m—n) + (n — p) = 3r + 3s, and simplify- 
ing gives that m — p = 3(r + s). Since r + is an integer, this equation shows that 
3 | (m — p). Hence, by definition of T, m T p [as was to be shown]. 
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The Transitive Closure of a Relation 


Generally speaking, a relation fails to be transitive because it fails to contain certain or- 
dered pairs. For example, if (1, 3) and (3, 4) are in a relation R, then the pair (1, 4) must be 
in R if R is to be transitive. To obtain a transitive relation from one that is not transitive, it 
is necessary to add ordered pairs. Roughly speaking, the relation obtained by adding the 
least number of ordered pairs to ensure transitivity is called the transitive closure of the 
relation. More precisely, the transitive closure of a relation is the smallest transitive relation 
that contains the relation. 


Definition 


Let A be a set and R a relation on A. The transitive closure of R is the relation R’ on 
A that satisfies the following three properties: 
1. R'is transitive. 


2L RCR. 


3. If Sis any other transitive relation that contains R, then RES. 


Transitive Closure of a Relation 
Let A = {0, 1, 2, 3} and consider the relation R defined on A as follows: 
R= {(0, 1), (1, 2), (2, 3)}. 
Find the transitive closure of R. 
Solution Every ordered pair in R is in R‘, so 
{0, D, (1, 2), (2, 3)} GR. 


Thus the directed graph of R contains the arrows shown below. 


3e o? 


Since there are arrows going from 0 to 1 and from 1 to 2, R' must have an arrow going 
from 0 to 2. Hence (0, 2) E€ R’. Then (0, 2) E€ R’ and (2, 3) € R’, so since R’ is transitive, 
(0, 3) E R'. Also, since (1, 2) € R' and (2, 3) E R’, then (1, 3) E€ R'. Thus R‘ contains at 
least the following ordered pairs: 


{O, 1), ©, 2), ©, 3), d, 2), CL, 3), 2, 3). 


But this relation is transitive; hence it equals R‘. The directed graph of R’ is shown 
below. 
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TEST YOURSELF 


1. 


EXERCISE SET 8.2 


In 1- 
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For a relation R on a set A to be reflexive means 
that 


. For a relation R on a set A to be symmetric means 


that__ — 


. For a relation R on a set A to be transitive means 


that 


To show that a relation R on an infinite set A is 
reflexive, you suppose that and you show 
that ___. 


To show that a relation R on an infinite set A is 
symmetric, you suppose that and you 
show that 


6. 


7. 


10. 


To show that a relation R on an infinite set A is 
transitive, you suppose that and you show 
that ___. 


To show that a relation R on a set A is not reflex- 
ive, you 


To show that a relation R on a set A is not symmet- 
ric, you 
To show that a relation R on a set A is not transi- 


tive, you 


Given a relation R on a set A, the transitive closure of 
Ris the relation R‘ on A that satisfies the following 
three properties: ; , and 


8, anumber of relations are defined on the set 


A = {0, 1, 2, 3}. For each relation: 


anpa 


Draw the directed graph. 

Determine whether the relation is reflexive. 
Determine whether the relation is symmetric. 
Determine whether the relation is transitive. 


Give a counterexample in each case in which the relation 
does not satisfy one of the properties. 


1. 
2. 
3. 
4. 
5. 
6. 
7. 
8. 


Rı = {(, 0), O, 1, ©, 3), d, 1, A, 0), 2, 3), G, 3) 
Ry = {0, 0), ©, D, (L, D, (L, 2), 2, 2), 2, 3) 

R; = {(2, 3), G, 2)} 

R4 = {(1, 2), @, 1), d, 3), G, D) 

Rs = {(0, 0), ©, 1), O, 2), CL, 2} 

Re = {0, 1), O, 2)} 

R3 = {0, 3), (2, 3)} 

Rs = {(0, 0), G, D} 


In 9-33, determine whether the given relation is reflexive, 


symmetric, transitive, or none of these. Justify your answers. 


9. 


10. 


Tl. 


R is the “greater than or equal to” relation on 
the set of real numbers: For every x, y E R, 
xRy@xey. 


C is the circle relation on the set of real numbers: 
For every x, y © R.xCyexrt+y =1. 


D is the relation defined on R as follows: For 
every x, y E R, x D y& xy Z 0. 


12. 


13. 


14. 


15. 


16. 


17. 


H 18. 


19. 


20. 


E is the congruence modulo 4 relation on Z: For 
every m,n E Z, m En $ 4| (m-n). 


F is the congruence modulo 5 relation on Z: For 
every m,n E Z, m F n% 5| (m-n). 


O is the relation defined on Z as follows: For 
every m, n E Z, m O n & m-n is odd. 


D is the “divides” relation on Z”: For all positive 
integers m and n, m D n 4m |n. 


A is the “absolute value” relation on R: For all real 
numbers x and y, xA y= |x| = |yl. 


Recall that a prime number is an integer that is 
greater than 1 and has no positive integer divi- 
sors other than 1 and itself. (In particular, 1 is not 
prime.) A relation P is defined on Z as follows: 
For every m,n E Z, m P n 4& Ja prime number p 
such that p | m and p| n. 


Define a relation Q on R as follows: For all real 
numbers x and y, x Q y x — y is rational. 


Define a relation Z on R as follows: For all real 
numbers x and y, x I y <> x — y is irrational. 


Let X = {a, b, c} and P(X) be the power set of X 
(the set of all subsets of X). A relation E is defined 
on P(X) as follows: For every A, B E P(X), 
AEBS the number of elements in A equals the 
number of elements in B. 
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21. Let X = {a, b, c} and P(X) be the power set of X. 
A relation L is defined on P(X) as follows: For ev- 
ery A, B E P(X), A L B= the number of elements 
in A is less than the number of elements in B. 


22. Let X = {a, b, c} and P(X) be the power set of X. 
A relation N is defined on P(X) as follows: For 
every A, B E P(X), A N B= the number of ele- 
ments in A is not equal to the number of elements 
in B. 


23. Let X be a nonempty set and P(X) the power set of 
X. Define the “subset” relation S on P(X) as fol- 
lows: For every A, B E P(X), A S BACB. 


24. Let X be a nonempty set and P(X) the power set of 
X. Define the “not equal to” relation U on P(X) as 
follows: For every A, BE A(X), A U B&A +B. 


25. Let A be the set of all strings of a’s and b’s of 
length 4. Define a relation R on A as follows: For 
every s, t E A, s R t & s has the same first two 
characters as t. 


26. Let A be the set of all strings of 0’s, 1’s, and 2’s 
that have length 4 and for which the sum of the 
characters in the string is less than or equal to 2. 
Define a relation R on A as follows: For every 
s, t E A, s Rt< the sum of the characters of s 
equals the sum of the characters of t. 


27. Let A be the set of all English statements. A relation 
l is defined on A as follows: For every p, q E A, 
plqa=p-— qis true. 

28. Let A = R X R. A relation F is defined on A as 
follows: For every (xj, yı) and (x2, y2) in A, 


(x1, y) F (Xo, y2) S x1 = XD. 


29. Let A = R X R. A relation S is defined on A as 
follows: For every (x1, yı) and (x2, y2) in A, 


(x1, yD S Qaz, y) + yi = Yo. 


30. Let A be the “punctured plane”; that is, A is the set 
of all points in the Cartesian plane except the ori- 
gin (0, 0). A relation R is defined on A as follows: 
For every p, and p; in A, pı R po ® p; and p; lie 
on the same half line emanating from the origin. 


31. Let A be the set of people living in the world 
today. A relation R is defined on A as follows: For 
all people p and q in A, 


p R q €p lives within 100 kilometers of q. 


32. Let A be the set of all lines in the plane. A relation 
R is defined on A as follows: For every /; and /5 in 
A, L R L = 1, is parallel to l}. (Assume that a line 
is parallel to itself.) 


33. Let A be the set of all lines in the plane. A relation R 
is defined on A as follows: For every /, and /, in A, 


lL R L = l is perpendicular to l. 


In 34-36, assume that R is a relation on a set A. Prove or 
disprove each statement. 


34. If R is reflexive, then R`! is reflexive. 
35. If R is symmetric, then R | is symmetric. 


36. If R is transitive, then R`! is transitive. 


In 37-42, assume that R and S are relations ona set A. 
Prove or disprove each statement. 


37. If R and S are reflexive, is R N S reflexive? Why? 


H 38. If Rand S are symmetric, is R N S symmetric? Why? 


39. If R and S are transitive, is R N S transitive? Why? 
40. If R and S are reflexive, is R U S reflexive? Why? 


41. If R and S are symmetric, is R U S symmetric? 
Why? 
42. If R and S are transitive, is R U S transitive? Why? 
In 43-50, the following definitions are used: A relation on 
a set A is defined to be 
irreflexive if, and only if, for every x € A, x Rx; 


asymmetric if, and only if, for every x,y E AifxRy 
theny Rx; 


intransitive if, and only if, for every x,y,z EA, ifxRy 
andy Rzthenx Rz. 


For each of the relations in the referenced exercise, 
determine whether the relation is irreflexive, asymmetric, 
intransitive, or none of these. 


43. Exercise 1 44. Exercise 2 


45. Exercise 3 46. Exercise 4 


47. Exercise 5 48. Exercise 6 


49. Exercise 7 50. Exercise 8 
In 51-53, R, S, and Tare relations defined on A = {0, 1, 2, 3}. 


51. Let R = (0, 1), (0, 2), (1, 1), (1, 3), (2, 2), (3, 0). 
Find R’, the transitive closure of R. 


52. Let S = {(0, 0), (0, 3), (1, 0), (1, 2), (2, 0), (3, 2)}. 
Find S‘, the transitive closure of S. 
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53. Let T = {(0, 2), (1, 0), (2, 3), (3, D}. Find T’, the 55. Write a computer algorithm to test whether a rela- 
transitive closure of T. tion R defined on a finite set A is symmetric, where 
54. Write a computer algorithm to test whether a rela- A = {a[l], a[2],..., a[n]}. 


tion R defined on a finite set A is reflexive, where 56. Write a computer algorithm to test whether a rela- 


tion R defined on a finite set A is transitive, where 


A = {a{\], a[2],..., a[n]}. 


A = fall], a[2],..., alnl}. 


ANSWERS FOR TEST YOURSELF 


1. foreveryxinA,xRx 2. for every x and y in A, if x R y that there is an element x in A such that xx $8. show that 
thenyRx 3. for every x, y, and z in A, if x R y and y Rz there are elements x and y in A such that x R y but y# x 
thenxRz 4. xis any element of A;xRx 5.x and y are 9. show that there are elements x, y, and z in A such that 
any elements of A such that x R y, y Rx 6. x, y, and z are xRyandyRzbutxXz 10. R'is transitive; R C Rí; if S is 
any elements of A such that x R yand y Rz;xRz 7. show any other transitive relation that contains R, then R‘ C S 


EJ Equivalence Relations 


“You are sad” the Knight said in an anxious tone: “let me sing you a song to 
comfort you.” 

. The name of the song is called ‘Haddocks’ Eyes.’ 

“Oh, that’s the name of the song, is it?” Alice said, trying to feel interested. 


” 


“No, you don’t understand,” the Knight said, looking a little vexed. “That’s what 
the name is called. The name really is ‘The Aged Aged Man. 

“Then I ought to have said ‘That’s what the song is called’?” Alice corrected 
herself. 

“No, you oughtn’t: that’s quite another thing! The song is called ‘Ways and 


2» 


Means’: but that’s only what it’s called, you know!” 

“Well, what is the song, then?” said Alice, who was by this time completely 
bewildered. 
“I was coming to that,” the Knight said. “The song really is ‘A-sitting on a Gate’: and 
the tune’s my own invention.” 
—Lewis Carroll, Through the Looking Glass, 1872 


You know from your early study of fractions that each fraction has many equivalent forms. 
For example, 


1.2.3. =) =3:15 
24° 6 =2'=6' 30° 


.., and so on, 


are all different ways to represent the same number. They may look different; they 
may be called different names; but they are all equal. The idea of grouping together 
things that “look different but are really the same” is the central idea of equivalence 
relations. 
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The Relation Induced by a Partition 


A partition of a set A is a finite or infinite collection of nonempty, mutually disjoint 
subsets whose union is A. The diagram of Figure 8.3.1 illustrates a partition of a set A by 
subsets Aj, A2, . . . , Ag. 


5 A; N A; = Ø, whenever i # j 


J 
| UA, U+*UAG=A 


FIGURE 8.3.1 A Partition of a Set 


Definition 


Given a partition of a set A, the relation induced by the partition, R, is defined on 
A as follows: For every x, y € A, 


xRy < there is a subset A, of the partition 
such that both x and y are in A;. 


Relation Induced by a Partition 
Let A = {0, 1, 2, 3, 4} and consider the following partition of A: 
{0, 3, 4}, {1}, {2}. 

Find the relation R induced by this partition. 

Solution Since {0, 3, 4} is a subset of the partition, 
OR3 because both 0 and 3 are in {0, 3, 4} 
3 RO_ because both 3 and 0 are in {0, 3, 4} 
OR4 because both 0 and 4 are in {0, 3, 4} 
4R0_ because both 4 and 0 are in {0, 3, 4} 
3R4 because both 3 and 4 are in {0, 3, 4} 


Note These statements and 
may seem strange, but, 4R3 because both 4 and 3 are in {0, 3, 4}. 
after all, they are not false! 
Also, 
ORO because both 0 and 0 are in {0, 3, 4} 
3R3 because both 3 and 3 are in {0, 3, 4} 
and 


4R4 because both 4 and 4 are in {0, 3, 4}. 


Since {1} is a subset of the partition, 
1R1_ because both 1 and 1 are in {1}, 


and since {2} is a subset of the partition, 


2R2 because both 2 and 2 are in {2}. 
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Note The deduction that 
x E A; and x E A; is based 
on the logical equivalence 
of the statement forms p 
and p ^ p. 


Note The deduction that 
y € A; and x € A; is based 
on the logical equivalence 
of the statement forms 
p^qandq ^p. 
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Hence 


R = {(0, 0), 0, 3), (0, 4), (L, D, (2, 2), (3, 0), (3, 3), (3, 4), (4, 0), (4, 3), (4, 4}. 


The fact is that a relation induced by a partition of a set satisfies all three properties 


studied in Section 8.2: reflexivity, symmetry, and transitivity. 


Theorem 8.3.1 


Let A be a set with a partition and let R be the relation induced by the partition. Then 
R is reflexive, symmetric, and transitive. 


Proof: Suppose A is a set with a partition. In order to simplify notation, we assume 
that the partition consists of only a finite number of sets. The proof for an infinite 
partition is identical except for notation. Denote the partition subsets by 


Ai, Azs.. A: 


Then A; N A; = Ø whenever i # j, and A; U Ap U ++ UA, = A. The relation R in- 
duced by the partition is defined as follows: For every x, y E A, 


xRy < thereis a set A; of the partition 
such that x E A; and y E A;. 


[Idea for the proof of reflexivity: For R to be reflexive means that each element of A is 
related by R to itself. But by definition of R, for an element x to be related to itself means 
that x is in the same subset of the partition as itself. Well, if x is in some subset of the 
partition, then it is certainly in the same subset as itself. And x is in some subset of the 
partition because the union of the subsets of the partition is all of A. This reasoning is 
formalized as follows.] 


Proof that R is reflexive: Suppose x E A. Since Aj, A3, . . . , A, is a partition of A, it 
follows that x € A; for some i, and so the statement 


there is a set A; of the partition such that x € A; and x € A; 


is true. Thus, by definition of R, x R x. 


[Idea for the proof of symmetry: For R to be symmetric means that any time one ele- 
ment is related to a second, then the second is related to the first. Now for one element 
x to be related to a second element y means that x and y are in the same subset of the 
partition. But if this is the case, then y is in the same subset of the partition as x, so y is 
related to x by definition of R. This reasoning is formalized as follows.] 


Proof that R is symmetric: Suppose x and y are elements of A such that x R y. Then 
there is a subset A; of the partition such that x © A; and y € A; 
by definition of R. It follows that the statement 


there is a subset A; of the partition such that y © A; and x € A; 


is also true. Hence, by definition of R, y R x. 


(continued on page 508) 
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[Idea for the proof of transitivity: For R to be transitive means that any time one ele- 
ment of A is related by R to a second and that second is related to a third, then the first 
element is related to the third. But for one element to be related to another means that 
there is a subset of the partition that contains both. So suppose x, y, and z are elements 
such that x is in the same subset as y and y is in the same subset as z. Must x be in the 
same subset as z? Yes, because the subsets of the partition are mutually disjoint. Since 
the subset that contains x and y has an element in common with the subset that contains 
y and z (namely, y), the two subsets are equal. But this means that x, y, and z are all in 
the same subset, and so, in particular, x and z are in the same subset. Hence x is related 
by R to z. This reasoning is formalized as follows.] 


Proof that R is transitive: Suppose x, y, and z are in A and x R y and y R z. By defini- 
tion of R, there are subsets A; and A j of the partition such that 
xand yarein A; and yand zare in Aj. 


Suppose A; # Aj. [We will deduce a contradiction.] Then A; N A; = Ø since {Aj, Ao, 
A3,...,A,} is a partition of A. But y is in A; and y is in A; also. Hence A; N A; # ©. 
[This contradicts the statement that A; N A; = ©.] Thus A; = Aj. It follows that x, y, 
and z are all in A;, and so, in particular, 

x and z are in A;. 


Thus x R z by definition of R. 


Definition of an Equivalence Relation 


A relation on a set that satisfies the three properties of reflexivity, symmetry, and transitiv- 
ity is called an equivalence relation. 


Definition 


Let A be a set and R a relation on A. R is an equivalence relation if, and only if, R is 
reflexive, symmetric, and transitive. 


Thus, according to Theorem 8.3.1, the relation induced by a partition is an equivalence 
relation. A variety of additional examples of equivalence relations are given below and in 
the exercises. 

An Equivalence Relation on a Set of Subsets 
Let X be the set of all nonempty subsets of {1, 2, 3}. Then 
X = {{1}, {2}, {3}, (1, 2}, (1, 3}, (2, 3}, (1, 2, 3}. 
Define a relation R on X as follows: For every A and B in X, 
ARB & the least element of A equals the least element of B. 
Prove that R is an equivalence relation on X. 


Solution 


R is reflexive: Suppose A is a nonempty subset of {1, 2, 3}. [We must show that AR A.] It 
is true to say that the least element of A equals the least element of A. Thus, by definition 
of R, ARA. 
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R is symmetric: Suppose A and B are nonempty subsets of {1, 2, 3} and A R B. [We must 
show that B R A.] Since AR B, the least element of A equals the least element of B. But this 
implies that the least element of B equals the least element of A, and so, by definition of 
R, BRA. 


R is transitive: Suppose A, B, and C are nonempty subsets of {1, 2, 3}, A R B, and BRC. 
[We must show that AR C.] Since AR B, the least element of A equals the least element of B 
and since BR C, the least element of B equals the least element of C. Thus the least element 
of A equals the least element of C, and so, by definition of R, A R C. | 


Seem Equivalence of Digital Logic Circuits Is an Equivalence Relation 


Let S be the set of all digital logic circuits with a fixed number n of inputs. Define a relation 
E on S as follows: For all circuits C, and C in S, 


CEC, < Chas the same input/output table as C). 


If Ci E Cy, then circuit Cj is said to be equivalent to circuit C}. Prove that E is an equiva- 
lence relation on S. 


Solution 


E is reflexive: Suppose C is a digital logic circuit in S. [We must show that CE C.] Certainly 
C has the same input/output table as itself. Thus, by definition of E, CE C [as was to be 
shown]. 


E is symmetric: Suppose C, and C, are digital logic circuits in § such that C, E Cy. [We 
must show that C E C,.] By definition of E, since C, E C,, then C, has the same input/out- 
put table as C}. It follows that C, has the same input/output table as C4. Hence, by defini- 
tion of E, C, E C, [as was to be shown]. 


Eis transitive: Suppose C}, C3, and C; are digital logic circuits in S such that C, E C, and 
C, E C3. [We must show that C, E C3.] By definition of E, since C, E C, and C, E C3, then 


Cı has the same input/output table as Cy 
and 

Cə has the same input/output table as C3. 
It follows that 

C; has the same input/output table as C3. 


Hence, by definition of E, C, E C3 fas was to be shown]. 


Since E is reflexive, symmetric, and transitive, E is an equivalence relation on S. E 


Certain implementations of computer languages do not place a limit on the allowable 
length of an identifier. This permits a programmer to be as precise as necessary in naming 
variables without having to worry about exceeding length limitations. However, compil- 
ers for such languages often ignore all but some specified number of initial characters: As 
far as the compiler is concerned, two identifiers are the same if they have the same initial 
characters, even though they may look different to a human reader of the program. For 
example, to a compiler that ignores all but the first eight characters of an identifier, the 
following identifiers would be the same: 


NumberOfScrews MNumberOfBolts. 
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Obviously, in using such a language, the programmer has to be sure to avoid giving two 
distinct identifiers the same first eight characters. When a compiler lumps identifiers to- 
gether in this way, it sets up an equivalence relation on the set of all possible identifiers in 
the language. Such a relation is described in the next example. 


A Relation on a Set of Identifiers 


Let L be the set of all allowable identifiers in a certain computer language, and define a 
relation R on L as follows: For all strings s and t in L, 


sRt <> thefirst eight characters of s equal the first eight characters of t. 


Prove that R is an equivalence relation on L. 


Solution 


R is reflexive: Let s E L. [We must show that s R s.] Clearly s has the same first eight char- 
acters as itself. Thus, by definition of R, s R s [as was to be shown]. 


R is symmetric: Let s and t be in L and suppose that s R t. [We must show that t R s.] By 
definition of R, since s R t, the first eight characters of s equal the first eight characters of t. 
It follows that the first eight characters of t equal the first eight characters of s, and so, by 
definition of R, t R s [as was to be shown]. 


R is transitive: Let s, t, and u be in L and suppose that s R t and t R u. [We must show that 
s R u.] By definition of R, since s R t and t R u, the first eight characters of s equal the first 
eight characters of t, and the first eight characters of t equal the first eight characters of u. 
Hence the first eight characters of s equal the first eight characters of u. Hence, by defini- 
tion of R, s R u [as was to be shown]. 

Since R is reflexive, symmetric, and transitive, R is an equivalence relation on L. a 


Equivalence Classes of an Equivalence Relation 


Suppose there is an equivalence relation on a certain set. If a is any particular element of 
the set, then one can ask, “What is the subset of all elements that are related to a?” This 
subset is called the equivalence class of a. 


pier aie 


distinguish among the 


following: (1) a relation Suppose A is a set and R is an equivalence relation on A. For each element a in A, the 
on a set, (2) the (underly- equivalence class of a, denoted [a] and called the class of a for short, is the set of all 
ing) set itself, and (3) the elements x in A such that x is related to a by R. 

equivalence class for an 

element of the (underly- In symbols: 

ing) set. [a] = {x EA|xRa} 


The procedural version of this definition is 


for every x EA, xEļa] < xRa. 


When several equivalence relations on a set are under discussion, the notation [a] may 
be used to denote the equivalence class of a for the relation R. 
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Equivalence Classes of a Relation Given as a Set of Ordered Pairs 
Let A = {0, 1, 2, 3, 4} and define a relation R on A as follows: 
R = {(0, 0), ©, 4), (1, D, (A, 3), 2, 2), G, D, G, 3), 4, 0), 4, 4)}- 


The directed graph for R is as shown below. As can be seen by inspection, R is an equiva- 
lence relation on A. Find the distinct equivalence classes of R. 


é 


Solution First find the equivalence class of every element of A. 


[0] = {x E A| xR 0} = {0, 4} 
[1] ={xEA|xR = {1,3} 
[2] = {x EA | xR 2} = {2} 

[3] = {x © Al xR 3} = {1, 3} 
[4] = {x E A | xR 4} = {0, 4} 


Note that [0] = [4] and [1] = [3]. Thus the distinct equivalence classes of the relation are 


{0, 4}, {1, 3}, and {2}. 5 


When a problem asks you to find the distinct equivalence classes of an equivalence rela- 
tion, you will generally solve the problem in two steps. In the first step you either explicitly 
construct (as in Example 8.3.5) or imagine constructing (as in infinite cases) the equiva- 
lence class for each element of the domain A of the relation. Usually several of the classes 
will contain exactly the same elements, so in the second step you must take a careful look 
at the classes to determine which are the same. You then indicate the distinct equivalence 
classes by describing them without duplication. 


SEWE Equivalence Classes of a Relation on a Set of Subsets 


In Example 8.3.2 it was shown that the relation R was an equivalence relation, where for 
nonempty subsets A and B of {1, 2, 3} to be related by R means that they have the same 
least element. Describe the distinct equivalence classes of R. 


Solution The equivalence class of {1} is the set of all the nonempty subsets of {1, 2, 3} 
whose least element is 1. Thus 


HIH = {{1}, (1, 2}, {1, 3}, {1, 2, 3}} 


The equivalence class of {2} is the set of all the nonempty subsets of {1, 2, 3} whose least 
element is 2. Thus 


H2} = {{2}, {2, 3). 
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The equivalence class of {3} is the set of all the nonempty subsets of {1, 2, 3} whose least 
element is 3. There is only one such set: {3} itself. Thus 


[{3}] = {{3}}. 
Since all the nonempty subsets of {1, 2, 3} are in one of the equivalence classes, this is a 
complete listing. Moreover, these classes are all distinct. E 


SCW ea Equivalence Classes of Identifiers 


In Example 8.3.4 it was shown that the relation R of having the same first eight characters 
is an equivalence relation on the set L of allowable identifiers in a computer language. 
Describe the distinct equivalence classes of R. 


Solution By definition of R, two strings in L are related by R if, and only if, they have 
the same first eight characters. Given any string s in L, 


[s] ={t EL | tRs} 
= {t E L| the first eight characters of t equal the first eight characters of s}. 


Thus the distinct equivalence classes of R are sets of strings such that (1) each class 
consists entirely of strings all of which have the same first eight characters, and (2) any two 
distinct classes contain strings that differ somewhere in their first eight characters. E 


Seu eee Equivalence Classes of the Identity Relation 


Let A be any set and define a relation R on A as follows: For every x and y in A, 


xRy & x=y. 


Then R is an equivalence relation. /To prove this, just generalize the argument used in 
Example 8.2.2.] Describe the distinct equivalence classes of R. 


Solution Given any a in A, the class of a is 
[la] = {xE A|xRa}. 
Now by definition of R, a R x if, and only if, a = x. So 


[a] = {x EA|x =a} 
= {a} since the only element of A that equals a is a. 


Hence, given any a in A, 


la] = {a}, 


and if x # a then {x} # {a}. Consequently, all the classes of all the elements of A are dis- 
tinct, and the distinct equivalence classes of R are all the single-element subsets of A. E 


In each of Examples 8.3.5, 8.3.6, 8.3.7, and 8.3.8, the set of distinct equivalence classes of 
the relation consists of mutually disjoint subsets whose union is the entire domain A of the 
relation. This means that the set of equivalence classes of the relation forms a partition of the 
domain A. In fact, it is always the case that the equivalence classes of an equivalence relation 
partition the domain of the relation into a union of mutually disjoint subsets. We establish the 
truth of this statement in stages, first proving two lemmas and then proving the main theorem. 

The first lemma says that if two elements of A are related by an equivalence relation R, 
then their equivalence classes are the same. 
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Lemma 8.3.2 


Suppose A is a set, R is an equivalence relation on A, and a and b are elements of A. 
If a R b, then [a] = [b]. 


This lemma says that if a certain condition is satisfied, then [a] = [b]. Now [a] and 
[b] are sets, and two sets are equal if, and only if, each is a subset of the other. Hence the 
proof of the lemma consists of two parts: first, a proof that [a] C [b] and second, a proof 
that [b] C [a]. To show each subset relation, it is necessary to show that every element in 
the left-hand set is an element of the right-hand set. 


Proof of Lemma 8.3.2: 
Let A be a set, let R be an equivalence relation on A, and suppose 


a and b are elements of A such that a R b. 


[We must show that [a] = [b].] 
Proof that [a] C [b]: Let x E [a]. [We must show that x E [b].] Since 


xE [a], 
then 
xRa 
by definition of class. But 
aRb 
by hypothesis. Thus, by transitivity of R, 
xRb. 
Hence 
x E [b] 


by definition of class. /This is what was to be shown.] 


Proof that [b] C [a]: Let x E [b]. [We must show that x € [a].] Since 


xE [b] 
then 
xRb 
by definition of class. Now 
aRb 
by hypothesis. Thus, since R is symmetric, 
bRa 
also. Then, since R is transitive and x R b and b R a, 
xRa. 
Hence, 
xE |a] 


by definition of class. /This is what was to be shown.] 


Since [a] C [b] and [b] C [a], it follows that [a] = [b] by definition of set equality. 
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The second lemma says that any two equivalence classes of an equivalence relation are 
either mutually disjoint or identical. 


Lemma 8.3.3 


If A is a set, R is an equivalence relation on A, and a and b are elements of A, then 


either [a] n [b] =Ø or [a]= [b]. 


The statement of Lemma 8.3.3 has the form 


if p then (q or r), 
Note You can always where p is the statement “A is a set, R is an equivalence relation on A, and a and b are 
prove a statement of the elements of A,” q is the statement “[a] N [b] = ©,” and r is the statement “[a] = [b].” To 
form “if p then (q or r)” prove the lemma, we will prove the logically equivalent statement 
by proving one of the 
logically equivalent state- if (p and not q) then r. 


ments: “if (p and not q) 

then r” or “if (p and not r) 

then q.”* If A is a set, R is an equivalence relation on A, a and b are elements of 
A, and [a] N [b] + Ø, then [a] = [b]. 


That is, we will prove the following: 


Proof of Lemma 8.3.3: 


Suppose A is a set, R is an equivalence relation on A, a and b are elements of A, and 
[a] N [b] # Ø. 


[We must show that [a] = [b].] Since [a] N [b] # Ø, there exists an element x in A 
such that x E [a] N [b]. By definition of intersection, 


x€[a] and x€[b], 
and so xRa and xRb 


by definition of class. Since R is symmetric [being an equivalence relation] and x R a, 
then a R x. But R is also transitive [since it is an equivalence relation], and so, since 
aRxandxRb, 


aRb. 
Now a and b satisfy the hypothesis of Lemma 8.3.2. Hence, by that lemma, 
[a] = [b] 


[as was to be shown]. 


Theorem 8.3.4 The Partition Induced by an Equivalence Relation 


If A is a set and R is an equivalence relation on A, then the distinct equivalence 
classes of R form a partition of A; that is, the union of the equivalence classes is all 
of A, and the intersection of any two distinct classes is empty. 


*See exercise 14 in Section 2.2. 
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The proof of Theorem 8.3.4 is divided into two parts: first, a proof that A is the union 
of the equivalence classes of R and second, a proof that the intersection of any two distinct 
equivalence classes is empty. The proof of the first part follows from the fact that the rela- 
tion is reflexive. The proof of the second part follows from Lemma 8.3.3. 


Proof of Theorem 8.3.4: 


Suppose A is a set and R is an equivalence relation on A. For notational simplicity, 
we assume that R has only a finite number of distinct equivalence classes, which we 
denote 


Ai Az, -p Åp 


where n is a positive integer. (When the number of classes is infinite, the proof is 
identical except for notation.) 


Proof that A = A; UA, U: +U Ap: [We must show that ACA UA U> UA, 
and that A, U A, U ++: UA, CAJ 


To show that A C A, U A U ++: UA,, suppose x is any element of A. [We must 
show that x E Ay U A; U-:: UA,,./ By reflexivity of R, x R x. And this implies that 
x E [x] by definition of class. Since x is in some equivalence class, it must be in one 
of the distinct equivalence classes A), A2, ..., or A,. Thus x E A; for some index i, 
and hence x E A; U A> U ++- U A, by definition of union fas was to be shown]. 

To show that A, UA, U +++ U A, CA, suppose x E A; UA, U ++ UA,. [We 
must show that x E A.] Then x € A; for some i = 1, 2,..., n, by definition of union. 
Now each A; is an equivalence class of R, and equivalence classes are subsets of A. 
Hence A; C A and so x E A [as was to be shown]. 

Since A C A; UA) U +++ UA, and A; UA, U ++: UA, CA, then by definition 
of set equality, A = A, U Ay U ++: U Ap 


Proof that the distinct classes of R are mutually disjoint: Suppose that A; and A; 
are any two distinct equivalence classes of R. [We must show that A; and A, are dis- 
joint.] Since A; and A; are distinct, then A; # A;. And since A; and A; are equivalence 
classes of R, there must exist elements a and b in A such that A; = [a] and A; = [b]. 
By Lemma 8.3.3, 


either [a] A [bh] =Ø or [a] = [b]. 


Now [a] # [b] because A; # A;, and hence [a] N [b] = Ø. Thus A; N A; = Ø, and 
so A; and A; are disjoint /as was to be shown]. 


Equivalence Classes of Digital Logic Circuits 


In Example 8.3.3 it was shown that the relation of equivalence among circuits is an equiva- 
lence relation. Let S be the set of all digital logic circuits with exactly two inputs and one 
output. The binary relation E is defined on S as follows: For every C, and C; in S, 


CEC <= Chas the same input/output table as C). 


Describe the equivalence classes of this relation. How many distinct equivalence classes 
are there? Find two different circuits that are in one of the classes. 
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Solution Given a circuit C, the equivalence class of C is the set of all circuits with two 
input signals and one output signal that have the same input/output table as C. Now each 
input/output table has exactly four rows, corresponding to the four possible combinations 
of inputs: 11, 10, 01, and 00. A typical input/output table is the following: 


Input Output 


P Q R 
1 1 0 
1 0 0 
0 1 0 
0 0 1 


There are exactly as many such tables as there are binary strings of length 4. The reason 
is that distinct input/output tables can be formed by changing the pattern of the four 0’s 
and 1’s in the output column, and there are as many ways to do that as there are strings of 
four 0’s and 1’s. And since the number of binary strings of length 4 is 2* = 16, there are 16 
distinct input/output tables. 

This implies that there are exactly 16 equivalence classes of circuits, one for each dis- 
tinct input/output table. However, there are infinitely many circuits that give rise to each 
table. For instance, two circuits for the previous input/output table are shown below. 


oe a 


Congruence Modulo n 


Example 8.2.4 showed that the relation of congruence modulo 3 is reflexive, symmetric, 
and transitive. Therefore, it is an equivalence relation. 


ZEWJGESALUI Equivalence Classes of Congruence Modulo 3 


Let R be the relation of congruence modulo 3 on the set Z of all integers. That is, for all 
integers m and n, 


mRn = 3|(m—n). 


Describe the distinct equivalence classes of R. 
Solution For each integer a, 
[a] = {x E Z| x Ra} 


= {x E€Z|3|@-a} 
= {x E Z |x— a = 3k, for some integer k}. 


Therefore, 


[a] = {x E Z| x = 3k + a, for some integer k}. 
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In particular, 

[0] = {x E Z| x = 3k +0, for some integer k} 
= {x E Z| x = 3k, for some integer k} 
= {....—9, —6, —3, 0,3,.6, 9, ....}, 

[1] = {x E Z |x = 3k + 1, for some integer k} 
= {...—8,.=5,.=2, 1,4,.7,.10,...}, 

[2] = {x E Z |x = 3k +2, for some integer k} 
= {...= 7, —4, =1,2,5,8, Ty ..} 


Now since 3 R 0, then by Lemma 8.3.2, 


More generally, by the same reasoning, 


[0] = [3] = [-3] = [6] = [-6] =..., and so on. 


Similarly, 


[1] = [4] = [-2] = [7] = [-5] =..., and so on. 
And 


[2] = [5] = [-1] = [8] = [-4] =..., and so on. 


Notice that every integer is in class [0], [1], or [2]. Hence the distinct equivalence classes 
are 


{x E Z| x = 3k, for some integer k}, 
{xEZ | x = 3k + 1, for some integer k}, and 
{x E Z| x = 3k +2, for some integer k}. 


In words, the three classes of congruence modulo 3 are (1) the set of all integers that are 
divisible by 3, (2) the set of all integers that leave a remainder of 1 when divided by 3, and 
(3) the set of all integers that leave a remainder of 2 when divided by 3. B 


Example 8.3.10 illustrates a very important property of equivalence classes, namely that 
an equivalence class may have many different names. In Example 8.3.10, for instance, the 
class of 0, [0], may also be called the class of 3, [3], or the class of —6, [—6]. But what the 
class is is the set of all integers that are divisible by 3: 


{x E Z| x = 3k, for some integers k}. 


(The quote at the beginning of this section refers in a humorous way to the philosophically 
interesting distinction between what things are called and what they are.) 


Definition 


Suppose R is an equivalence relation on a set A and S is an equivalence class of R. A 
representative of the class S is any element a such that [a] = S. 


In exercises 36—41 at the end of this section, you are asked to show, in effect, that if a is 
any element of an equivalence class S, then S = [a]. Hence any element of an equivalence 
class is a representative of that class. 
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The following notation is used frequently when referring to congruence relations. It was in- 
troduced by Carl Friedrich Gauss in the first chapter of his book Disquisitiones Arithmeticae. 
This work, which was published when Gauss was only 24, laid the foundation for modern 
number theory. 


Definition 


Let m and n be integers and let d be a positive integer. We say that m is congruent 
to n modulo d and write 


Bettmann/Getty Images 


m = n (mod d) 
if, and only if, d|(m—n). 
Carl Friedrich Gauss Symbolically: m=n(modd) = d| (m—n). 


(1777-1855) 


Exercise 17(b) at the end of this section asks you to show that m = n (mod d) if, and only 
if, m mod d = n mod d, where m, n, and d are integers and d is positive. 


SEAL Evaluating Congruences 


Determine which of the following congruences are true and which are false. 
a. 12 = 7 (mod 5) b. 6 = —8 (mod 4) c. 3 = 3 (mod 7) 
Solution 

a. True. 12—7=5 = 5-1. Hence 5 | (12 —7), and so 12 = 7 (mod 5). 


b. False. 6 — (—8) = 14, and 4/14 because 14 # 4-k for any integer k. Consequently, 
6 Æ — 8 (mod 4). 


c. True. 3—3 = 0 = 7-0. Hence 7 | (3 — 3), and so 3 = 3 (mod 7). E 


The discussion of binary arithmetic in Section 2.5 used a simplified computer model, 
which used 8-bit two’s complements and binary arithmetic to represent and compute with the 
256 integers from — 128 to 127 — 1. To find 78 + (—46), the following scheme was shown: 


O;1;0;/0; 1} 1]; 1) 0] <7 


+L1[1Jo[1Jofofi [ojo 


1 Lofo][i{o]fofofofo| 


Now the number corresponding to 100100000, is 288, which is too large to be the 8-bit 
binary representation of a number. But the directions said to discard the | in the oth posi- 
tion. When this is done, the result is 00100000,, which corresponds to 32 and is the correct 
answer for the problem. 

To see why this method works, observe that the decimal forms of the two’s comple- 
ments for 78 and —46 are 78 and 2° — 46, respectively. In addition, 


78 + (28 — 46) = 2° + (78 — 46) = (78 — 46) (mod 2°). 


Thus reducing the number 78 + (28 — 46) modulo 28 gives the correct result. And doing so is 
equivalent to dropping the 1 in the 2th position of the computation’s result because the 2th 
position holds the place for the number 28 when integers are represented in binary notation. 
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A Definition for Rational Numbers 


For a moment, forget what you know about fractional arithmetic and look at the numbers 
1 2 
> and > 
3 6 
as symbols. Considered as symbolic expressions, these appear quite different. In fact, if 
they were written as ordered pairs 
(1,3) and (2,6) 


they would be different. The fact that we regard them as “the same” is a specific instance 
of our general agreement to regard any two numbers 


a c 
— and — 
b d 


as equal provided the cross products are equal; in other words, if, and only if, ad = be. 
This can be formalized as follows, using the language of equivalence relations. 


Se ee Rational Numbers Are Really Equivalence Classes 


Let A be the set of all ordered pairs of integers for which the second element of the pair is 
nonzero. Symbolically: 


A=ZX (Z—{0}). 


Define a relation R on A as follows: For all pairs (a, b) and (c, d) in A, 


(a,b)R(c,d) & ad=be. 


The fact is that R is an equivalence relation. 


a. Prove that R is transitive. (Proofs that R is reflexive and symmetric are left to exercise 
42 at the end of the section.) 


b. Describe the distinct equivalence classes of R. 
Solution 


a. [We must show that for all ordered pairs (a, b), (c, d), and (e, f) in A, if (a, b) R (c, d) and 
(c, d) R (e, f), then (a, b) R (e, f).] Suppose (a, b), (c, d), and (e, f) are particular but arbi- 
trarily chosen elements of A such that (a, b) R (c, d) and (c, d) R (e, f). [We must show 
that (a, b) R (e, f).] By definition of R, 


(l)ad=be and (2) cf= de. 


Since the second elements of all ordered pairs in A are nonzero, b # 0, d # 0, and 
f # 0. Multiply both sides of equation (1) by f and both sides of equation (2) by b to 
obtain 


(1') adf = bef and (2’) bcf = bde. 
Because both equal bcf, 
adf = bde, 


and, since d # 0, it follows from the cancellation law for multiplication (T7 in 
Appendix A) that 


af = be. 
Hence, by definition of R, (a, b) R (e, f) [as was to be shown]. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


520 CHAPTER8 PROPERTIES OF RELATIONS 


b. There is one equivalence class for each distinct rational number. Each equivalence class 
consists of all ordered pairs (a, b) that, if written as fractions a/b, would equal each 
other. The reason is that the condition for two rational numbers to be equal is the same 
as the condition for two ordered pairs to be related. For instance, the class of (1, 2) is 


LC, 2)] = {C, 2), (—1, —2), 2, 4), (-2, —4), (3, 6), (—3, —6),.-. } 


= = -3 
since 4 = Z = : =% and so forth. | 


It is possible to expand the result of Example 8.3.12 to define operations of addition and 
multiplication on the equivalence classes of R that satisfy all the same properties as the ad- 
dition and multiplication of rational numbers. (See exercise 43.) It follows that the rational 
numbers can be defined as equivalence classes of ordered pairs of integers. Similarly (see 
exercise 44), it can be shown that all integers, negative and zero included, can be defined 
as equivalence classes of ordered pairs of positive integers. In the late nineteenth century, 
F. L. G. Frege and Giuseppe Peano showed that the positive integers can be defined entirely 
in terms of sets. And just a little earlier, Richard Dedekind (1848-1916) showed that all 
real numbers can be defined as sets of rational numbers. Taken together, these results show 
that the set of real numbers can be defined using logic and set theory alone. 


TEST YOURSELF 


1. For a relation on a set to be an equivalence rela- 5. IfA is a set and R is an equivalence relation on 


tion, it must be 


” 


2. The notation m = n (mod d) is read “ and 


means that 


3. Given an equivalence relation R on a set A and 
given an element a in A, the equivalence class of a 
is denoted and is defined to be 


4. If A is a set, R is an equivalence relation on A, and 
a and b are elements of A, then either [a] = [b] 
or 


EXERCISE SET 8.3 


A, then the distinct equivalence classes of R form 


6. Let A = Z X (Z — {0}), and define a relation R 
on A by specifying that for every (a, b) and (c, d) 
in A, (a, b) R (c, d) if, and only if, ad = bc. Then 
there is exactly one equivalence class of R for each 


1. Suppose that $ = {a, b, c, d, e} and R is a relation 
on § such that a R b, b R c, and d R e. List all of the 
following that must be true if R is (a) reflexive (but 
not symmetric or transitive), (b) symmetric (but not 
reflexive or transitive), (c) transitive (but not reflex- 
ive or symmetric), and (d) an equivalence relation. 
cRb cRc aRc bRa 
aRd eRa eRd cRa 


2. Each of the following partitions of {0, 1, 2, 3, 4} 
induces a relation R on {0, 1, 2, 3, 4}. In each 
case, find the ordered pairs in R. 

a. {0, 2}, {1}, {3, 4} b. {0}, {1, 3, 4}, {2} 
c. {O}, {1, 2, 3, 4} 


In each of 3-6, the relation R is an equivalence rela- 
tion on A. As in Example 8.3.5, first find the specified 
equivalence classes. Then state the number of distinct 
equivalence classes for R and list them. 
3. A= {0, 1, 2, 3, 4} 
R= {(0, 0), (0, 4, (1, 1), G, 3), 2, 2), G, 1), G, 3), 
(4, 0), (4, 4)} 
equivalence classes: [0], [1], [2], [3] 
4. A= {a,b,c,d} 
R = {(a, a), (b, b), (b, d), (c, c), (d, b), (d, d)} 


equivalence classes: [a], [b], [c], [d] 
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5. A= {1,2,3,4,...,20}. Ris defined on A as 
follows: 


Forallx,y EA, xRy — 4|(-y). 
equivalence classes: [1], [2], [3], [4], [5] 


6. A = {—4, —3, —2, —1, 0, 1, 2, 3, 4, 5}. R is de- 
fined on A as follows: 


Forallx,y EA, xRy © 3|(x-y). 


equivalence classes: [0], [1], [2], [3] 
In each of 7-14, the relation R is an equivalence relation 
on the set A. Find the distinct equivalence classes of R. 


7. A = {(, 3), 2, 4,(—4, —8), G, 9), (, 5), G, 6). 
R is defined on A as follows: For every (a, b), 
(c, d) E A, 
(a,b) R (c,d) © ad= bc. 
8. X = {a,b,c} and A = P(X). R is defined on A as 
follows: For all sets v and v in P(X), 
uRv © Nw) = N(v). 


(That is, the number of elements in v equals the 
number of elements in v.) 


9. X = {-1,0, 1} and A = P(X). R is defined on 
P(X) as follows: For all sets s and rin P(X), 

the sum of the elements in s 

equals the sum of the elements in 7. 


10. A = {-5, —4, —3, —2, -1, 0, 1, 2, 3, 4, 5}. 
R is defined on A as follows: For all m, n E Z, 


sRr @& 


mRn & 3 | (m-n). 


11. A = {—4, —3, —2, —1, 0, 1, 2, 3, 4}. R is defined 
on A as follows: For every (m, n) € A, 


mRn & 4| m-n’). 


12. A = {—4, —3, —2, —1, 0, 1, 2, 3, 4}. R is defined 
on A as follows: For all (m, n) E A, 


mRn & 5 | (m? —n’). 


13. A is the set of all strings of length 4 in a’s and b’s. 


R is defined on A as follows: For all strings s and 
tind, 
sRt © 
14. A is the set of all strings of 0’s, I’s, and 2’s that 
have length 4 and for which the sum of the char- 


acters in the string is less than or equal to 2. R is 
defined on A as follows: For every s, t E A, 


s has the same first two characters as t. 
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s R t < the sum of the characters of s equals the 
sum of the characters of t. 


15. Determine which of the following congruence 
relations are true and which are false. 
a. 17 = 2 (mod 5) 


b. 4 = —5 (mod 7) 
c. —2 = —8 (mod 3) 
d. —6 = 22 (mod 2) 


16. a. Let R be the relation of congruence modulo 3. 
Which of the following equivalence classes are 


equal? 
[7], [—4], [—6], [17], [4], [27], [19] 


b. Let R be the relation of congruence modulo 7. 
Which of the following equivalence classes are 
equal? 


[35], [3], L-71, [12], [0], [—2], [17] 


17. a. Prove that for all integers m and n, m = n 

(mod 3) if, and only if, m mod 3 = n mod 3. 

b. Prove that for all integers m and n and any 
positive integer d, m = n (mod d) if, and only 


if, m mod d = n mod d. 


18. a. Give an example of two sets that are distinct 
but not disjoint. 
b. Find sets A; and A, and elements x, y, and z such 
that x and y are in A, and y and z are in A, but x 
and z are not both in either of the sets A; or A>. 
In 19-31, (1) prove that the relation is an equivalence rela- 
tion, and (2) describe the distinct equivalence classes of 
each relation. 


19. A is the set of all students at your college. 
a. Ris the relation defined on A as follows: For 
every x and yin A, 


xRy < _ xhas the same major (or double 


major) as y. 


(Assume “undeclared” is a major.) 
b. Sis the relation defined on A as follows: For 
every x, y E A, 


xSy < xis the same age as y. 
20. E is the relation defined on Z as follows: 


For every m, nE Z, mEn & 4|(m—n). 


H 21. R is the relation defined on Z as follows: 


For every m,n EZ, mRn <—& 7Tm-—Snis even. 
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22. 


23. 


24. 


25. 


H 26. 


27. 


28. 


For every x, y E R, 


29. 


30. 


31. 


CHAPTERS PROPERTIES OF RELATIONS 


Let A be the set of all statement forms in three 
variables p, q, and r. R is the relation defined on A 
as follows: For all P and Q in A, 


PRO © PandQ have the same truth table. 


Let P be a set of parts shipped to a company from 
various suppliers. S is the relation defined on P as 
follows: For every x, y € P, 


xSy < _ xhas the same part number and is 
shipped from the same supplier as y. 


Let A be the set of identifiers in a computer pro- 
gram. It is common for identifiers to be used for 
only a short part of the execution time of a program 
and not to be used again to execute other parts of 
the program. In such cases, arranging for identi- 
fiers to share memory locations makes efficient use 
of a computer’s memory capacity. Define a relation 
R on A as follows: For all identifiers x and y, 


xRy <æ the values of x and y are stored in the 


same memory location during 
execution of the program. 


A is the “absolute value” relation defined on R as 
follows: 


Foreveryx,yER,xAy © |x| = ly]. 


D is the relation defined on Z as follows: For every 
m,n € Z, 
mDn © 3 | (m? — n°). 


R is the relation defined on Z as follows: For every 
(m,n) E Z, 


mRn & 4| m-n’). 


Tis the relation defined on R as follows: 
mIn «æ x-—y isan integer. 


Define P on the set R X R of ordered pairs of 


real numbers as follows: For every (w, x), y, z) © 
R XR, 


w, x) PQ, © wey. 


Define Q on the set R X R as follows: For every 
w, x), (, 2) ER X R, 


(w, x) Q 0,2) © x=z. 


Let P be the set of all points in the Cartesian plane 
except the origin. R is the relation defined on P as 
follows: For every pı and p3 in P, 
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32. 


33. 


A={@y)ERXR|0<x<1 


=p, and pz lie on the same half-line 
emanating from the origin. 


Ppi R pr 


Let A be the set of all straight lines in the Carte- 
sian plane. Define a relation || on A as follows: 


For every /, and /, in A, 


LL < 1, is parallel to L. 


Then || is an equivalence relation on A. Describe 
the equivalence classes of this relation. 


Let A be the set of points in the rectangle with x 
and y coordinates between 0 and 1. That is, 


and OSy=lI}. 


Define a relation R on A as follows: For all (x;, yı) 
and (x5, y2) in A, 


(X1, YR, y2) > 


34. 


(1, y1) = (2, y2); Or 


x}=0O and x,=1 and y =y; or 
x,)=1 and x»=O0 and y =y; or 
y,=0O and y=1 and x, =x; or 
yp=1 and y=O0O and x, =X. 


In other words, all points along the top edge of 
the rectangle are related to the points along the 
bottom edge directly beneath them, and all points 
directly opposite each other along the left and 
right edges are related to each other. The points 
in the interior of the rectangle are not related to 
anything other than themselves. Then R is an 
equivalence relation on A. Imagine gluing togeth- 
er all the points that are in the same equivalence 
class. Describe the resulting figure. 


The documentation for the computer language 
Java recommends that when an “equals method” is 
defined for an object, it be an equivalence relation. 
That is, if R is defined as follows: 


xRy <= _ x.equals(y) forall objects in the class, 


then R should be an equivalence relation. Suppose 
that in trying to optimize some of the mathematics 
of a graphics application, a programmer creates an 
object called a point, consisting of two coordinates 
in the plane. The programmer defines an equals 
method as follows: If p and q are any points, then 


p.equals(q) <= the distance from p to q is 


less than or equal to c 


where c is a small positive number that depends 
on the resolution of the computer display. Is the 


programmer’s equals method an equivalence rela- 


tion? Justify your answer. 


35. Find an additional representative circuit for the 
input/output table of Example 8.3.9. 


Let R be an equivalence relation on a set A. Prove 
each of the statements in 36-41 directly from the 
definitions of equivalence relation and equivalence 
class without using the results of Lemma 8.3.2, Lem- 


ma 8.3.3, or Theorem 8.3.4. 


36. 
37. 
38. 


39. 
40. 


H 41. 
42. 


* 43. 


ANSWERS FOR TEST YOURSELF 


1. reflexive, symmetric, and transitive 2. m is congruent to n modulo d; d divides m — n 


For every ain A, a E [a]. 
For every a and b in A, if b E [a] then a R b. 


For every a, b, and c in A, if b R c and c E [a] 
then b E [a]. 


For every a and b in A, if [a] = [b] then a R b. 


For every a, b, and x in A, if a R b and x € [a] 
then x E [b]. 


For every a and b in A, if a E [b] then [a] = [b]. 


Let R be the relation defined in Example 8.3.12. 
a. Prove that R is reflexive. 

b. Prove that R is symmetric. 

c. List four distinct elements in [(1, 3)]. 

d. List four distinct elements in [(2, 5)]. 


In Example 8.3.12, define operations of addition 
(+) and multiplication (-) as follows: For every 
(a, b), (c, d) E A, 


[(a, b)] + [(c, d)] = [(ad + bc, bd)] 
[(a, b)]-[(c, d] = [(ac, bd)]. 

a. Prove that this addition is well defined. 
That is, show that if [(a, b)] = [(a', b’)] and 
[(c, d)] = [(c’, d')], then [(ad + bc, bd)] = 
[(a'd' + b'c', bayi: 

b. Prove that this multiplication is well defined. 
That is, show that if [(a, b)] = [(a’, b’)] 
and [(c, D] = [(c’, d’)], then [(ac, bd)] = 
[(a'c', b'd’)]. 

c. Show that [(0, 1)] is an identity element for ad- 
dition. That is, show that for any (a, b) € A, 


[(a, b)] + [(0, 1)] = [0, 1)] + [(a, b)] = [(a, b)]. 


d. Find an identity element for multiplication. 
That is, find (i, j) in A so that for every (a, b) in 
A, [(a, LG DI = [E]: Ka, b)] = [(@, b)]. 
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e. For any (a, b) E A, show that [(—a, b)] is 
an inverse for [(a, b)] for addition. That is, 
show that [(—a, b)] + [(a, b)] = [(a, b)] + 
[(—a, b)] = [(0, 1)]. 

f. Given any (a, b) E A with a + 0, find an 
inverse for [(a, b)] for multiplication. That 
is, find (c, d) in A so that [(a, b)]-[(c, d] = 
[(c, d)]-[(a, b)] = [@ j)], where [(, j)] 1s the 
identity element you found in part (d). 


44. Let A = Z* X Z`. Define a relation R on A as 


45. 


46. 


47. 


follows: For every (a, b) and (c, d) in A, 
(a,b)R(c,d) & atd=ctb. 


. Prove that R is reflexive. 

. Prove that R is symmetric. 

. Prove that R is transitive. 

. List five elements in [(1, 1]. 

. List five elements in [(3, 1]. 

. List five elements in [(1, 2)]. 

. Describe the distinct equivalence classes of R. 


am oannt 


The following argument claims to prove that the 
requirement that an equivalence relation be re- 
flexive is redundant. In other words, it claims to 
show that if a relation is symmetric and transi- 
tive, then it is reflexive. Find the mistake in the 
argument. 

“Proof: Let R be a relation on a set A and suppose 
R is symmetric and transitive. For any two elements 
x and yin A, if x R y then y R x since R is symmet- 
ric. Thus it follows by transitivity that x R x, and 
hence R is reflexive.” 


Let R be a relation on a set A and suppose R is 
symmetric and transitive. Prove the following: If 
for every x in A there is a y in A such that x R y, 
then R is an equivalence relation. 


Refer to the quote at the beginning of this section 

to answer the following questions. 

. What is the name of the Knight’s song called? 

. What is the name of the Knight’s song? 

. What is the Knight’s song called? 

. What is the Knight’s song? 

. What is your (full, legal) name? 

. What are you called? 

. What are you? (Do not answer this on paper; 
just think about it.) 


Tan DAanaw 


3. [a]; the set of all elements x in A 


such thatxRa 4.f[a]N[b]=@ 5S.apartition of A 6. rational number 
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E23 Modular Arithmetic with Applications 
to Cryptography 


The “real” mathematics of the “real” mathematicians, the mathematics of Fermat 
and Euler and Gauss and Abel and Riemann, is almost wholly “useless.” . . . It is not 
possible to justify the life of any genuine professional mathematician on the ground of 
the “utility” of his work. —G. H. Hardy, A Mathematician’s Apology, 1941 


Cryptography is the study of methods for sending secret messages. It involves encryption, 
in which a message, called plaintext, is converted into a form, called ciphertext, that is 
sent over a channel possibly open to view by outside parties. The receiver of the ciphertext 
uses decryption to convert the ciphertext back into plaintext. 

With the rise of electronic communication systems, especially the Internet, the most 
important current use of cryptography is to enable transmission of confidential informa- 
tion, such as banking data, medical records, credit card numbers, and governmental com- 
munications, over electronic channels. Developing products for encryption and decryption 
is one of the main activities of the National Security Agency, which is the largest employer 
of mathematicians in the United States. 

Many systems for sending secret messages require both the sender and the receiver to 
know both the encryption and the decryption procedures. For instance, an encryption sys- 
tem once used by Julius Caesar, and now called the Caesar cipher, encrypts messages by 
changing each letter of the alphabet to the one three places farther along, with X wrapping 
around to A, Y to B, and Z to C. In other words, say each letter of the alphabet is coded by 
its position relative to the others—so that A = 01, B = 02,..., Z = 26. If the numerical 
version of the plaintext for a letter is denoted M and the numeric version of the ciphertext 
is denoted C, then 


| C = (M+ 3) mod 26. | 


The receiver of such a message can easily decrypt it by using the formula 


M = (C—3) mod 26. 


For reference, here are the letters of the alphabet, together with their numeric equivalents: 


Encrypting and Decrypting with the Caesar Cipher 
a. Use the Caesar cipher to encrypt the message HOW ARE YOU. 
b. Use the Caesar cipher to decrypt the message L DP ILQH. 
Solution 
a. First, translate the letters of HOW ARE YOU into their numeric equivalents: 
08 15 23 OL 18 05 25 15 21. 
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Next, encrypt the message by adding 3 to each number. The result is 
11 18 26 04 21 08 02 18 24. 


Finally, substitute the letters that correspond to these numbers. The encrypted mes- 
sage becomes 


KRZ DUH BRX. 


b. First, translate the letters of L DP ILQH into their numeric equivalents: 
12 04 16 09 12 17 O08. 
Next, decrypt the message by subtracting 3 from each number: 
09 Ol 13 06 09 14 05. 
Then, translate back into letters to obtain the original message: I AM FINE. a 


One problem with the Caesar cipher is that given a sufficient amount of ciphertext a 
person with knowledge of letter frequencies in the language can easily figure out the ci- 
pher. Partly for this reason, even Caesar himself did not make extensive use of it. Another 
problem with a system like the Caesar cipher is that knowledge of how to encrypt a mes- 
sage automatically gives knowledge of how to decrypt it. When a potential recipient of 
messages passes the encryption information to a potential sender of messages, the channel 
over which the information is passed may itself be insecure. Thus the information may leak 
out, enabling an outside party to decrypt messages intended to be kept secret. 

With public-key cryptography, a potential recipient of encrypted messages openly dis- 
tributes a public key containing the encryption information. However, knowledge of the 
public key provides virtually no clue about how messages are decrypted. Only the recipi- 
ent has that knowledge. Regardless of how many people learn the encryption information, 
only the recipient should be able to decrypt messages that are sent. 

The first public-key cryptography system was developed in 1976-1977 by three 
mathematician/computer scientists working at M.LT.: Ronald Rivest, Adi Shamir, and 


From left to right: 
Ronald Rivest (born 
1948), Adi Shamir (born 
1952), and Leonard 
Adleman (born 1945) 


Kim Kulish/Getty Images 
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Leonard Adleman. In their honor it is called the RSA cipher. In order for you to learn how 
it works, you need to know some additional properties of congruence modulo n. 


Properties of Congruence Modulo n 


The first theorem in this section brings together a variety of equivalent ways of expressing 
the same basic arithmetic fact. Sometimes one way is most convenient; sometimes another 
way is best. You need to be comfortable moving from one to another, depending on the 
nature of the problem you are trying to solve. 


Theorem 8.4.1 Modular Equivalences 
Let a, b, and n be any integers and suppose n > 1. The following statements are all 
equivalent: 

nie b) 

2. a= b(modn) 

3. a= b + kn for some integer k 

4. aand b have the same (nonnegative) remainder when divided by n 

5 


. amodn = b modn 


Proof: We will show that (1) = (2) = (3) = (4) => (5) = (1). It will follow by the 
transitivity of if-then that all five statements are equivalent. 
So let a, b, and n be any integers with n > 1. 


Proof that (1) = (2): Suppose that n| (a — b). By definition of congruence modulo n, 
we can immediately conclude that a = b (mod n). 


Proof that (2) = (3): Suppose that a = b (mod n). By definition of congruence mod- 
ulo n, n | (a — b). Thus, by definition of divisibility, a — b = kn, for some integer k. 
Adding b to both sides gives that a = b + kn. 


Proof that (3) = (4): Suppose that a = b + kn, for some integer k. Use the quotient- 
remainder theorem to divide a by n to obtain 


a=qn+r where q andr are integers and 0 Sr < n. 
So r is the remainder obtained when a is divided by n. Substituting b + kn for a in the 
equation a = qn + r gives that 


b+kn = qn+r, 


and subtracting kn from both sides and factoring out n yields 
b=(q-k)n+tr. 


Now since 0 =r < n, the uniqueness property of the quotient-remainder theorem 
guarantees that r is also the remainder obtained when b is divided by n. Thus a and 
b have the same remainder when divided by n. 


Proof that (4) = (5): Suppose that a and b have the same remainder when di- 
vided by n. It follows immediately from the definition of the mod function that 
amodn = bmodn. 
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Proof that (5) = (1): Suppose that a mod n = b mod n. By definition of the mod 
function, a and b have the same remainder when divided by n. Thus, by the quotient- 
remainder theorem, we can write 


a=qnt+r and b=qnt+r_ where q, q2, andr are integers and 0 S r < n. 


It follows that 


a-b = (qun+r)— (gon +r) = (q-q). 


Therefore, since qı — q3 is an integer, n|(a— b). 


Another consequence of the quotient-remainder theorem is this: When an integer a is 
divided by a positive integer n, a unique quotient g and remainder r are obtained with the 
property that a = nq +r and 0 S r < n. Because there are exactly n integers that satisfy 
the inequality 0 = r < n (the numbers from 0 through n — 1), there are exactly n possible 
remainders that can occur. These are called the least nonnegative residues modulo n or 
simply the residues modulo n. 


Given integers a and n with n > 1, the residue of a modulo n is a mod n, the non- 
negative remainder obtained when a is divided by n. The numbers 0, 1, 2,...,n—1 
are called a complete set of residues modulo n. To reduce a number modulo n 
means to set it equal to its residue modulo n. If a modulus n > 1 is fixed throughout 
a discussion and an integer a is given, the words “modulo n” are often dropped and 
we simply speak of the residue of a. 


The following theorem generalizes several examples from Section 8.3. 


Theorem 8.4.2 Congruence Modulo n Is an Equivalence Relation 


If n is any integer with n > 1, congruence modulo n is an equivalence relation on the 
set of all integers. The distinct equivalence classes of the relation are the sets [0], [1], 
ie all whereiioneachiai—T0 laa ule 


[a] = {m EZ|m = a (mod n)}, 
or, equivalently, 


[a] = {m E Z| m = a + kn for some integer k}. 


Proof: Suppose n is any integer with n > 1. We must show that congruence modulo n 
is reflexive, symmetric, and transitive. 


Proof of reflexivity: Suppose a is any integer. To show that a = a (mod n), we must show 
that n | (a — a). Now a — a = 0, and n | 0 because 0 = n: 0. Therefore a = a (mod n). 


Proof of symmetry: Suppose a and b are any integers such that a = b (mod n). We 
must show that b = a (mod n). Now since a = b (mod n), then n | (a— b). Thus, by 


(continued on page 528) 
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definition of divisibility, a — b = nk, for some integer k. Multiply both sides of this 
equation by — 1 to obtain 


—(a— b) = —nk, 
or, equivalently, 
b—a = n(—k). 


Thus, by definition of divisibility n | (b — a), and so, by definition of congruence 
modulo n, b = a (mod n). 


Proof of transitivity: This is left as exercise 5 at the end of the section. 


Proof that the distinct equivalence classes are [0], [1], [2], . . . , [n — 1]: This is left 
as exercise 6 at the end of the section. 


Observe that there is a one-to-one correspondence between the distinct equivalence classes 
for congruence modulo n and the elements of a complete set of residues modulo n. 


Modular Arithmetic 


A fundamental fact about congruence modulo n is that if you first perform an addition, 
subtraction, or multiplication on integers and then reduce the result modulo n, you will ob- 
tain the same answer as if you had first reduced each of the numbers modulo n, performed 
the operation, and then reduced the result modulo n. For instance, instead of computing 


(5-8) = 40 = 1 (mod 3) 
you will obtain the same answer if you compute 
(5 mod 3) (8 mod 3) = 2:2 = 4 = | (mod 3). 


The fact that this process works is a result of the following theorem. 


Theorem 8.4.3 Modular Arithmetic 


Let a, b, c, d, and n be integers with n > 1, and suppose 


a=c(modn) and b= d (mod n). 


1. (a+b) = (c+ d) (mod n) 
2. (a—b) = (c—d) (mod n) 
3. ab = cd (mod n) 
4 


. da” = c” (mod n) for every positive integer m. 


Proof: Because we will make greatest use of part 3 of this theorem, we prove it here 
and leave the proofs of the remaining parts of the theorem to exercises 9-11 at the 
end of the section. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


8.4 MODULAR ARITHMETIC WITH APPLICATIONS TO CRYPTOGRAPHY 529 


Proof of Part 3: Suppose a, b, c, d, and n are integers with n > 1, and suppose a = b 
(mod n) and c = d (mod n). By Theorem 8.4.1, there exist integers s and f such that 


a=ct+sn and b=d+tn. 
Then 
ab = (c+ sn)(d + tn) by substitution 
= cd + ctn + snd + snin 
= cd+n(ct+sd+ stn) by algebra. 


Let k = ct + sd + stn. Then kis an integer because it is a sum of products of integers, 
and ab = cd + nk. Thus by Theorem 8.4.1, ab = cd (mod n). 


Getting Started with Modular Arithmetic 


The most practical use of modular arithmetic is to reduce computations involving large 
integers to computations involving smaller ones. For instance, note that 55 = 3 (mod 4) 
because 55 — 3 = 52, which is divisible by 4, and 26 = 2 (mod 4) because 26 — 2 = 24, 
which is also divisible by 4. Verify the following statements. 

a. 55 +26 = (3 +2) (mod 4) b. 55 — 26 = (3 — 2) (mod 4) 

c. 55+26 = (3:2) (mod 4) d. 55° = 37 (mod 4) 


Solution 

a. Compute 55 + 26 = 81 and 3 + 2 = 5. By definition of congruence modulo n, to 
show that 81 = 5 (mod 4), you need to show that 4 | (81 — 5). But this is true because 
81—5 = 76, and 4 | 76 since 76 = 4-19. 


b. Compute 55 — 26 = 29 and 3 — 2 = 1. By definition of congruence modulo n, to 
show that 29 = 1 (mod 4), you need to show that 4 | (29 — 1). But this is true because 
29 — 1 = 28, and 4 | 28 since 28 = 4-7. 

c. Compute 55:26 = 1430 and 3-2 = 6. By definition of congruence modulo n, to show 
that 1430 = 6 (mod 4), you need to show that 4 | (1430 — 6). But this is true because 
1430 — 6 = 1424, and 4 | 1424 since 1424 = 4:356. 


d. Compute 55° = 3025 and 3° = 9. By definition of congruence modulo n, to show 
that 3025 = 9 (mod 4), you need to show that 4 | (3025 — 9). But this is true because 
3025 — 9 = 3016, and 4 | 3016 since 3016 = 4:754. E 


In order to facilitate the computations performed in this section, it is convenient to ex- 
press part 3 of Theorem 8.4.3 in a slightly differently form. 


Corollary 8.4.4 
Let a, b, and n be integers with n > 1. Then 
ab = [(a mod n)(b mod n)| (mod n), 
or, equivalently, 
ab mod n = [(a mod n)(b mod n)| mod n. 


In particular, if m is a positive integer, then 
a” = [(a mod n)"| (mod n). 
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eu eee Computing a Product Modulo n 


As in Example 8.4.2, note that 55 = 3 (mod 4) and 26 = 2 (mod 4). Because both 3 and 2 
are less than 4, each of these numbers is a least nonnegative residue modulo 4. Therefore, 
55 mod 4 = 3 and 26 mod 4 = 2. Use the notation of Corollary 8.4.4 to find the residue of 
55-26 modulo 4. 


Solution Recall that to use a calculator to compute remainders, you can use the for- 
mula nmodd = n— d-|n/d]. If you are using a hand calculator with an “integer part” 
feature and both n and d are positive, then |n/d] is the integer part of the division of n by 
d. When you divide a positive integer n by a positive integer d with a more basic calcula- 
tor, you can see |n/d] on the calculator display by simply ignoring the digits that follow 
the decimal point. 

By Corollary 8.4.4, 


(55:26) mod 4 = {(55 mod 4)(26 mod 4)} mod 4 
= (3:2) mod 4 because 55 mod 4 = 3 and 26 mod 4 = 2 


6 mod 4 
= 2 because 4|(6 — 2) and 2 < 4. E 


When modular arithmetic is performed with very large numbers, as is the case for 
RSA crytography, computations are facilitated by using two properties of exponents. The 
first is 


x1 = (x®? for all real numbers x and a with x = 0. 8.4.1 


Thus, for instance, if x is any positive real number, then 


x! mod n = (xy mod n because (x°) = x* 


= Q? mod n}? modn by Corollary 8.4.4. 


Hence you can reduce x* modulo n by reducing x? modulo n and then reducing the square 
of the result modulo n. Because all the residues are less than n, this process limits the size 
of the computations to numbers that are less than n, which makes them easier to work 
with, both for humans (when the numbers are relatively small) and for computers (when 
the numbers are very large). 

A second useful property of exponents is 


xt? = x4 forall real numbers x, a, and b with x = 0. 8.4.2 


For instance, because 7 = 4+2+1, 


Thus, by Corollary 8.4.4, 
x’ mod n = {a mod n(x? mod n(x! mod n)} mod n. 


We first show an example that illustrates the application of formula (8.4.1) and then an 
example that uses both (8.4.1) and (8.4.2). 
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Computing a“ mod n When k Is a Power of 2 
Find 144* mod 713. 


Solution Use property (8.4.1) to write 144* = (144°. Then 


1444 mod 713 = (1447)* mod 713 
= (144? mod 713} mod 713 


= (20736 mod 713) mod 713 because 144° = 20736 


= 59° mod 713 


= 3481 mod 713 
= 629 


Computing a“ mod n When k Is Not a Power of 
Find 12” mod 713. 


because 20736 mod 713 = 59 


because 59° = 3481 
because 3481 mod 713 = 629. 


2 


Solution First write the exponent as a sum of powers of 2: 


43 = 2+2 +2+1 =32+8+2+1. 


Next compute 12% for k = 0, 1, 2, 3, 4, and 5. 


12 mod 713 = 12 
12? mod 713 = 144 

12° mod 713 = 144°mod 713 = 59 
12° mod 713 = 59° mod 713 = 629 
12'° mod 713 = 629° mod 713 = 639 
12° mod 713 = 639° mod 713 = 485 


By property (8.4.2), 
1243 = 1282+8+2+1 — 4932 


Thus, by Corollary 8.4.4, 
12° mod 713 


= {(12 mod 713): (128 mod 713): (12? mod 713): (12 mod 713)} mod 713. 


By substitution, 


by Example 8.4.4 
by Example 8.4.4 
by the method of Example 8.4.4 
by the method of Example 8.4.4. 


-128-122-121 


12} mod 713 = (485-629-144: 12) mod 713 
= 527152320 mod 713 


= 48. 


531 


You should know how to do the computations in Example 8.4.5 by hand using only a 
simple electronic calculator, but if you are computing a lot of residues, especially ones 
involving large numbers, you may want to write a short computer or calculator program to 


do the computations for you. 


Extending the Euclidean Algorithm 


An extended version of the Euclidean algorithm can be used to find a concrete expression 


for the greatest common divisor of integers a and b. 
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An integer d is said to be a linear combination of integers a and b if, and only if, 
there exist integers s and ¢ such that as + bt = d. 


Theorem 8.4.5 Writing a Greatest Common Divisor as a Linear Combination 


For all integers a and b, not both zero, if d = gcd(a, b), then there exist integers s and 
t such that as + bt = d. 


Proof: Given integers a and b, not both zero, and given d = gcd(a, b), let 
S = {x | x is a positive integer and x = as + bt for some integers s and f}. 


Note that S is a nonempty set because (1) if a > 0 then 1-a+0-b ES, (2) ifa<0 
then (—1)-a+0-b E S, and (3) if a = 0 then, by assumption, b # 0, and hence 
0-a+1-b€S or0-a+(—1)-b E S. Thus, because S$ is a nonempty subset of posi- 
tive integers, by the well-ordering principle for the integers there is a least element c 
in S. By definition of S, 


c=as+bt for some integers s and t. 8.4.3 


We will show that (1) c = d, and (2) c S d, and we will therefore be able to conclude 
that c = d = gcd(a, b). 


(1) Proof that c = d: 

[In this part of the proof, we show that d is a divisor of c and thus that d = c.] Be- 
cause d = gcd(a, b), by definition of greatest common divisor, d | a and d | b. Hence 
a = dx and b = dy for some integers x and y. Then 


c=as+bt by (8.4.3) 
(dx)s + (dy)t by substitution 


= d(xs + y) by factoring out the d. 


Now xs + yt is an integer because it is a sum of products of integers. Thus, by defi- 
nition of divisibility, d | c. Both c and d are positive, and hence, by Theorem 4.4.1, 
c2d. 


(2) Proof that c Sd: 

[In this part of the proof, we show that c is a divisor of both a and b and therefore that 
c is less than or equal to the greatest common divisor of a and b, which is d.] Apply the 
quotient-remainder theorem to the division of a by c to obtain 


a=cq+r forsome integers q andr with 0 Sr < c. 8.4.4 
Thus for some integers q and r with 0 S r < c, 
r=a-cq. 
Now c = as + bt. Therefore, for some integers q and r with 0 S r < c, 


r=a-— (as + bt)q by substitution. 
= a(l — sq) — btq 
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Thus r is a linear combination of a and b. If r > 0, then r would be in S, and so r 
would be a smaller element of S than c, which would contradict the fact that c is the 
least element of S. Hence r = 0. By substitution into (8.4.4), 


a=cq 


and therefore c | a. 

An almost identical argument establishes that c | b and is left as exercise 30 at the 
end of the section. 

Because c | a and c | b, c is a common divisor of a and b. Hence c is less than or 
equal to the greatest common divisor of a and b. In other words, c = d. 

From (1) and (2), we conclude that c = d. It follows that d, the greatest common 
divisor of a and b, is equal to as + bt. 


The following example shows a practical method for expressing the greatest common 
divisor of two integers as a linear combination of the two. 


Cl kt Expressing a Greatest Common Divisor as a Linear Combination 


In Example 4.10.6 we showed how to use the Euclidean algorithm to find that the great- 
est common divisor of 330 and 156 is 6. Use the results of those calculations to express 
gcd(330, 156) as a linear combination of 330 and 156. 


Solution The first four steps of the solution restate and extend results from Example 
4.10.6, which were obtained by successive applications of the quotient-remainder theorem. 
The fifth step shows how to find the coefficients of the linear combination by substituting 
back through the results of the previous steps. 


Step 1: 330 = 156-2 + 18, which implies that 18 = 330 — 156-2. 
Step 2: 156 = 18-8 + 12, which implies that 12 = 156 — 18-8. 
Step 3: 18 = 12-1 + 6, which implies that 6 = 18 — 12-1. 

Step 4: 12 = 6-2 +0, which implies that gcd(330, 156) = 6. 
Step 5: By substituting back through steps 3 to 1: 


6 = 18= 12-1 from step 3 
= 18—-(156-—8-18)-1 by substitution from step 2 
= 9-18+(-—1)-156 by algebra 
= 9-(330 — 156-2) + (—1)-156 by substitution from step 1 
= 9-330 +(—19)-156 by algebra. 


Thus gcd(330, 156) = 9-330 + (— 19): 156. (It is always a good idea to check the result 
of a calculation like this to be sure you did not make a mistake. In this case, you find that 
9-330 + (—19)-156 does indeed equal 6.) B 


The Euclidean algorithm given in Section 4.10 can be adapted so as to compute the coef- 
ficients of the linear combination of the gcd at the same time as it computes the gcd itself. 
This extended Euclidean algorithm is described in the exercises at the end of the section. 
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Finding an Inverse Modulo n 
Suppose you want to solve the following congruence for x: 
2x = 3 (mod 5). 


Note that 3-2 = 6 = 1 (mod 5). So 3 appears to be a kind of inverse for 2 modulo 5. Thus 
multiplying both sides of the congruence to be solved by 3 might give a solution for x. 
When we try this, we obtain 


6x = 3:2x = 3-3 (mod 5) = 9 (mod 5) = 4 (mod 5). 
Now since 6 = | (mod 5), Theorem 8.4.3(3) implies that 
6x = 1-x (mod 5) = x (mod 5). 


So, 6x = 4 (mod 5) and 6x = x (mod 5), and, hence, by the symmetric and transitive prop- 
erties of modular congruence, 


x = 4 (mod 5). 


Therefore, a solution is x = 4. (You can check that 2-4 = 8 = 3 (mod 5).) 


Definition 


Given any integer a and any positive integer n, if there exists an integer s such that 
as = | (mod n), then s is called an inverse for a modulo n. 


Unfortunately, the method shown above cannot always be used to solve congruences 
because not every integer has an inverse modulo n. For instance, observe that 


2-1 = 2 (mod 4) 
2:2 = 0 (mod 4) 
2:3 =2 (mod 4). 


By Theorem 8.4.3, these calculations suffice to show that the number 2 does not have an 
inverse modulo 4. 

Describing the circumstances in which inverses exist in modular arithmetic requires the 
concept of relative primeness. 


Definition 


Integers a and b are relatively prime if, and only if, gcd(a, b) = 1. Integers a), ao, 
a3, . . . , 4, are pairwise relatively prime if, and only if, gcd(a;, a) = 1 for all inte- 
gers i and j with 1 Si, j S n, andi #/. 


Given the definition of relatively prime integers, the following corollary is an immedi- 
ate consequence of Theorem 8.4.5. 


Corollary 8.4.6 


If a and b are relatively prime integers, then there exist integers s and t such that 
as + bt = 1. 
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Seu eee Expressing 1 as a Linear Combination of Relatively Prime Integers 


Show that 660 and 43 are relatively prime, and find a linear combination of 660 and 43 
that equals 1. 


Solution 
Step 1: Divide 660 by 43 to obtain 660 = 43-15 + 15, which implies that 
15 = 660 — 43-15. 


Step 2: Divide 43 by 15 to obtain 43 = 15-2 + 13, which implies that 13 = 43 — 15-2. 
Step 3: Divide 15 by 13 to obtain 15 = 13-1 + 2, which implies that 2 = 15 — 13. 
Step 4: Divide 13 by 2 to obtain 13 = 2-6 + 1, which implies that 1 = 13 — 2-6. 


Step 5: Divide 2 by 1 to obtain 2 = 1-2 + 0, which implies that gcd(660, 43) = 1 and so 
660 and 43 are relatively prime. 


Step 6: To express 1 as a linear combination of 660 and 43, substitute back through steps 


4to 1: 

1 = 13-2-6 from step 4 
= 13-—(15-13)-6 by substitution from step 3 
= 7:13= 6-15 by algebra 
= 7:-(43—15-2)— 6-15 by substitution from step 2 
= 7-43 — 20:15 by algebra 
= 7-43 — 20: (660 — 43-15) by substitution from step 1 
= 307-43 — 20-660 by algebra. 


Thus gcd(660, 43) = 1 = 307-43 — 20-660. (And a check by direct computation confirms 
that 307-43 — 20-660 does indeed equal 1.) E 


A consequence of Corollary 8.4.6 is that under certain circumstances, it is possible to 
find an inverse for an integer modulo n. 


Corollary 8.4.7 Existence of Inverses Modulo n 


For all integers a and n, if gcd(a, n) = 1, then there exists an integer s such that 
as = 1 (mod n), and so s is an inverse for a modulo n. 


Proof: Suppose a and n are integers and gcd(a, n) = 1. By Corollary 8.4.6, there 
exist integers s and ¢ such that 


as+nt = 1. 


Subtracting nt from both sides gives that 


as = 1-—nt=1+(—dn. 
Thus, by definition of congruence modulo n, 


as = 1 (mod n). 
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Sore Finding an Inverse Modulo n 


a. Find an inverse for 43 modulo 660. That is, find an integer s such that 43s = 1 (mod 660). 


b. Find a positive inverse for 3 modulo 40. That is, find a positive integer s such that 
3s = 1 (mod 40). 


Solution 
a. By Example 8.4.7, 
307-43 — 20-660 = 1. 
Adding 20-660 to both sides gives that 
307-43 = 1 + 20-660. 


Thus, by definition of congruence modulo 660, 
307-43 = 1 (mod 660), 


so 307 is an inverse for 43 modulo 660. 


b. Use the technique of Example 8.4.7 to find a linear combination of 3 and 40 that 
equals 1. 


Step 1: Divide 40 by 3 to obtain 40 = 3-13 + 1. This implies that 1 = 40 — 3-13. 
Step 2: Divide 3 by 1 to obtain 3 = 3-1 + 0. This implies that gcd(3, 40) = 1. 
Step 3: Use the result of step 1 to write 

3-(—13) = 1+(—1)40. 


This result implies that — 13 is an inverse for 3 modulo 40. In other words, 3-(—13) = 
1 (mod 40). To find a positive inverse, compute 40 — 13. The result is 27, and 


27 = —13 (mod 40) 
because 27 — (— 13) = 40. So, by Theorem 8.4.3(3), 
3-27 = 3-(—13) = 1 (mod 40), 


and thus by the transitive property of congruence modulo n, 27 is a positive integer that is 
an inverse for 3 modulo 40. E 


RSA Cryptography 


At this point we have developed enough number theory to explain how to encrypt and de- 
crypt messages using the RSA cipher. The effectiveness of the system is based on the fact 
that although modern computer algorithms make it quite easy to find two distinct large in- 
tegers p and g—say on the order of several hundred digits each—that are virtually certain 
to be prime, even the fastest computers are not currently able to factor their product, an 
integer with approximately twice that many digits. In order to encrypt a message using the 
RSA cipher, a person needs to know the value of pq and of another integer e, both of which 
are made publicly available. But only a person who knows the individual values of p and q 
can decrypt an encrypted message. 

We first give an example to show how the cipher works and then discuss some of 
the theory to explain why it works. The example is unrealistic in the sense that because 
p and q are so small, it would be easy to figure out what they are just by knowing 
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their product. But working with small numbers conveys the idea of the system, while 
keeping the computations in a range that can be performed with a hand calculator. 

Suppose Alice decides to set up an RSA cipher. She chooses two prime numbers—say, 
p = 5 and q = 11—and computes pg = 55. She then chooses a positive integer e that is 
relatively prime to (p— 1)(q— 1). In this case, (p — 1)(g— 1) = 4:10 = 40, so she may 
take e = 3 because 3 is relatively prime to 40. (In practice, taking e to be small could 
compromise the secrecy of the cipher, so she would take a larger number than 3. However, 
the mathematics of the cipher works as well for 3 as for a larger number, and the smaller 
number makes for easier calculations.) 

The number pair (pq, e) is Alice’s public key, which she may distribute widely. Because 
the RSA cipher works only on numbers, Alice also informs people how she will interpret 
the numbers in the messages they send her. Let us suppose that she encodes letters of the 
alphabet in a similar way as was done for the Caesar cipher: 


A = 01, B = 02, C = 03,...,Z= 26. 


Let us also assume that the messages Alice receives consist of blocks, each of which, for 
simplicity, is taken to be a single, numerically encoded letter of the alphabet. 

Someone who wants to send Alice a message breaks the message into blocks, each con- 
sisting of a single letter, and finds the numeric equivalent for each block. The plaintext, M, 
in a block is converted into ciphertext, C, according to the following formula: 


C = M mod pq. 8.4.5 


Note that because (pq, e) is the public key, anyone who has it and knows modular arithme- 
tic can encrypt a message to send to Alice. 


Encrypting a Message Using RSA Cryptography 


Bob wants to send Alice the message HI. What is the ciphertext for his message? 


Solution Bob will send his message in two blocks, one for the H and another for the I. 
Because H is the eighth letter in the alphabet, it is encoded as 08, or 8. The corresponding 
ciphertext is computed using formula 8.4.5 as follows: 
C = 8 mod 55 
= 512 mod 55 
= 17. 
Because I is the ninth letter in the alphabet, it is encoded as 09, or 9. The corresponding 
ciphertext is 
C = 9 mod 55 
= 729 mod 55 
= 14. 
Accordingly, Bob sends Alice the message: 17 14. E 


To decrypt the message, the decryption key must be computed. It is a number d that is a 
positive inverse to e modulo (p — 1)(q — 1). The plaintext M is obtained from the ciphertext 
C by the formula 


M = C’ mod pq, where the number pair (pq, d) is Alice’s private key. 8.4.6 
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Note that because M + kpq = M (mod pq), M must be taken to be less than pq, as in 
the above example, in order for the decryption to be guaranteed to produce the original 
message. But because p and q are normally taken to be so large, this requirement does not 
cause problems. Long messages are broken into blocks of symbols to meet the restriction 
and several symbols are included in each block to prevent decryption based on knowledge 
of letter frequencies. 


Decrypting a Message Using RSA Cryptography 


Imagine that Alice has hired you to help her decrypt messages and has shared with you the 
values of p and q. Compute Alice’s private key (pq, d) and use the formula M = C’ mod Pq 
to decrypt the following ciphertext for her: 17 14. 


Solution Because p = 5 and q = 11, (p — 1)(q — 1) = 40, the decryption key d is a posi- 
tive inverse for 3 modulo 40. Knowing that you would need this number, we computed it 
in Example 8.4.8(b) and found it to be 27. Thus to decrypt the ciphertext 17, you need to 
compute 


M = 17! mod pq = 17” mod 55. 
To do so, note that 
27 = 16+8+2+1. 
Next, find the residues obtained when 17 is raised to successively higher powers of 2, up 
to 2" = 16: 
17 mod 55 = 17 mod 55 =17 
17° mod 55 = 17 mod 55 = 14 
17f mod 55 = (17°)? mod 55 = 14° mod 55 = 31 
17° mod 55 = (17*)” mod 55 = 31° mod 55 = 26 
17'° mod 55 = (178° mod 55 = 26° mod 55 = 16 


Then use the fact that 
177 = 1716+8+2+1 = 1716. 178- 172- 17! 


to write 


177 mod 55 = (17'°- 178 177 17) mod 55 


= [(17° mod 55)(17° mod 55)(17? mod 55)(17 mod 55)] (mod 55) 
by Corollary 8.4.4 


= (16:26-14:17) (mod 55) 

= 99008 (mod 55) 

= 8 (mod 55). 
Hence 177’ mod 55 = 8, and thus the plaintext of the first part of Bob’s message is 8, or 08. 
In the last step, you find the letter corresponding to 08, which is H. In exercises 14 and 15 


at the end of this section, you are asked to show that when you decrypt 14, the result is 9, 
which corresponds to the letter I, so you can tell Alice that Bob’s message is HI. E 


Figure 8.4.1 illustrates the process of sending and receiving a message using RSA 
cryptography. 
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Bob creates a Alice reads Bob’s 

plaintext block M. plaintext block M. 

Bob uses Alice’s Alice uses her 

public key to Bob sends encrypted private key to 
|___+-| A l 

encrypt M: block C to Alice. decrypt C: 

C = M° mod pq. M = C4 mod pq. 


FIGURE 8.4.1 Using RSA cryptography 


Euclid’s Lemma 


Another consequence of Theorem 8.4.5 is known as Euclid’s lemma. It is the crucial fact 
behind the unique factorization theorem for the integers and is also of great importance in 
many other parts of number theory. 


Theorem 8.4.8 Euclid’s Lemma 


For all integers a, b, and c, if gcd(a, c) = 1 anda | bc, then a | b. 


Proof: Suppose a, b, and c are integers, gcd(a, c) = 1, and a | bc. [We must show that 
a|b.] By Theorem 8.4.5, there exist integers s and ż so that 


as+ct= 1. 


Multiply both sides of this equation by b to obtain 


bas + bct = b. 8.4.7 
Since a| bc, by definition of divisibility there exists an integer k such that 
bc = ak. 8.4.8 


Substituting (8.4.8) into (8.4.7), rewriting, and factoring out an a gives that 
b = bas + (ak)t = a(bs + kt). 


Let r = bs + kt. Then r is an integer (because b, s, k, and t are all integers), and 
b = ar. Thus a|b by definition of divisibility. 


The unique factorization theorem for the integers states that any integer greater than 1 
has a unique representation as a product of prime numbers, except possibly for the order in 
which the numbers are written. The hint for exercise 13 of Section 5.4 outlined a proof of the 
existence part of the proof, and the uniqueness of the representation follows quickly from Eu- 
clid’s lemma. In exercise 41 at the end of this section, we outline a proof for you to complete. 

Another application of Euclid’s lemma is a cancellation theorem for congruence modulo 
n. This theorem allows us—under certain circumstances—to divide out a common factor 
in a congruence relation. 


Theorem 8.4.9 Cancellation Theorem for Modular Congruence 


For all integers a, b, c, and n with n > 1, if ged(c, n) = 1 and ac = bc (mod n), then 
a = b (mod n). 


(continued on page 540) 
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Proof: Suppose a, b, c, and n are any integers, gcd(c, n) = 1, and ac = bc (mod n). 
[We must show that a = b (mod n).] By definition of congruence modulo n, 

n | (ac — bc), 
and so, since ac — be = (a—b)ec, 
then 

n | (a— b)c. 
Because gcd(c, n) = 1, we may apply Euclid’s lemma to obtain 

n|(a—b), 
and so, by definition of congruence modulo n, 
a = b (mod n). 


An alternative proof for Theorem 8.4.9 uses Corollary 8.4.7. Because gcd(c, n) = 1, the 
corollary guarantees an inverse for c modulo n. In the proof of Theorem 8.4.9, let d denote 
an inverse for c. Apply Theorem 8.4.3(3) repeatedly, first to multiply both sides of ac = bc 
(mod n) by d to obtain (ac)d = (bd)d (mod n), and then to use the fact that cd = 1 (mod n) 
to simplify the congruence and conclude that a = b (mod n). 


Fermat's Little Theorem 


Fermat’s little theorem was given that name to distinguish it from Fermat’s last theo- 
rem, which we discussed in Section 4.1. It provides the theoretical underpinning for RSA 


cryptography. 


Theorem 8.4.10 Fermat’s Little Theorem 


If p is any prime number and a is any integer such that p | a, then a” ~ ' = 1 (mod p). 


Proof: Suppose p is any prime number and a is any integer such that p f a. Note that 
a + 0 because otherwise p would divide a. Consider the set of integers 


S = {a, 2a, 3a,...,(p— l)a}. 


We claim that no two elements of S are congruent modulo p. For suppose sa = 
ra (mod p) for some integers s and r with 1 = r < s S p — 1. Then, by definition of 
congruence modulo p, 


p|(sa—ra), or, equivalently, p |(s— rna. 


Now p | a by hypothesis, and because p is prime, gcd(a, p) = 1. Thus, by Euclid’s 
lemma, p | (s — r). But this is impossible because 0 < s — r < p. 

Consider the function F from S to the set T = {1, 2, 3,... , (p — 1)} that sends 
each element of S to its residue modulo p. Then F is one-to-one because no two ele- 
ments of S are congruent modulo p. In Section 9.4 we prove that if a function from 
one finite set to another is one-to-one, then it is also onto. Hence F is onto, and so the 
p — 1 residues of the p — 1 elements of S are exactly the numbers 1, 2, 3,...,(p — 1). 
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It follows by Theorem 8.4.3(3) that 
a:2a:3a:+:(p— Da =[1-2-3---(p— 1] (modp), 
or, equivalently, 
a’-'(p—1)! = (p— 1)! (mod p). 


Now because p is prime, p and (p — 1)! are relatively prime. Thus, by the cancella- 
tion theorem for modular congruence (Theorem 8.4.9), 


a’~' = | (mod p). 


Why Does the RSA Cipher Work? 
For the RSA cryptography method, the formula 
M = C’ mod Pq 


is supposed to produce the original plaintext message, M, when the encrypted message is 
C. How can we be sure that it always does so? Recall that we require that M < pq, and we 
know that C = M° mod pq. So, by substitution, 


ce mod pq = (M° mod pq” mod pq. 
By Theorem 8.4.3(4), 
(M° mod pq)! = M“ (mod pq). 
Thus C“ mod pq=M ed (mod pq), and so it suffices to show that 
M = M” (mod pq). 


Recall that d was chosen to be a positive inverse for e modulo (p — 1)(q — 1), which ex- 
ists because gcd(e, (p — 1)(q — 1)) = 1. In other words, 


ed = 1 (mod (p = 1)(q— 1), 
or, equivalently, 
ed = 1+k(p—1)(q-—1) for some positive integer k. 
Therefore, 
M“! = MItke-D@-D = M(M*P7 007D), 

If p | M, then by Fermat’s little theorem, M?”™! = 1 (mod p), and so 

M“ = MMDD) = MMT HETO = MATÀ (mod p) = M (mod p). 
Similarly, if q / M, then by Fermat’s little theorem, MT! = 1 (mod q), and so 

M“ = M(M*®7 04D) = MME !/KeP-D) = My?) (mod q) = M (mod q). 
Thus, if M is relatively prime to pq, 

M“ =M (mod p) and M“ =M (mod q). 
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If M is not relatively prime to pg, then either p| M or q |M. If p|M, then M“ =0= 
M (mod p), and if q | M, then M“ = 0 = M (mod Q). If q | M, then as above, M“ = M (mod Q), 
and if p Ii M, then as above, M ed — M (mod p). Thus, in all cases, 


M“=M(modp) and M = M (mod q). 
By Theorem 8.4.1, 
p|M“-M) and q|(M“—M), 
and, by definition of divisibility, 
M“! — M = pt for some integer t. 
By substitution, q| pt, 
and since q and p are distinct prime numbers, Euclid’s lemma applies to give 


q|t. 
Thus t= qu_ for some integer u 
by definition of divisibility. By substitution, 
M-M“ = pt = p(qu) = (pq)u, 


where u is an integer, and so, 


pq|(M-M“) 
by definition of divisibility. Thus 
M- M“ = 0 (mod pq) 
by definition of congruence, or, equivalently, 
M = M” (mod pq). 
Because M < pq, this last congruence implies that 
M = M” mod pq, 


and thus the RSA cipher gives the correct result. 


Message Authentication 


In some Internet networks, such as those that use blockchain technology, individuals have 
their own public keys but their actual identities are private. Suppose Alice and Bob are part 
of the network, so each knows the other’s public key. Alice wants to send Bob a message, 
but she has a problem. She wants to make sure that Bob will know that the message really 
is from her. After all everyone in the network has Bob’s public key, and so someone else 
could send Bob a message pretending that it is from Alice. 

The solution follows from the fact that in the formula M = M“! mod pq, e and d are 
interchangeable. Alice can use her private key (pq, d) and the formula 


M = M“ mod pq 


to encrypt some identifying information showing that she is the person she claims to be. 
Then she adds the encrypted information to her plaintext message for Bob and uses Bob’s 
public key to encrypt the communication as a whole. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


8.4 MODULAR ARITHMETIC WITH APPLICATIONS TO CRYPTOGRAPHY 543 


When Bob receives the communication, he uses his private key to decrypt it. He discov- 
ers that part is plaintext, claiming to be from Alice, and another part is encrypted. So he 
uses Alice’s public key to decrypt the encrypted part and finds Alice’s identifying infor- 
mation. He then knows the message is authentic because only Alice has the private key 
needed to perform the encryption. 

This process of message authentication is illustrated in Figure 8.4.2. 


Alice creates message A Bob reads the clear text in 
in plaintext; uses her message A, knowing that it 
private key to encrypt is from Alice because only 
her ID information; and Alice has the private key to 
adds it to A. encrypt her ID information. 


Bob uses his 
private key to 
Alice uses Bob’s 3 deciypt A, Thon 

. Alice sends encrypted he uses Alice’s 
public key to : 

message A to Bob. public key to 

encrypt A. aoa 
decrypt Alice’s 


enclosed ID 
information. 


FIGURE 8.4.2 Message authentication 


Additional Remarks on Number Theory and Cryptography 


The famous British mathematician G. H. Hardy (1877-1947) was fond of comparing the 
beauty of pure mathematics, especially number theory, to the beauty of art. Indeed, the 
theorems in this section have many beautiful and striking consequences beyond those 
we have had the space to describe, and the subject of number theory extends far beyond 
these theorems. Hardy also enjoyed describing pure mathematics as useless. Hence it is 
ironic that there are now whole books devoted to applications of number theory to com- 
puter science, RSA cryptography being just one such application. Furthermore, as the need 
for public-key cryptography has developed, techniques from other areas of mathematics, 
such as abstract algebra and algebraic geometry, have been used to develop additional 
cryptosystems. 


TEST YOURSELF 


1. When letters of the alphabet are encrypted using 4. Ifa,n, and k are positive integers with n > 1, an 
the Caesar cipher, the encrypted version of a letter efficient way to compute a‘ (mod n) is to write 
is kasa and use the facts about computing 


2. If a, b, and n are integers with n > 1, all of the fol- prods ane powers modulon. 


lowing are different ways to express the fact that 5. To express a greatest common divisor of two in- 
n | (a—b): ; : : : tegers as a linear combination of the integers, use 
: ‘ the extended version of the algorithm. 
3. Ifa, b,c, d, m, and n are integers with n > 1 8 
and if a = c (mod n) and b = d (mod n), then 6. To find an inverse for a positive integer a modulo 
a+b= ,a-b= , ab = , and an integer n with n > 1, you express the number 1 
m 
a; = as 
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7. To encrypt a message M using RSA cryptography 10. The crux of the proof that the RSA cipher works is 
with public key pq and e, you use the formula that if (1) p and q are distinct large prime num- 
______, and to decrypt a message C, you use the bers, (2) M < pq, (3) M is relatively prime to pq, 
formula , where (4) e is relatively prime to (p — 1)(q — 1), and 


(5) dis a positive inverse for e modulo 


8. Euclid’s lemma says that for all integers a, b, and c C=-Du-D them = 


if gcd(a, c) = 1 and a | bc, then ___. 


9. Fermat’s little theorem says that if p is any prime 
number and a is any integer such that p | a, then —. 


EXERCISE SET 8.4 


1. a. Use the Caesar cipher to encrypt the message 7. Verify the following statements. 
WHERE SHALL WE MEET. a. 128 = 2 (mod 7) and 61 = 5 (mod 7) 

b. Use the Caesar cipher to decrypt the message b. (128 +61) = (2 + 5) (mod 7) 

LQ WKH FDIHWHULD. c. (128 — 61) = (2 — 5) (mod 7) 


d. (128-61) = (2:5) (mod 7) 


2. a. Use the Caesar cipher to encrypt the message e. 128? = 2? (mod 7) 


AN APPLE A DAY. 
b. Use the Caesar cipher to decrypt the message 8. Verify the following statements. 
NHHSV WKH GRFWRU DZDB. a. 45 = 3 (mod 6) and 104 = 2 (mod 6) 


b. (45 + 104) = (3 + 2) (mod 6) 
c. (45 — 104) = (3 — 2) (mod 6) 


b. Explain why 25 = 19 (mod 3). a o ra a A (mode 
c. What value of k has the property that ` 7 
25 = 19 +3k? In 9-11, prove each of the given statements, assuming 
that a, b, c, d, and n are integers with n > 1 and that 
= c (mod n) and b = d (mod n). 


3. Leta = 25, b = 19, and n = 3. 
a. Verify that 3 | (25 — 19). 


d. What is the (nonnegative) remainder obtained 
when 25 is divided by 3? When 19 is divided 


by 3? 9. a. (a+b) = (c + d) (mod n) 
e. Explain why 25 mod 3 = 19 mod 3. b. (a— b) = (c — d) (mod n) 
4. Leta = 68, b = 33, and n = 7. 10. a’ = č (mod n) 


a. Verify that 7 | (68 — 33). 
b. Explain why 68 = 33 (mod 7). 
c. What value of k has the property that 


11. a” = c” (mod n) for every integer m = 1 (Use 


mathematical induction on m.) 


68 = 33 +7k? 12. a. Prove that for every integer n = 0, 10” = 

d. What is the (nonnegative) remainder obtained 1 (mod 9). 
when 68 is divided by 7? When 33 is divided b. Use part (a) to prove that a positive integer is 
by 7? divisible by 9 if, and only if, the sum of its 

e. Explain why 68 mod 7 = 33 mod 7. digits is divisible by 9. 

5. Prove the transitivity of modular congruence. That 13. a. Prove that for every integer n = 1, 

is, prove that for all integers a, b, c, and n with 10” = (—1)” (mod 11). 

n> 1,if a= b (mod n) and b = c (mod n) then b. Use part (a) to prove that a positive integer 

a = c (mod n). is divisible by 11 if, and only if, the alternat- 


ing sum of its digits is divisible by 11. (For 
instance, the alternating sum of the digits 
of 82,379 is 8—2 +3-—7+9 = 11 and 
82,379 = 11-7489.) 


H 6. Prove that the distinct equivalence classes of the 
relation of congruence modulo n are the sets [0], 
[1], [2],..., [2 — 1], where for each a = 0, 1, 2,..., 
n=l, 

14. Use the technique of Example 8.4.4 to find 14 


[a] = {m E Z| m= a (mod n)}. mod 55, 144 mod 55, 14° mod 55, and 14'° mod 55. 
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15. Use the result of exercise 14 and the technique of 
Example 8.4.5 to find 14°’ mod 55. 


In 16-18, use the techniques of Example 8.4.4 and 
Example 8.4.5 to find the given numbers. 


16. 675°” mod 713 17. 89” mod 713 


18. 48°” mod 713 


In 19-24, use the RSA cipher from Examples 8.4.9 and 
8.4.10. In 19-21, translate the message into its numeric 
equivalent and encrypt it. In 22-24, decrypt the cipher- 
text and translate the result into letters of the alphabet to 
discover the message. 


19. HELLO 20. WELCOME 21. EXCELLENT 


22. 13202009 23. 0805 15 24. 51 14 49 15 


H 25. Use Theorem 5.2.2 to prove that if a and n are 


positive integers and a” — 1 is prime, then a = 2 
and n is prime. 
In 26 and 27, use the extended Euclidean algorithm to find 
the greatest common divisor of the given numbers and 
express it as a linear combination of the two numbers. 


26. 6664 and 765 27. 4158 and 1568 


Exercises 28 and 29 refer to the following formal version 
of the extended Euclidean algorithm. 


Algorithm 8.4.1 Extended Euclidean Algorithm 

[Given integers A and B with A > B > 0, this algorithm 

computes gcd(A, B) and finds integers s and t such that 

sA + tB = gced(A, B).] 

Input: A, B [integers with A > B > 0] 

Algorithm Body: 
a:=A,b:=B,s:=1,t:=0,u:=0,v:= 1, 
[pre-condition: a = sA + tB and b = uA + vB] 
while (b # 0) 


[loop invariant: a = sA + tB and b = uA + vB, 
gcd(a, b) = gced(A, B)/ 
r:=amodb,q:=adivb 

a:=b,b:=r 


newu := s — uq, newv := t— vq 


Si=u,ti=v 
u := newu, v := newv 
end while 
gcd:=a 
[post-condition: gcd(A, B) = a = sA + tB] 
Output: gcd/a positive integer], s, t [integers] 


In 28 and 29, for the given values of A and B, make a table 
showing the value of s, t, and sA + tB before the start of 
the while loop and after each iteration of the loop. 


28. A = 330, B = 156 29. A = 284, B = 168 


30. Finish the proof of Theorem 8.4.5 by proving that 
if a, b, and c are as in the proof, then c|b. 


31. a. Find an inverse for 210 modulo 13. 
b. Find a positive inverse for 210 modulo 13. 
c. Find a positive solution for the congruence 
210x = 8 (mod 13). 


32. a. Find an inverse for 41 modulo 660. 
b. Find the least positive solution for the follow- 
ing congruence: 41x = 125 (mod 660). 


H 33. Use Theorem 8.4.5 to prove that for all integers a, 


b, and c, if gcd(a, b) = 1 and a | c and b | c, then 
ab|c. 


34. Give a counterexample to show that the state- 
ment of exercise 33 is false if the hypothesis that 
gcd(a, b) = 1 is removed. 


35. Corollary 8.4.7 guarantees the existence of an 
inverse modulo n for an integer a when a and n are 
relatively prime. Use Euclid’s lemma to prove that 
the inverse is unique modulo n. In other words, 
show that if s and f are any two integers whose 
product with a is congruent to 1 modulo n, then s 
and ¢ are congruent to each other modulo n. 

In 36, 37, 39, and 40, use the RSA cipher with public key 

n = 713 = 23-31ande = 43. In 36 and 37, encode the mes- 

sages into their numeric equivalents and encrypt them. 

In 39 and 40, decrypt the given ciphertext and find the 

original messages. 


36. HELP 37. COME 
38. Find the least positive inverse for 43 modulo 660. 
39. 675 089 089 048 


40. 028 018 675 129 


H 41. a. Use mathematical induction and Euclid’s 


lemma to prove that for every positive integer 
s, if p and q4, qo,---, Gs are prime numbers 
and p | q\q2°** qy then p = q; for some i with 
Ls, 

b. The uniqueness part of the unique factorization 
theorem for the integers says that given any 
integer n, if 


n = piP2` `` Pr = 192°" As 
for some positive integers r and s and prime num- 
bers pj = p St: =p,andq, = q =" = 4s, 
then r = s and p; = q; for every integer i with 
l=i=r. 
Use the result of part (a) to fill in the details 
of the following sketch of a proof: Suppose 
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that n is an integer with two different prime then a? | = 1 (mod p). Verify that this theorem 
factorizations: n = pi Po*** Pr = 192°" * Que gives correct results for the following: 
All the prime factors that appear on both sides a. a= 15andp=7 b. a =8andp=11 


can be cancelled (as many times as they appear 
on both sides) to arrive at the situation where 
PiP2` `° Pr = 9192" °° Is) Pi SPRS: S Pr 
qı Eq S: = qs, and p; # q; for any integers 
i and j. Then use part (a) to deduce a contradic- 
tion, and conclude that the prime factorization 
of n is unique except, possibly, for the order in 
which the prime factors are written. 


43. Fermat’s little theorem can be used to show 
that a number is not prime by finding a number 
a relatively prime to p with the property that 
a’? '#] (mod p). However, it cannot be used to 
show that a number is prime. Find an example 
to illustrate this fact. That is, find integers a 
and p such that a and p are relatively prime and 
a?~' = 1 (mod p) but p is not prime. 

42. According to Fermat’s little theorem, if p is a 
prime number and a and p are relatively prime, 


ANSWERS FOR TEST YOURSELF 


1. the letter that is three places in the alphabet to the right of (cd) (mod n); c” (mod n) 4. sum of powers of 2 

the given letter, with X wrapped around to A, Y to B, and Z to 5. Euclidean 6. a linear combination of a and n 

C 2.a=b(modn); a = b + kn for some integer k; a and b 7. C = M mod pq; M = em mod pq; d is a positive inverse 
have the same nonnegative remainder when divided by n; fore modulo (p—1)(q-1) 8.a|b 9. a?' = 1 (mod p) 
amodn = bmodn 3.(c+d) (mod n); (c — d) (mod n); 10. M mod Pq 


EJ Partial Order Relations 


There is no branch of mathematics, however abstract, which may not some day be 
applied to phenomena of the real world. —Nicolai Ivanovitch Lobachevsky, 1792-1856 


In order to obtain a degree in computer science at a certain university, a student must take 
a specified set of required courses, some of which must be completed before others can 
be started. Given the prerequisite structure of the program, one might ask what is the least 
number of school terms needed to fulfill the degree requirements, or what is the maximum 
number of courses that can be taken in the same term, or whether there is a sequence in 
which a part-time student can take the courses one per term. Later in this section, we will 
show how representing the prerequisite structure of the program as a partial order relation 
makes it relatively easy to answer such questions. 


Antisymmetry 


In Section 8.2 we defined three properties of relations: reflexivity, symmetry, and transitiv- 
ity. A fourth property of relations is called antisymmetry. In terms of the arrow diagram of 
a relation, saying that a relation is antisymmetric is the same as saying that whenever there 
is an arrow going from one element to another distinct element, there is not an arrow going 
back from the second to the first. 


Definition 


Let R be a relation on a set A. R is antisymmetric if, and only if, 


for every a and bin A, ifaRbandbRathena=b. 
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By taking the negation of the definition, you can see that a relation R is not antisym- 
metric if, and only if, 


there are elements a and b in A such thata Rb and b Ra buta =£ b. 


Testing for Antisymmetry of Finite Relations 


Let R, and R, be the relations on {0, 1, 2} defined as follows: Draw the directed graphs for 
R; and R, and indicate which relations are antisymmetric. 


a. R, = {(0, 2), (1, 2), (2, 0)} b. Ry = {(0, 0), (0, D, (0, 2), (d, D, (1, 2)} 


Solution 
a. Rj is not antisymmetric. 


0 1 Since 0 R; 2 and 2 R; 0 but O # 2, 
R; is not antisymmetric. 


b. R, is antisymmetric. 


In order for R, not to be antisymmetric, there 
>) would have to exist a pair of distinct elements 
0 1 of A such that each is related to the other 

by R,. But you can see by inspection that 
no such pair exists. 


Testing for Antisymmetry of “Divides” Relations 


Let R} be the “divides” relation on the set of all positive integers, and let R, be the “divides” 
relation on the set of all integers. 


For everya,b E Z", aRjb & ajb. 
For everya,b EZ, aR, b & alb. 


a. Is Rj antisymmetric? Prove or give a counterexample. 
b. Is R, antisymmetric? Prove or give a counterexample. 
Solution 

a. Ry is antisymmetric. 


Proof: Suppose a and b are positive integers such that a R; b and b R; a. [We must 
show that a = b.] By definition of R4, a | b and b | a. Thus, by definition of divides, 
there are integers kı and k, with b = ka and a = kyb. It follows that 


b = kja = ki(kb) = (kk). 


Dividing both sides by b gives 
kika =1. 
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Now since a and b are both integers, kı and k, are both positive integers also. And the 
only product of two positive integers that equals | is 1-1. Thus 


k=k=1 
and so a=kb=1-b=b 
[as was to be shown]. 
b. Ris not antisymmetric. 
Counterexample: 
Let a = 2 and b = —2. Then a | b [since —2 = (—1)-2] and bla [since 2 = (—1)(—2)]. 
Hence a R, band b R, a but a +£ b. | 


Example 8.5.2 illustrates the fact that a relation may be antisymmetric on a subset of a 
set but not antisymmetric on the set itself. 


Partial Order Relations 


A relation that is reflexive, antisymmetric, and transitive is called a partial order. 


Definition 


Let R be a relation defined on a set A. R is a partial order relation if, and only if, R 
is reflexive, antisymmetric, and transitive. 


Two fundamental partial order relations are the “less than or equal to” relation on a set 
of real numbers and the “subset” relation on a set of sets. These can be thought of as mod- 
els, or paradigms, for general partial order relations. 


Sen eee The “Subset” Relation 


Let & be any collection of sets and define the “subset” relation, C, on 4 as follows: For 


every U, VE A, 
UCV & foreach x, ifx E Uthenx €E V. 


By an argument almost identical to that of the solution for exercise 23 of Section 8.2, C 
is reflexive and transitive. Finish the proof that C is a partial order relation by proving 
that C is antisymmetric. 


Solution For C to be antisymmetric means that for all sets U and Vin 4, if U C V and 
V C U then U = V. This is true by definition of equality of sets. E 


A “Divides” Relation on a Set of Positive Integers 


Let | be the “divides” relation on a set A of positive integers. That is, for all a and b in A, 


a|b <œ b= ka for some integer k. 


Prove that | is a partial order relation on A. 
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Solution 
| is reflexive: [We must show that for each a E A, a | a.] Suppose a € A. Then a = 1-a, so 
a| a by definition of divisibility. 


| is antisymmetric: [We must show that for every a, b E A, ifa | b and b | a then a = b.] The 
proof of this is virtually identical to that of Example 8.5.2(a). 


| is transitive: To show transitivity means to show that for every a, b, c E A, if a | b and 
b |c then a |c. But this is the transitivity of divisibility property, which was proved as 
Theorem 4.4.3. 

Since | is reflexive, antisymmetric, and transitive, 


The “Less Than or Equal to” Relation 


Let S be a set of real numbers and define the “less than or equal to” relation, =, on S as 
follows: For all real numbers x and y in S, 


XxSy © x<yorx=y. 


Show that = is a partial order relation. 


is a partial order relation on A. E 


Solution 


< is reflexive: For < to be reflexive means that x < x for every real number x in S. But 
x S x means that x < x or x = x, and x = x is always true. 


< is antisymmetric: For = to be antisymmetric means that for all real numbers x and y in 
S, if x = y and y = x then x = y. This follows immediately from the definition of = and 
the trichotomy property (see Appendix A, T17), which says that given any real numbers 
x and y, exactly one of the following holds: x < y or x = y or x > y. 


< is transitive: For < to be transitive means that for all real numbers x, y, and z in S if 
x S yand y S z then x < z. This follows from the definition of = and the transitivity prop- 
erty of order (see Appendix A, T18), which says that given any real numbers x, y, and z, if 
x< yand y < z then x < z. 


Because < is reflexive, antisymmetric, and transitive, it is a partial order relation. E 


Because of the special paradigmatic role played by the < relation in the study of 
partial order relations, the symbol < is often used to refer to a general partial order 
relation, and the notation x = y is read “x is less than or equal to y” or “y is greater 
than or equal to x.” 


Lexicographic Order 


To figure out which of two words comes first in an English dictionary, you compare their 
letters one by one from left to right. If all letters have been the same to a certain point and 
one word runs out of letters, that word comes first in the dictionary. For example, play 
comes before playhouse. If all letters up to a certain point are the same and the next letters 
differ, then the word whose next letter is located earlier in the alphabet comes first in the 
dictionary. For instance, playhouse comes before playmate. 

More generally, if A is any set with a partial order relation, then a dictionary or lexi- 
cographic order can be defined on a set of strings over A as indicated in the following 
theorem. 
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Theorem 8.5.1 Lexicographic Order 


Let A be a set with a partial order relation R, and let S be a set of strings over A. 
Define a relation < on S as follows: 

Let s and ż be any strings in S of lengths m and n, respectively, where m and n are 
positive integers, and let s„ and ¢,, be the characters in the mth position for s and 1, 
respectively. 


1. If m = n and the first m characters of s and t are the same, then s ~ t. 
2. If the first m — 1 characters in s and t are the same, Sm R tm, and Sm Æ tm then s = t. 
3. If A is the null string then A <x s. 


If no strings are related by < other than by these three conditions, then < is a partial 
order relation on S. 


The proof of Theorem 8.5.1 is technical but straightforward. It is left for the exercises. 


Definition 


The partial order relation of Theorem 8.5.1 is called the lexicographic order for S 
that corresponds to the partial order R on A. 


SCR Testing Strings for Lexicographic Order 
Let A = {x, y} and let R be the following partial order relation on A: 


R = {(, x), (x, y), Q, y)}- 


Let S be the set of all strings over A, and denote by the lexicographic order for S that cor- 
responds to R. 


a. Isx <x? Isx~=xx? Is yx x yxy? 

b. Is xxxyyy < xy? 

c. Isx x y? 

d. IsA = xy? 

e. Is xyy < xyx? 

Solution 

a. Yes in all three cases, by property (1) of the definition of =. 
b. Yes in all cases, by property (2) of the definition of =. 


c. Yes in all cases, by property (2) of the definition of <. In this case m — 1 = 0, and the 
statement that the first zero characters of x and y are the same is true by default. 


d. Yes by property (3) of the definition of <. 


e. No because y is not related to x by R. E 


Hasse Diagrams 
Let A = {1, 2, 3, 9, 18} and consider the “divides” relation on A: For every a, b E A, 


a|b <œ b= ka for some integer k. 
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The directed graph of this relation has the following appearance: 


Note that there is a loop at every vertex, all other arrows point in the same direction (up- 
ward), and any time there is an arrow from one point to a second and from the second 
point to a third, there is an arrow from the first point to the third. Given any partial order 
relation defined on a finite set, it is possible to draw the directed graph in such a way that 
all of these properties are satisfied. This makes it possible to associate a somewhat simpler 
graph, called a Hasse diagram (after Helmut Hasse, a twentieth-century German number 
theorist), with a partial order relation defined on a finite set. To obtain a Hasse diagram, 
proceed as follows: 

Start with a directed graph of the relation, placing vertices on the page so that all arrows 
point upward. Then eliminate 


1. the loops at all the vertices, 
2. all arrows whose existence is implied by the transitive property, and 


3. the direction indicators on the arrows. 


For the relation given previously, the Hasse diagram is as follows: 


18 


EU eee Constructing a Hasse Diagram 
Consider the “subset” relation, C, on the set P({a, b, c}). That is, for all sets U and V in 


P({a, b, c}), 
UCV s VWx,ifx € Uthenx € V. 


Construct the Hasse diagram for this relation. 


Solution Draw the directed graph of the relation in such a way that all arrows except 
loops point upward. 
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Then strip away all loops, unnecessary arrows, and direction indicators to obtain the Hasse 
diagram. 


{a,b,c} 


{a, b} {b,c} 


{a} {c} 


Ø E 


To recover the directed graph of a relation from the Hasse diagram, just reverse the 
instructions given previously, using the knowledge that the original directed graph was 
sketched so that all arrows pointed upward: 


1. Reinsert the direction markers on the arrows making all arrows point upward. 
2. Add loops at each vertex. 


3. For each sequence of arrows from one point to a second and from that second point to 
a third, add an arrow from the first point to the third. 


Obtaining the Directed Graph of a Partial Order Relation 
from the Hasse Diagram of the Relation 


A partial order relation R has the following Hasse diagram. Find the directed graph of R. 


E 
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Solution 


Partially and Totally Ordered Sets 


Given any two real numbers x and y, either x = y or y = x. In a situation like this, the ele- 
ments x and y are said to be comparable. On the other hand, given two subsets A and B of 
{a, b, c}, it may be the case that neither A C B nor B C A. For instance, let A = {a, b} and 
B = {b,c}. Then A Œ Band B Ç A. In such a case, A and B are said to be noncomparable. 


Definition 


Suppose < is a partial order relation on a set A. Elements a and b of A are said 
to be comparable if, and only if, either a < b or b = a. Otherwise, a and b are 
called noncomparable. 


When all the elements of a partial order relation are comparable, the relation is called 
a total order. 


Definition 


If R is a partial order relation on a set A, and for any two elements a and b in A either 
a R b or b Ra, then R is a total order relation on A. 


Both the “less than or equal to” relation on sets of real numbers and the lexicographic 
order of the set of words in a dictionary are total order relations. Note that the Hasse dia- 
gram for a total order relation can be drawn as a single vertical “chain.” 

Many important partial order relations have elements that are not comparable and are, 
therefore, not total order relations. For instance, the subset relation on P({a, b, c}) is not a 
total order relation because, as shown previously, the subsets {a, b} and {a, c} of {a, b, c} 
are not comparable. In addition, a “divides” relation is not a total order relation unless the 
elements are all powers of a single integer. (See exercise 21 at the end of this section.) 

A set A is called a partially ordered set (or poset) with respect to a relation = if, and 
only if, < is a partial order relation on A. For instance, the set of real numbers is a partially 
ordered set with respect to the “less than or equal to” relation <, and a set of sets is partially 
ordered with respect to the “subset” relation C. It is entirely straightforward to show that 
any subset of a partially ordered set is partially ordered. (See exercise 35 at the end of this 
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section.) This, of course, assumes the “same definition” for the relation on the subset as for 
the set as a whole. A set A is called a totally ordered set with respect to a relation < if, and 
only if, A is partially ordered with respect to < and < is a total order. 

A set that is partially ordered but not totally ordered may have totally ordered subsets. 
Such subsets are called chains. 


Definition 


Let A be a set that is partially ordered with respect to a relation <. A subset B of A is 
called a chain if, and only if, the elements in each pair of elements in B are compa- 
rable. In other words, a < b or b < a for every a and b in B. The length of a chain 
is one less than the number of elements in the chain. 


Observe that if B is a chain in A, then B is a totally ordered set with respect to the “re- 
striction” of < to B. 


A Chain of Subsets 


The set P({a, b, c}) is partially ordered with respect to the subset relation. Find a chain of 
length 3 in P({a, b, c}). 


Solution Since ØC {a} C {a, b} C {a, b, c}, the set 
S = {Ø, {a}, {a, b}, {a, b, c}} 
is a chain of length 3 in ({a, b, c}). E 


In exercise 39 at the end of this section, you are asked to show that a set that is partially 
ordered with respect to a relation < is totally ordered with respect to < if, and only if, it 
is a chain. 

A maximal element in a partially ordered set is an element that is greater than or equal 
to every element to which it is comparable. (There may be many elements to which it is 
not comparable.) A greatest element in a partially ordered set is an element that is greater 
than or equal to every element in the set (so it is comparable to every element in the set). 
Minimal and least elements are defined similarly. 


Let a set A be partially ordered with respect to a relation <. 


1. An element a in A is called a maximal element of A if, and only if, for each b in 
A, either b < a or b and a are not comparable. 


2. An element a in A is called a greatest element of A if, and only if, for each b in 
A,b <a. 

3. An element a in A is called a minimal element of A if, and only if, for each b in 
A, either a < b or b and a are not comparable. 

4. An element a in A is called a least element of A if, and only if, for each b in A, 
a <b. 


A greatest element is maximal, but a maximal element need not be a greatest element. 
However, every finite subset of a totally ordered set has both a least element and a greatest 
element. (See exercise 40 at the end of the section.) Similarly, a least element is minimal, 
but a minimal element need not be a least element. Furthermore, a set that is partially or- 
dered with respect to a relation can have at most one greatest element and one least element 
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(see exercise 42 at the end of the section), but it may have more than one maximal or mini- 
mal element. The next example illustrates some of these facts. 


Sel e-eewee Maximal, Minimal, Greatest, and Least Elements 


Let A = {a, b, c, d, e, f, g, h, i} have the partial ordering < defined by the following Hasse 
diagram. Find all maximal, minimal, greatest, and least elements of A. 


g 
a h 
b A i 
c d 
Solution There is just one maximal element, g, which is also the greatest element. The 
minimal elements are c, d, and i, and there is no least element. E 
Topological Sorting 


Is it possible to input the sets of P({a, b, c}) into a computer in a way that is compatible 
with the subset relation C in the sense that if set U is a subset of set V, then U is input be- 
fore V? The answer, as it turns out, is yes. For instance, the following input order satisfies 
the given condition: 


©, {a}, {b}, {c}, {a, b}, {a, c}, {b, c}, {a, b, c}. 
Another input order that satisfies the condition is 


Ø, {a}, {b}, {a, b}, {c}, {b, c}, {a, c}, {a, b, c}. 


Definition 


Given partial order relations < and <’ on a set A, <' is compatible with < if, and 


only if, for every a and b in A, if a < b then a ~' b. 


Given an arbitrary partial order relation < on a set A, is there a total order =’ on A that is 
compatible with =? If the set on which the partial order is defined is finite, then the answer 
is yes. A total order that is compatible with a given order is called a topological sorting. 


Definition 


Given partial order relations < and <’ on a set A, <’ is a topological sorting for < if, 


and only if, <’ is a total order that is compatible with <. 


The construction of a topological sorting for a general finite partially ordered set is 
based on the fact that any partially ordered set that is finite and nonempty has a minimal 
element. (See exercise 41 at the end of the section.) To create a total order for a partially 
ordered set, simply pick any minimal element and make it number one. Then consider 
the set obtained when this element is removed. Since the new set is a subset of a partially 
ordered set, it is partially ordered. If it is empty, stop the process. If not, pick a minimal 
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element from it and call that element number two. Then consider the set obtained when 
this element also is removed. If this set is empty, stop the process. If not, pick a minimal 
element and call it number three. Continue in this way until all the elements of the set have 
been used up. 

Here is a somewhat more formal version of the algorithm: 


Constructing a Topological Sorting 


Let < be a partial order relation on a nonempty finite set A. To construct a topologi- 
cal sorting: 


1. Pick any minimal element x in A. [Such an element exists since A is nonempty.] 
2. Set A’: = A— {x}. 
3. Repeat steps a—c while A’ # ©. 

a. Pick any minimal element y in A’. 

b. Define x +’ y. 

c. Set A’ := A’ — {y} and x := y. 


[Completion of steps 1—3 of this algorithm gives enough information to construct the 
Hasse diagram for the total ordering <. We have already shown how to use the Hasse 
diagram to obtain a complete directed graph for a relation.] 


A Topological Sorting 


Consider the set A = {2, 3, 4, 6, 18, 24} ordered by the “divides” relation |. The Hasse 
diagram of this relation is the following: 


24 18 
4 6 
2 3 


The ordinary “less than or equal to” relation = on this set is a topological sorting for it 
since for positive integers a and b, if a|b then a S b. Find another topological sorting for 
this set. 


Solution The set has two minimal elements: 2 and 3. Either one may be chosen; suppose 
you pick 3. The beginning of the total order is 


total order: 3. 


Set A’ = A — {3}. You can indicate this by removing 3 from the Hasse diagram as shown 


below. 
24 18 
4 6 
2 


Next choose a minimal element from A’ — {3}. Only 2 is minimal, so you must pick it. The 
total order thus far is 


total order: 3 < 2. 
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Set A’ = (A— {3}) — {2} = A — {3,2}. You can indicate this by removing 2 from the 
Hasse diagram, as is shown below. 


24 rT] 18 
4 6 

Choose a minimal element from A’ — {3, 2}. Again you have two choices: 4 and 6. Sup- 

pose you pick 6. The total order for the elements chosen thus far is 


total order: 3 < 2 < 6. 


You continue in this way until every element of A has been picked. One possible sequence 
of choices gives 


total order: 3 < 2 < 6 < 18 < 4 ~ 24. 


You can verify that this order is compatible with the “divides” partial order by checking 
that for each pair of elements a and b in A such that a | b, then a < b. Note that it is not the 
case that if a < b then a | b. E 


An Application 


To return to the example that introduced this section, note that the following defines a 
partial order relation on the set of courses required for a university degree: For all required 
courses x and y, 


x<xy + x=y or xisa prerequisite fory. 


If the Hasse diagram for the relation is drawn, then the questions raised at the beginning 
of this section can be answered easily. For instance, consider the Hasse diagram for the 
requirements at a particular university, which is shown in Figure 8.5.1. 


CS 350 


CS 200 


MA 140 CS 150 


FIGURE 8.5.1 


The minimum number of school terms needed to complete the requirements is the size 
of a longest chain, which is 7 (150, 155, 225, 300, 340, 360, 390, for example). The maxi- 
mum number of courses that could be taken in the same term (assuming the university 
allows it) is the maximum number of noncomparable courses, which is 6 (350, 360, 345, 
301, 230, 200, for example). A part-time student could take the courses in a sequence 
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determined by constructing a topological sorting for the set. (One such sorting is 140, 150, 
141, 155, 200, 225, 230, 300, 250, 301, 340, 345, 350, 360, 390. There are many others.) 


PERT and CPM 


Two important and widely used applications of partial order relations are PERT (Program 
Evaluation and Review Technique) and CPM (Critical Path Method). These techniques 
were developed in the 1950s as planners came to grips with the complexities of scheduling 
the individual activities needed to complete very large projects, and although they are very 
similar, their developments were independent. PERT was developed by the U.S. Navy to 
help organize the construction of the Polaris submarine, and CPM was developed by the 
E. I. Du Pont de Nemours company for scheduling chemical plant maintenance. Here is a 
somewhat simplified example of the way the techniques work. 


A Job Scheduling Problem 


At an automobile assembly plant, the job of assembling an automobile can be broken down 
into these tasks: 


1. Build frame. 

. Install engine, power train components, gas tank. 
Install brakes, wheels, tires. 

. Install dashboard, floor, seats. 

Install electrical lines. 

. Install gas lines. 

Install brake lines. 


. Attach body panels to frame. 


oe Oo N DH FW WN 


Paint body. 


Certain of these tasks can be carried out at the same time, whereas some cannot be started 
until other tasks are finished. Table 8.5.1 summarizes the order in which tasks can be per- 
formed and the time required to perform each task. 


TABLE 8.5.1 
Task Immediately Preceding Tasks Time Needed to Perform Task 
1 7 hours 
2, 1 6 hours 
3 1 3 hours 
4 2 6 hours 
5 2,3 3 hours 
6 4 1 hour 
T 2,3 1 hour 
8 4,5 2 hours 
9 6, 7,8 5 hours 
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Let T be the set of all tasks, and consider the partial order relation < defined on T as 
follows: For all tasks x and y in 7, 


xxy 4 x=yorx precedes y. 


If the Hasse diagram of this relation is turned sideways (as is customary in PERT and 
CPM analysis), it has the appearance shown below. 


Task 4 
6 hours Task 6 
1 hour 
Task 2 Task 5 


7 hours 


6 hours 3 hours 
v4 a Task 8 
Task 1 2 hours Task 9 
5 hours 
| Task 3 Task 7 | 
3 hours 1 hour 


What is the minimum time required to assemble a car? You can determine this by 
working from left to right across the diagram, noting for each task (say, just above the box 
representing that task) the minimum time needed to complete that task starting from the 
beginning of the assembly process. For instance, you can put a 7 above the box for task 1 
because task 1 requires 7 hours. Task 2 requires completion of task 1 (7 hours) plus 6 hours 
for itself, so the minimum time required to complete task 2, starting at the beginning of 
the assembly process, is 7+6 = 13 hours. So you can put a 13 above the box for task 2. 
Similarly, you can put a 10 above the box for task 3 because 7+3 = 10. Now consider 
what number you should write above the box for task 5. The minimum times to complete 
tasks 2 and 3, starting from the beginning of the assembly process, are 13 and 10 hours 
respectively. Since both tasks must be completed before task 5 can be started, the mini- 
mum time to complete task 5, starting from the beginning, is the time needed for task 5 
itself (3 hours) plus the maximum of the times to complete tasks 2 and 3 (13 hours), and 
this equals 3 + 13 = 16 hours. Thus you should place the number 16 above the box for task 
5. The same reasoning leads you to place a 14 above the box for task 7. Similarly, you can 
place a 19 above the box for task 4, a 20 above the box for task 6, a 21 above the box for 
task 8, and a 26 above the box for task 9, as shown below. 


19 
Task 4 20 
6 hours Task 6 
13 16 1 hour 
Task 2 Task 5 21 
z 6 hours 3 hours 
Task 8 
Task 1 ið i i 2 hours Task 9 
7 hours 5 hours 
Task 3 Task 7 
3 hours 1 hour 


This analysis shows that at least 26 hours are required to complete task 9 starting from the 
beginning of the assembly process. When task 9 is finished, the assembly is complete, so 
26 hours is the minimum time needed to accomplish the whole process. 
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1. 
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Note that the minimum time required to complete tasks 1, 2, 4, 8, and 9 in sequence is 
exactly 26 hours. This means that a delay in performing any one of these tasks causes a 
delay in the total time required for assembly of the car. For this reason, the path through 
tasks 1, 2, 4, 8, and 9 is called a critical path. E 


For a relation R on a set A to be antisymmetric 
means that 


. To show that a relation R on an infinite set A is 


antisymmetric, you suppose that 
show that ——. 


and you 


. To show that a relation R on a set A is not antisym- 


metric, you 


To construct a Hasse diagram for a partial order 
relation, you start with a directed graph of the 
relation in which all arrows point upward and you 
eliminate ; , and 


If A is a set that is partially ordered with respect 
to a relation < and if a and b are elements of A, 
we say that a and b are comparable if, and only if, 
or 


A relation < on a set A is a total order if, and only 
if, 


EXERCISE SET 8.5 


1. 


Each of the following is a relation on {0, 1, 2, 3}. 
Draw directed graphs for each relation, and indi- 
cate which relations are antisymmetric. 


a. R, = {0, 0), ©, 2), C, 0), CL, 3), (2, 2), (3, 0), (3, D} 
b. Ry = {0, 1), 0, 2), (1, 1), (1, 2), C, 3), (2, 2), 3, D} 
c. R; = {0, 0), O, 3), (d, 0), CL, 3), (2, 2), (3, 3), (3, D} 
d. Ry = {(0, 0), (1, 0), (1, 2), (1, 3), (2, 0), (2, 1), 


(3, 2), G, 0)} 


. Let P be the set of all people in the world and 


define a relation R on P as follows: For all people 


x and y, 
xRy < xis no older than y. 


Is R antisymmetric? Prove or give a counterexample. 


. Let S be the set of all strings of a’s and b’s. Define 


a relation R on S as follows: For every s, t E S, 
sRt & L(s)SLM, 


where L(x) denotes the length of a string x. Is R 
antisymmetric? Prove or give a counterexample. 
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10. 


If A is a set that is partially ordered with respect 
to a relation <, and if B is a subset of A, then B is a 
chain if, and only if, for all a and b in B, 


. Let A be a set that is partially ordered with respect 


to a relation <, and let a be an element of A. 


(a) ais maximal if, and only if, 


(b) ais a greatest element of A if, and only if, 


(c) ais minimal if, and only if, 


(d) ais a least element of A if, and only if, 


Given a set A that is partially ordered with respect 
to a relation <, the relation <’ is a topological 
sorting for <, if, and only if, <’ is a and for 
alla and b in A if a < b then ____. 


PERT and CPM are used to produce efficient 


Let R be the “less than” relation on R, the set of 
all real numbers: For every x, y E R, 


xRy © x<y. 


Is R antisymmetric? Prove or give a counterexample. 


. Let R be the set of all real numbers and define a 


relation R on R X R as follows: For every (a, b) 


and (c, d) in R X R, 
(a, b)R(c,d) < eithera < cor botha = c 


and b £ d. 


Is R a partial order relation? Prove or give a 
counterexample. 


. Let P be the set of all people who have ever lived 


and define a relation R on P as follows: For every 


is SP: 
rRs © risan ancestor of sorr=s. 


Is R a partial order relation? Prove or give a 
counterexample. 


10. 


1. 


12. 
13. 


14. 


16. 


Define a relation R on Z, the set of all integers as 
follows: For every m, n E Z, 


mRn < every prime factor of m 


is a prime factor of n. 


Is R a partial order relation? Prove or give a 
counterexample. 


. Define a relation R on Z, the set of all integers as 


follows: For every m, n E Z, 


mRn & m+nis even. 


Is R a partial order relation? Prove or give a 
counterexample. 


Define a relation R on R, the set of all real num- 
bers as follows: For every x, y E R, 


xRy © r= y. 
Is R a partial order relation? Prove or give a 
counterexample. 


Suppose R and S are antisymmetric relations on a 
set A. Must R U S also be antisymmetric? Explain. 


Let A = {a, b}, and suppose A has the partial 
order relation R where R = {(a, a), (a, b), (b, b)}. 
Let S be the set of all strings in a’s and b’s and 
let be the corresponding lexicographic order on 
S. Indicate which of the following statements are 
true, and for each true statement cite as a reason 
part (1), (2), or (3) of the definition of lexicograph- 
ic order given in Theorem 8.5.1. 

a. aab < aaba b. bbab < bba 

c. A < aba d. aba < abb 

e. bbab < bbaa f. ababa < ababaa 
g. bbaba < bbabb 


Prove Theorem 8.5.1. 


Let A = {a, b}. Describe all partial order relations 
on A. 


Let A = {a,b,c}. 

a. Describe all partial order relations on A for 
which a is a maximal element. 

b. Describe all partial order relations on A for 
which a is a minimal element. 


. Suppose a relation R on a set A is reflexive, sym- 


metric, transitive, and antisymmetric. What can 
you conclude about R? Prove your answer. 


Consider the “divides” relation on each of the fol- 
lowing sets A. Draw the Hasse diagram for each 
relation. 


17. 


18. 


19. 


20. 


21, 
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a. A = {1, 2, 4, 5, 10, 15, 20} 
b. A = {2,3, 4, 6, 8, 9, 12, 18} 


Consider the “subset” relation on P(S) for each of 
the following sets S. Draw the Hasse diagram for 
each relation. 


a. S= {0, 1} b. S= {0, 1, 2} 


Let S = {0, 1} and consider the partial order rela- 
tion R defined on S X S as follows: For all ordered 
pairs (a, b) and (c, d) in S X S, 


(a,b) R (c,d) & eithera< c or both 
a=candb<=d, 


where < denotes the usual “less than” and = de- 
notes the usual “less than or equal to” relation for 
real numbers. Draw the Hasse diagram for R. 


Let S = {0, 1} and consider the partial order rela- 
tion R defined on S X S as follows: For all ordered 
pairs (a, b) and (c, d) in S X S, 


(a,b) R (c,d @& ascandb=d, 


where = denotes the usual “less than or equal to” 
relation for real numbers. Draw the Hasse diagram 
for R. 


Let S = {0, 1} and consider the partial order rela- 
tion R defined on S X S$ X S as follows: For all 


ordered triples (a, b, c) and (d, e, f) in S X S X S, 
(a,b,c) R (d,e, f) & asd,b=e,andc Sf, 


where = denotes the usual “less than or equal to” 
relation for real numbers. Draw the Hasse diagram 
for R. 


Consider the “divides” relation defined on the set 


A = {1,2, 2°, 23,...,2"}, where n is a nonnegative 

integer. 

a. Prove that this relation is a total order relation 
on A. 


b. Draw the Hasse diagram for this relation for 
n= 4. 


In 22-29, find all greatest, least, maximal, and minimal 
elements for the relations in each of the referenced 


exercises. 

22. Exercise 16(a) 23. Exercise 16(b) 
24. Exercise 17(a) 25. Exercise 17(b) 
26. Exercise 18 27. Exercise 19 


28. Exercise 20 29. Exercise 21 
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30. Each of the following sets is partially ordered 
with respect to the “less than or equal to” rela- 
tion, =, for real numbers. In each case, deter- 
mine whether the set has a greatest or least 
element. 
a R 


c. {xER|O<x< 1} 
31. Let A = {a, b, c, d}, and let R be the relation 


R = {(a, a), (b, b), (c, c), (d, d), (c, a), (a, d), 
(c, d), (b, c), (b, d), (b, a)}. 


Is R a total order on A? Justify your answer. 


32. Let A = {a, b, c, d}, and let R be the relation 


R = {(a, a), (b, b), (c, c), (d, d), (c, b), (a, d), 
(b, a), (b, d), (c, d), (c, a)}. 


Is R a total order on A? Justify your answer. 


33. Consider the set A = {12, 24, 48, 3, 9} ordered by 
the “divides” relation. Is A totally ordered with 
respect to the relation? Justify your answer. 


Suppose that R is a partial order relation on a set A 
and that B is a subset of A. The restriction of R to 
B is defined as follows: 


The restriction of R to B 


= {(x, y) |xEB,y E B, and (x, y) E R}. 


In other words, two elements of B are related by 
the restriction of R to B if, and only if, they are 
related by R. Prove that the restriction of R to B is 
a partial order relation on B. (In less formal lan- 
guage, this says that a subset of a partially ordered 
set is partially ordered.) 


35. The set P({w, x, y, z}) is partially ordered with 
respect to the “subset” relation C. Find a chain of 
length 4 in P({w, x, y, z}). 


36. The set A = {2, 4, 3, 6, 12, 18, 24} is partially 
ordered with respect to the “divides” relation. Find 
a chain of length 3 in A. 


37. Find a chain of length 2 for the relation defined in 
exercise 19. 


38. Prove that a partially ordered set is totally ordered 
if, and only if, it is a chain. 


39. Suppose that A is a totally ordered set. Use 
mathematical induction to prove that for any 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


b. {xER|OSx=1} 
d. {xE€Z|0<x< 10} 


40. 


41. 


42. 


43. 


44. 


45. 


46 


47. 


48. 


49. 


50. 


integer n = 1, every subset of A with n ele- 
ments has both a least element and a greatest 
element. 


Prove that a nonempty, finite, partially ordered 
set has 

a. at least one minimal element, 

b. at least one maximal element. 


Prove that a finite, partially ordered set has 
a. at most one greatest element, 
b. at most one least element. 


Draw a Hasse diagram for a partially ordered set 
that has two maximal elements and two minimal 
elements and is such that each element is compa- 
rable to exactly two other elements. 


Draw a Hasse diagram for a partially ordered 

set that has three maximal elements and three 
minimal elements and is such that each element is 
either greater than or less than exactly two other 
elements. 


Use the algorithm given in the text to find a to- 
pological sorting for the relation of exercise 16(a) 
that is different from the “less than or equal to” 
relation =. 


Use the algorithm given in the text to find a to- 
pological sorting for the relation of exercise 16(b) 
that is different from the “less than or equal to” 
relation =. 


Use the algorithm given in the text to find a topo- 
logical sorting for the relation of exercise 19. 


Use the algorithm given in the text to find a topo- 
logical sorting for the relation of exercise 20. 


Use the algorithm given in the text to find a 
topological sorting for the “subset” relation on 
P({a, b, c, d}). 


Refer to the prerequisite structure shown in 

Figure 8.5.1. 

a. Find a list of six noncomparable courses that is 
different from the list given in the text. 

b. Find two topological sortings that are different 
from the one given in the text. 


A set S of jobs can be ordered by writing x < y to 
mean that either x = y or x must be done before y, 
for all x and y in S. The following is a Hasse dia- 

gram for this relation for a particular set S of jobs. 


Ww 


10 8 


a. If one person is to perform all the jobs, one 
after another, find an order in which the jobs 
can be done. 

b. Suppose enough people are available to per- 
form any number of jobs simultaneously. 

(i) If each job requires one day to perform, 
what is the least number of days needed to 
perform all ten jobs? 

(ii) What is the maximum number of jobs that 
can be performed at the same time? 


ANSWERS FOR TEST YOURSELF 


8.5 PARTIAL ORDER RELATIONS 563 


51. Suppose the tasks described in Example 8.5.12 
require the following performance times: 


Task | Time Needed to Perform Task 
1 9 hours 
2 7 hours 
3 4 hours 
4 5 hours 
5 7 hours 
6 3 hours 
7 2 hours 
8 4 hours 
9 6 hours 


a. What is the minimum time required to 
assemble a car? 
b. Find a critical path for the assembly process. 


1. for every a and bin A, if a R b and b R a then a = b 

2. a and b are any elements of A with a R b and b R a; 
a=b _ 3. show that there are elements a and b in A such 
thata RbandbRaanda +b 4. all loops; all arrows 
whose existence is implied by the transitive property; the 
direction indicators on the arrows 5.a<b,b<a 


6. for any two elements a and b in A, either a < b or b <a 

7. a and b are comparable 8. (a) for every b in A either b < a 
or b and a are not comparable (b) for every b in A, b <a 

(c) for every b in A either a < b or b and a are not comparable 
(d) for every bin A,a<b 9. total order; a <' b 

10. scheduling of tasks 
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COUNTING AND 
PROBABILITY 


“It’s as easy as 1-2-3.” 

That’s the saying. And in certain ways, counting is easy. But other aspects of counting 
aren’t so simple. Have you ever agreed to meet a friend “in three days” and then real- 
ized that you and your friend might mean different things? For example, on the European 
continent, to meet in eight days means to meet on the same day as today one week hence; 
on the other hand, in English-speaking countries, to meet in seven days means to meet one 
week hence. The difference is that on the continent, all days including the first and the last 
are counted. In the English-speaking world, it’s the number of 24-hour periods that are 


counted. 
Continental countries 1 2 3 4 5 6 7 8 
| l l T l J J l 
Sun Mon Tue Wed Thu Fri Sat Sun 
English-speaking countries 1 2 3 4 5 6 7 


The English convention for counting days follows the almost universal convention for 
counting hours. If it is 9 A.M. and two people anywhere in the world agree to meet in three 
hours, they mean that they will get back together again at 12 noon. 

Musical intervals, on the other hand, are universally reckoned the way the Continentals 
count the days of a week. An interval of a third consists of two tones with a single tone 
in between, and an interval of a second consists of two adjacent tones. (See Figure 9.1.1.) 


C E C D 
Interval of a third Interval of a second 
FIGURE 9.1.1 


Of course, the complicating factor in all these examples is not how to count but rather 
what to count. And, indeed, in the more complex mathematical counting problems dis- 
cussed in this chapter, it is what to count that is the central issue. 


EXE Introduction to Probability 


Imagine tossing two coins and observing whether 0, 1, or 2 heads are obtained. It would 
be natural to guess that each of these events occurs about one-third of the time, but in 
fact this is not the case. Table 9.1.1 shows actual data obtained from tossing two quarters 
50 times. 


564 
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TABLE 9.1.1 Experimental Data Obtained from Tossing Two Quarters 50 Times 


Frequency (Number 
of times the event 


Relative Frequency 
(Fraction of times 


Event Tally occurred) the event occurred) 
2 heads obtained Lat JET | 11 22% 
1 head obtained M LAT II 27 54% 
0 heads obtained WT II 12 24% 


As you can see, the relative frequency of obtaining exactly 1 head was roughly twice as 
great as that of obtaining either 2 heads or 0 heads. It turns out that the mathematical the- 
ory of probability can be used to predict that a result like this will almost always occur. To 
see how, call the two coins A and B, and suppose that each is perfectly balanced. Then each 
has an equal chance of coming up heads or tails, and when the two are tossed together, the 
four outcomes pictured in Figure 9.1.2 are all equally likely. 


2 heads obtained 


1 head obtained 0 heads obtained 


FIGURE 9.1.2 Equally Likely Outcomes from Tossing Two Balanced Coins 


Figure 9.1.2 shows that there is a 1 in 4 chance of obtaining two heads and a 1 in 4 
chance of obtaining no heads. The chance of obtaining one head, however, is 2 in 4 be- 
cause either A could come up heads and B tails or B could come up heads and A tails. So if 
you repeatedly toss two balanced coins and record the number of heads, you should expect 
relative frequencies similar to those shown in Table 9.1.1. 

To formalize this analysis and extend it to more complex situations, we introduce the 
notions of random process, sample space, event, and probability. To say that a process 
is random means that when it takes place, one outcome from some set of outcomes is 
sure to occur, but it is impossible to predict with certainty which outcome that will be. 
For instance, if an ordinary person performs the experiment of tossing an ordinary coin 
into the air and allowing it to fall flat on the ground, it can be predicted with certainty 
that the coin will land either heads up or tails up (so the set of outcomes can be denoted 
{heads, tails}), but it is not known for sure whether heads or tails will occur. We re- 
stricted this experiment to ordinary people because a skilled magician can toss a coin 
in a way that appears random but is not, and a physicist equipped with first-rate measur- 
ing devices may be able to analyze all the forces on the coin and correctly predict its 
landing position. Just a few of many examples of random processes or experiments are 
choosing winners in state lotteries, selecting respondents in public opinion polls, and 
choosing subjects to receive treatments or serve as controls in medical experiments. 
The set of outcomes that can result from a random process or experiment is called a 
sample space. 
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Definition 


A sample space is the set of all possible outcomes of a random process or experi- 
ment. An event is a subset of a sample space. 


In the case where an experiment has finitely many outcomes and all outcomes are 
equally likely to occur, the probability of an event (set of outcomes) is just the ratio of 
the number of outcomes in the event to the total number of outcomes. Strictly speaking, 
this result can be deduced from a set of axioms for probability formulated in 1933 by the 
Russian mathematician A. N. Kolmogorov. In Section 9.8 we discuss the axioms and show 
how to derive their consequences formally. At present, we take a less formal approach to 
probability and simply state the result as a principle. 


Equally Likely Probability Formula 


If S is a finite sample space in which all outcomes are equally likely and E is an 
event in S, then the probability of E, denoted P(E), is 


the number of outcomes in £ 


P(E) = 


the total number of outcomes in S` 


For any finite set A, N(A) denotes the number of elements in A. 


With this notation, the equally likely probability formula becomes 


mult) 
NBS 


Probabilities for a Deck of Cards 


An ordinary deck of cards contains 52 cards divided into four suits. The red suits are dia- 
monds (¢) and hearts (V), and the black suits are clubs (#) and spades ($). Each suit con- 
tains 13 cards of the following denominations: 2, 3, 4, 5, 6, 7, 8, 9, 10, J (jack), Q (queen), 
K (king), and A (ace). The cards J, Q, and K are called face cards. 

Mathematician Persi Diaconis, working with David Aldous in 1986 and Dave Bayer 
in 1992, showed that seven shuffles are needed to “thoroughly mix up” the cards in an 
ordinary deck. In 2000 mathematician Nick Trefethen, working with his father, Lloyd 
Trefethen, a mechanical engineer, used a somewhat different definition of “thoroughly mix 
up” to show that six shuffles will nearly always suffice. Imagine that the cards in a deck 
have become—by some method—so thoroughly mixed up that if you spread them out face 
down and pick one at random, you are as likely to get any one card as any other. 


a. What is the sample space of outcomes? 
b. What is the event that the chosen card is a black face card? 


c. What is the probability that the chosen card is a black face card? 
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Solution 
a. The outcomes in the sample space S are the 52 cards in the deck. 


b. Let E be the event that a black face card is chosen. The outcomes in E are the jack, 
queen, and king of clubs and the jack, queen, and king of spades. Symbolically: 


E = {J&, Q&, K&, JA, Q4, K4). 


c. By part (b), N(E) = 6, and according to the description of the situation, all 52 outcomes 
in the sample space are equally likely. Therefore, by the equally likely probability 
formula, the probability that the chosen card is a black face card is 


_ME)_ 6 _ 
P(E) = is a 11.5% E 


ZEWJGEAWA Rolling a Pair of Dice 


A die is one of a pair of dice. It is a cube with six sides, each containing from one to six 
dots, called pips. Suppose a blue die and a gray die are rolled together, and the numbers of 
dots that occur face up on each are recorded. The possible outcomes can be listed as fol- 
lows, where in each case the die on the left is blue and the one on the right is gray. 


A more compact notation identifies, say, i ES with the notation 24, E E with 53, and 
so forth. 


a. Use the compact notation to write the sample space S of possible outcomes. 


b. Use set notation to write the event E that the numbers showing face up have a sum of 6 
and find the probability of this event. 


Solution 
a. S = {11, 12, 13, 14, 15, 16, 21, 22, 23, 24, 25, 26, 31, 32, 33, 34, 35, 36, 41, 42, 43, 44, 
45, 46, 51, 52, 53, 54, 55, 56, 61, 62, 63, 64, 65, 66}. 
b. E = {15, 24, 33, 42, 51}. 
The probability that the sum of the numbers is 6 = P(E) x“ 2 ; E 
N(S) 36 

The next example is called the Monty Hall problem, named for the first host of the game 
show “Let’s Make A Deal.” When it was originally publicized in a newspaper column and 
on a radio show, it created tremendous controversy. Many highly educated people, even some 
with Ph.D.’s, submitted incorrect solutions or argued vociferously against the correct solution. 
Before you read the answer, think about what your own response to the situation would be. 
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The Monty Hall Problem 


There are three doors on the set for a game show. Let’s call them A, B, and C. If you pick 
the correct door, you win the prize. You pick door A. The host of the show then opens one 
of the other doors and reveals that there is no prize behind it. Keeping the remaining two 
doors closed, he asks you whether you want to switch your choice to the other closed door 
or stay with your original choice of door A. What should you do if you want to maximize 
your chance of winning the prize: stay with door A or switch—or would the likelihood of 
winning be the same either way? 


Solution At the point just before the host opens one of the closed doors, there is no infor- 
mation about the location of the prize. Thus there are three equally likely possibilities for 
what lies behind the doors: (Case 1) the prize is behind A (meaning it is not behind either 
B or C); (Case 2) the prize is behind B; or (Case 3) the prize is behind C. 

Since there is no prize behind the door the host opens, in Case | the host could open 
either door and you would win by staying with your original choice: door A. In Case 2 the 
host must open door C, and so you would win by switching to door B. In Case 3 the host 
must open door B, and so you would win by switching to door C. Thus, in two of the three 
equally likely cases, you would win by switching from A to the other closed door. In only 
one of the three equally likely cases would you win by staying with your original choice. 
Therefore, you should switch. a 


The analysis used for the solution in Example 9.1.3 applies only if the host always opens 
one of the closed doors and offers the contestant the choice of staying with the original 
choice or switching. In the original show, the host made this offer only occasionally—most 
often when he knew the contestant had already chosen the correct door. 

Many of the fundamental principles of probability were formulated in the mid-1600s in 
an exchange of letters between Pierre de Fermat and Blaise Pascal in response to questions 
posed by a French nobleman interested in games of chance. In 1812, Pierre-Simon Laplace 
published the first general mathematical treatise on the subject and extended the range of 
applications to a variety of scientific and practical problems. 


Bettmann/Getty Images 


Pierre-Simon Laplace 
(1749-1827) 


Counting the Elements of a List 


Some counting problems are as simple as counting the elements of a list. For instance, how 
many integers are there from 5 through 12? To answer this question, imagine going along 
the list of integers from 5 to 12, counting each in turn. 


list! 5 6 7 8 9 10 II 


count: 


So the answer is 8. 
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More generally, if m and n are integers and m S n, how many integers are there from 
m through n? To answer this question, note that n = m + (n — m), where n — m = Q [since 
n = m]. Note also that the element m + 0 is the first element of the list, the element m + 1 
is the second element, the element m + 2 is the third, and so forth. In general, the element 
m + iis the (i+ 1)st element of the list. 


list: m(=m+0) m+1 m+2 ... n(=m+(n-m)) 
t 
count: 1 2 3 ies (n-m)+1 


And so the number of elements in the list is n— m + 1. 

This general result is important enough to be restated as a theorem, the formal proof of 
which uses mathematical induction. (See exercise 33 at the end of this section.) The heart 
of the proof is the observation that if the list m, m + 1,..., k has k—m + 1 numbers, then 
the list m,m + 1,..., k,k+ 1 has (k—-m+1)+1 =(k+1)—m+ 1 numbers. 


Theorem 9.1.1 The Number of Elements in a List 


If m and n are integers and m <n, then there are n—m + 1 integers from m to n 
inclusive. 


Counting the Elements of a Sublist 
a. How many three-digit integers (integers from 100 to 999 inclusive) are divisible by 5? 
b. What is the probability that a randomly chosen three-digit integer is divisible by 5? 


Solution 


a. Imagine writing the three-digit integers in a row, noting those that are multiples of 5 
and drawing arrows between each such integer and its corresponding multiple of 5. 


100 101 102 103 104 105 106 107 108 109 110 ... 994 995 996 997 998 999 
t t t t 
5-20 5-21 5:22 5:199 


From the sketch it is clear that there are as many three-digit integers that are mul- 
tiples of 5 as there are integers from 20 to 199 inclusive. By Theorem 9.1.1, there are 
199 — 20 + 1, or 180, such integers. Hence there are 180 three-digit integers that are 
divisible by 5. 


b. By Theorem 9.1.1 the total number of integers from 100 through 999 is 999 — 100 + 1 = 
900. By part (a), 180 of these are divisible by 5. Hence the probability that a randomly 
chosen three-digit integer is divisible by 5 is 180/900 = 1/5. E 


Application: Counting Elements of a One-Dimensional Array 


Analysis of many computer algorithms requires skill at counting the elements of a one-dimen- 
sional array. Let A[1], A[2], . . . , Alm] be a one-dimensional array, where n is a positive integer. 


a. Suppose the array is cut at a middle value A[m] so that two subarrays are formed: 
(D A[]], A[2],..., Alm] and (2) A[m+ 1], A[m+ 2],..., Alm]. 


How many elements does each subarray have? 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


570 CHAPTER9 COUNTING AND PROBABILITY 


b. What is the probability that a randomly chosen element of the array has an even 
subscript 


(i) if nis even? (ii) if n is odd? 


Solution 


a. Array (1) has the same number of elements as the list of integers from 1 through m. 
So by Theorem 9.1.1, it has m, or m — 1 + 1, elements. Array (2) has the same number 
of elements as the list of integers from m + 1 through n. So by Theorem 9.1.1, it has 
n—m, orn—(m+1)+1, elements. 


b. (i) Ifnis even, each even subscript starting with 2 and ending with n can be matched 
up with an integer from 1 to n/2. 


12345678910... n 
to ¢ to f t t 
2-1 2-2 2:3 24 25 2+(n/2) 


So there are n/2 array elements with even subscripts. Since the entire array has n 
elements, the probability that a randomly chosen element has an even subscript is 


eae 


(ii) If n is odd, then the greatest even subscript of the array is n — 1. So there are as 
many even subscripts between 1 and n as there are from 2 through n — 1. Then the 


reasoning of (i) can be used to conclude that there are (n — 1)/2 array elements 
with even subscripts. 


GO. 225 n—-1 n 
J J 
3 2-[(#— 1)/2)] 


Since the entire array has n elements, the probability that a randomly chosen ele- 
ment has an even subscript is 


(n=1)/2 __n-1 
n 2n ` 


Observe that as n gets larger and larger, this probability gets closer and closer to 1/2. 


Note that the answers to (i) and (ii) can be combined using the floor notation. By Theo- 


rem 4.6.2, the number of array elements with even subscripts is |n/2], so the probability that 


_. [7/2] 
a randomly chosen element has an even subscript is / ; E 
n 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. A sample space of a random process or 3. To compute the probability of an event using the 
experiment is ____. equally likely probability formula, you take the 
ratio of the to the 


2. An event in a sample space is 


4. If m =n, the number of integers from m to n 
inclusive is 
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1. 


9.1 INTRODUCTION TO PROBABILITY 571 


Toss two coins 30 times and make a table showing 
the relative frequencies of 0, 1, and 2 heads. How do 
your values compare with those shown in Table 9.1.1? 


. In the example of tossing two quarters, what is the 


probability that at least one head is obtained? that 
coin A is a head? that coins A and B are either both 
heads or both tails? 


In 3-6 use the sample space given in Example 9.1.1. Write 
each event as a set and compute its probability. 


3. 


The event that the chosen card is red and is not a 
face card. 


The event that the chosen card is black and has an 
even number on it. 


The event that the denomination of the chosen 
card is at least 10 (counting aces high). 


The event that the denomination of the chosen 
card is at most 4 (counting aces high). 


In 7-10, use the sample space given in Example 9.1.2. 
Write each of the following events as a set and compute 
its probability. 


7. The event that the sum of the numbers showing 


10. 


Tl. 


face up is 8. 


. The event that the numbers showing face up are 


the same. 


The event that the sum of the numbers showing 
face up is at most 6. 


The event that the sum of the numbers showing 
face up is at least 9. 


Suppose that a coin is tossed three times and 

the side showing face up on each toss is noted. 

Suppose also that on each toss heads and tails 

are equally likely. Let HHT indicate the outcome 

heads on the first two tosses and tails on the third, 

THT the outcome tails on the first and third tosses 

and heads on the second, and so forth. 

a. List the eight elements in the sample space 
whose outcomes are all the possible head-tail 
sequences obtained in the three tosses. 

b. Write each of the following events as a set and 
find its probability: 


(i) The event that exactly one toss results in a 
head. 
Gi) The event that at least two tosses result in 
a head. 
(iii) The event that no head is obtained. 


12. Suppose that each child born is equally likely to 


13. 


14. 


be a boy or a girl. Consider a family with exactly 
three children. Let BBG indicate that the first two 
children born are boys and the third child is a girl, 
let GBG indicate that the first and third children 
born are girls and the second is a boy, and so forth. 
a. List the eight elements in the sample space 
whose outcomes are all possible genders of the 
three children. 
b. Write each of the events in the next column as 
a set and find its probability. 
(i) The event that exactly one child is a girl. 
(ii) The event that at least two children are 
girls. 
(iii) The event that no child is a girl. 


Suppose that on a true/false exam you have no 
idea at all about the answers to three questions. 
You choose answers randomly and therefore have 
a 50-50 chance of being correct on any one ques- 
tion. Let CCW indicate that you were correct on 
the first two questions and wrong on the third, let 
WCW indicate that you were wrong on the first 
and third questions and correct on the second, and 
so forth. 

a. List the elements in the sample space whose 
outcomes are all possible sequences of correct 
and incorrect responses on your part. 

b. Write each of the following events as a set and 
find its probability: 

(i) The event that exactly one answer is 
correct. 
(ii) The event that at least two answers are 
correct. 
(iii) The event that no answer is correct. 


Three people have been exposed to a certain ill- 

ness. Once exposed, a person has a 50-50 chance 

of actually becoming ill. 

a. What is the probability that exactly one of the 
people becomes ill? 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
iven. The symbol * signals that an exercise is more challenging than usual. 
8 y S ging 
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b. What is the probability that at least two of the 
people become ill? 

c. What is the probability that none of the three 
people becomes ill? 


15. When discussing counting and probability, we 


16 


17. 


18. 


often consider situations that may appear frivolous 
or of little practical value, such as tossing coins, 
choosing cards, or rolling dice. The reason is that 
these relatively simple examples serve as models 
for a wide variety of more complex situations in 
the real world. In light of this remark, comment on 
the relationship between your answer to exercise 
11 and your answers to exercises 12-14. 


Two faces of a six-sided die are painted red, two 
are painted blue, and two are painted yellow. The 
die is rolled three times, and the colors that appear 
face up on the first, second, and third rolls are 
recorded. 

a. Let BBR denote the outcome where the color 
appearing face up on the first and second 
rolls is blue and the color appearing face up 
on the third roll is red. Because there are as 
many faces of one color as of any other, the 
outcomes of this experiment are equally likely. 
List all 27 possible outcomes. 

b. Consider the event that all three rolls produce 
different colors. One outcome in this event is 
RBY and another RYB. List all outcomes in the 
event. What is the probability of the event? 

c. Consider the event that two of the colors that 
appear face up are the same. One outcome in 
this event is RRB and another is RBR. List all 
outcomes in the event. What is the probability 
of the event? 


Consider the situation described in exercise 16. 

a. Find the probability of the event that exactly 
one of the colors that appears face up is red. 

b. Find the probability of the event that at least 
one of the colors that appears face up is red. 


An urn contains two blue balls (denoted B, and 
B») and one white ball (denoted W). One ball is 
drawn, its color is recorded, and it is replaced in 
the urn. Then another ball is drawn, and its color 
is recorded. 

a. Let B,W denote the outcome that the first ball 
drawn is B, and the second ball drawn is W. 
Because the first ball is replaced before the 
second ball is drawn, the outcomes of the 
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19. 


* 20. 


21. 


22. 


experiment are equally likely. List all nine 
possible outcomes of the experiment. 

b. Consider the event that the two balls that are 
drawn are both blue. List all outcomes in the 
event. What is the probability of the event? 

c. Consider the event that the two balls that are 
drawn are of different colors. List all outcomes 
in the event. What is the probability of the 
event? 


An urn contains two blue balls denoted B, and 
B,) and three white balls (denoted W,, W>, and 
W3). One ball is drawn, its color is recorded, and it 
is replaced in the urn. Then another ball is drawn 
and its color is recorded. 

a. Let B,W, denote the outcome that the first ball 
drawn is B, and the second ball drawn is W32. 
Because the first ball is replaced before the 
second ball is drawn, the outcomes of the ex- 
periment are equally likely. List all 25 possible 
outcomes of the experiment. 

b. Consider the event that the first ball that is 
drawn is blue. List all outcomes in the event. 
What is the probability of the event? 

c. Consider the event that only white balls are 
drawn. List all outcomes in the event. What is 
the probability of the event? 


Refer to Example 9.1.3. Suppose you are appear- 
ing on a game show with a prize behind one of 
five closed doors: A, B, C, D, and E. If you pick 
the correct door, you win the prize. You pick 
door A. The game show host then opens one of 
the other doors and reveals that there is no prize 
behind it. Then the host gives you the option of 
staying with your original choice of door A or 
switching to one of the other doors that is still 
closed. 
a. If you stick with your original choice, what is 
the probability that you will win the prize? 
b. If you switch to another door, what is the prob- 
ability that you will win the prize? 


a. How many positive two-digit integers are mul- 
tiples of 3? 

b. What is the probability that a randomly chosen 
positive two-digit integer is a multiple of 3? 

c. What is the probability that a randomly chosen 
positive two-digit integer is a multiple of 4? 


a. How many positive three-digit integers are 
multiples of 6? 


b. What is the probability that a randomly chosen 
positive three-digit integer is a multiple of 6? 

c. What is the probability that a randomly chosen 
positive three-digit integer is a multiple of 7? 


23. Suppose A[1], A[2], A[3],..., A[m] is a one- 


dimensional array and n > 50. 
a. How many elements are in the array? 
b. How many elements are in the subarray 


A[4], A[5], . . . , A[39]? 


c. If3 <m S n, what is the probability that 
a randomly chosen array element is in the 
subarray 


A[3], AIA], . . . , Alm]? 


d. What is the probability that a randomly chosen 
array element is in the subarray shown below 
ifn = 39? 


Al\n/2)], Alln/2) + 1),..., Alm] 


24. Suppose A[1], A[2],..., A[m] is a one-dimensional 


array and n = 2. Consider the subarray 
A[1], A[2],..., Alln/2)]. 


a. How many elements are in the subarray (i) if n 
is even? and (ii) if n is odd? 

b. What is the probability that a randomly chosen 
array element is in the subarray (i) if n is even? 
and (ii) if n is odd? 


9.2 POSSIBILITY TREES AND THE MULTIPLICATION RULE 


25. 


26. 


27. 


28. 


29. 


30. 
31. 


32. 


* 33. 
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Suppose A[1], A[2],..., Alm] is a one-dimensional 
array and n = 2. Consider the subarray 


Afln/2)], Alln/2)+1],..., Aln]. 


a. How many elements are in the subarray (i) if n 
is even? and (ii) if n is odd? 

b. What is the probability that a randomly chosen 
array element is in the subarray (i) if n is even? 
and (ii) if n is odd? 


What is the 27th element in the one-dimensional 
array A[42], A[43],..., A[100]? 


What is the 62nd element in the one-dimensional 
array B[29], B[30],..., B[100]? 


If the largest of 56 consecutive integers is 279, 
what is the smallest? 


If the largest of 87 consecutive integers is 326, 
what is the smallest? 


How many even integers are between | and 1,001? 


How many integers that are multiples of 3 are 
between 1 and 1,001? 


A certain non-leap year has 365 days, and January 
1 occurs on a Monday. 

a. How many Sundays are in the year? 

b. How many Mondays are in the year? 


Prove Theorem 9.1.1. (Let m be any integer and 
prove the theorem by mathematical induction on n.) 


ANSWERS FOR TEST YOURSELF 


1. the set of all outcomes of the random process or experiment 
4.n—m+1 


2. a subset of the sample space 3. number of outcomes in the 
event; total number of outcomes 


Possibility Trees and the Multiplication Rule 


Don’t believe anything unless you have thought it through for yourself. 
—Anna Pell Wheeler, 1883-1966 


A tree structure is a useful tool for keeping systematic track of all possibilities in situations 
in which events happen in order. The following example shows how to use such a structure 
to count the number of different outcomes of a tournament. 


Example 9.2.1 


Possibilities for Tournament Play 


Teams A and B are to play each other repeatedly until one wins two games in a row or a 
total of three games. One way in which this tournament can be played is for A to win the 
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first game, B to win the second, and A to win the third and fourth games. Denote this by 
writing A-B—A-—A. 


a. How many ways can the tournament be played? 


b. Assuming that all the ways of playing the tournament are equally likely, what is the 
probability that five games are needed to determine the tournament winner? 


Solution 


a. The possible ways for the tournament to be played are represented by the distinct 
paths from “root” (the start) to “leaf” (a terminal point) in the tree shown sideways in 
Figure 9.2.1. The label on each branching point indicates the winner of the game. The 
notations in parentheses indicate the winner of the tournament. 


Winner of Winner of Winner of Winner of Winner of 
game 1 game 2 game 3 game 4 game 5 
A (A wins) A (A wins) 


A (A wins) 


B (Bwins) B (B wins) 


Start A (A wins) A (A wins) 


B (B wins) 


B (B wins) B (B wins) 


FIGURE 9.2.1 The Outcomes of a Tournament 


The fact that there are ten paths from the root of the tree to its leaves shows that 
there are ten possible ways for the tournament to be played. They are (moving from 
the top down): A-A, A-B—A—A, A-B—-A-B-—A, A-B—-A-B-B, A-B-B, B—A—A, B-A 
B-A-A, B—-A-B—A-B, B—-A-B-B, and B-B. In five cases A wins, and in the other five 
B wins. The least number of games that must be played to determine a winner is two, 
and the most that will need to be played is five. 


b. Since all the possible ways of playing the tournament listed in part (a) are assumed to 
be equally likely, and the listing shows that five games are needed in four different 
cases (A-B—A—B—A, A-B—A-B-B, B-A—B-—A-B, and B—A—B-—A-—A), the probability 
that five games are needed is 4/10 = 2/5 = 40%. a 


The Multiplication Rule 


Consider the following example. Suppose a computer installation has four input/output 
units (A, B, C, and D) and three central processing units (X, Y, and Z). Any input/output 
unit can be paired with any central processing unit. How many ways are there to pair an 
input/output unit with a central processing unit? 

To answer this question, imagine the pairing of the two types of units as a two-step 
operation: 


Step 1: Choose the input/output unit. 


Step 2: Choose the central processing unit. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.2 POSSIBILITY TREES AND THE MULTIPLICATION RULE 575 


The possible outcomes of this operation are illustrated in the possibility tree of 
Figure 9.2.2. 


Step 1: Choose the Step 2: Choose the 
input/output unit. central processing unit. 


A 


Start 


FIGURE 9.2.2 Pairing Objects Using a Possibility Tree 


The topmost path from “root” to “leaf” indicates that input/output unit A is to be paired 
with central processing unit X. The next lower branch indicates that input/output unit A is 
to be paired with central processing unit Y. And so forth. 

Thus the total number of ways to pair the two types of units is the same as the number 
of branches of the tree, which is 


3434343 =4-3 = 12. 


The idea behind this example can be used to prove the following rule. A formal proof uses 
mathematical induction and is left to the exercises. 


Theorem 9.2.1 The Multiplication Rule 


If an operation consists of k steps and 


the first step can be performed in nı ways, 


the second step can be performed in ng ways [regardless of how the first 
step was performed], 


the kth step can be performed in ną ways [regardless of how the preceding 
steps were performed], 


then the entire operation can be performed in nnz °° + ng ways. 


To apply the multiplication rule, think of the objects you are trying to count as the 
output of a multistep operation. The possible ways to perform a step may depend on how 
preceding steps were performed, but the number of ways to perform each step must be 
constant regardless of the action taken in prior steps. 


Counting Personal Identification Numbers (PINs) 


A certain personal identification number (PIN) is required to be a sequence of any four 
symbols chosen from the 26 uppercase letters in the Roman alphabet and the ten digits. 


a. How many different PINs are possible if repetition of symbols is allowed? 
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b. How many different PINs are possible if repetition of symbols is not allowed? 

c. What is the probability that a PIN does not have a repeated symbol assuming that all 
PINs are equally likely? 

Solution 


a. Some possible PINs are RCAE, 3387, B92B, and so forth. You can think of forming 
a PIN as a four-step operation where each step involves placing a symbol into one of 
four positions, as shown below. 


Pool of available 
symbols: A, B, C, D, E, F, G, 
H, I, J, K, L, M,N, O, P, Q, R, 
S, T, U, V, W, X, Y, Z, 

0, 1,:2,3;:4, 5; 6,.7;.8,'9 


Step 1: Choose a symbol to place in position 1. 
Step 2: Choose a symbol to place in position 2. 
Step 3: Choose a symbol to place in position 3. 
Step 4: Choose a symbol to place in position 4. 


There is a fixed number of ways to perform each step, namely 36, regardless of how 
preceding steps were performed. And so, by the multiplication rule, there are 
36-36-36-36 = 36° = 1,679,616 PINs in all. 


b. Again think of forming a PIN as a four-step operation: Choose the first symbol, then 
the second, then the third, and then the fourth. There are 36 ways to choose the first 
symbol, 35 ways to choose the second (since the first symbol cannot be used again), 
34 ways to choose the third (since the first two symbols cannot be reused), and 33 
ways to choose the fourth (since the first three symbols cannot be reused). Thus, the 
multiplication rule can be applied to conclude that there are 36:35-34:33 = 1,413,720 
different PINs with no repeated symbol. 


c. By part (b) there are 1,413,720 PINs with no repeated symbol, and by part (a) there are 


1,679,616 PINs in all. Thus the probability that a PIN chosen at random contains no 


repeated symbol is a = 0.8417. In other words, approximately 84% of PINs 


have no repeated symbol. E 


Another way to look at the PINs of Example 9.2.2 is as ordered 4-tuples. For example, 
you can think of the PIN M2ZM as the ordered 4-tuple (M, 2, Z, M). Therefore, the total 
number of PINs is the same as the total number of ordered 4-tuples whose elements are 
either letters or digits. One of the most important uses of the multiplication rule is to derive 
a general formula for the number of elements in any Cartesian product of a finite number 
of finite sets. In Example 9.2.3, this is done for a Cartesian product of four sets. 


SCWGEY EA The Number of Elements in a Cartesian Product 


Suppose A), A, A3, and A; are sets with 1), n2, nz, and n4 elements, respectively. Show that 
the set A; X A, X A3 X Ay has nyn2n3n4 elements. 
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Solution Each element in A; X A, X A3 X A; is an ordered 4-tuple of the form (ay, a>, 
a3, a4), Where a, E Ay, a E Ap, a3 E A}, and a4 E A4. Imagine the process of constructing 
these ordered tuples as a four-step operation: 


Step 1: Choose the first element of the 4-tuple. 
Step 2: Choose the second element of the 4-tuple. 
Step 3: Choose the third element of the 4-tuple. 
Step 4: Choose the fourth element of the 4-tuple. 


There are n, ways to perform step 1, n, ways to perform step 2, n3 ways to perform step 3, 
and n4 ways to perform step 4. Hence, by the multiplication rule, there are njn.n3n4 
ways to perform the entire operation. Therefore, there are n,n .n3n,4 distinct 4-tuples in 
Ay X As X Ay X A4. E 


Any circuit with two input signals P and Q has an input/output table consisting of four rows 
corresponding to the four possible assignments of values to P and Q: 11, 10, 01, and 00. The 
next example shows that there are only 16 distinct ways in which such a circuit can function. 


Number of Input/Output Tables for a Circuit with Two Input Signals 


Consider the set of all circuits with two input signals P and Q. Each such circuit has a cor- 
responding input/output table, but, as shown in Section 2.4, two such input/output tables 
may be the same. How many distinct input/output tables are there for a circuit with input/ 
output signals P and Q? 


Solution Fix the order of the input values for P and Q. Then two input/output tables are 
distinct if their output values differ in at least one row. For example, the input/output tables 
shown below are distinct, because their output values differ in the first row. 


P Q | Output P Q | Output 
1 1 1 1 1 0 
1 0 0 1 0 0 
(0) 1 1 0 1 1 
0 0 0 0 0 0 


For a fixed ordering of input values, you can obtain a complete input/output table by 
filling in the entries in the output column. You can think of this as a four-step operation: 
Step 1: Fill in the output value for the first row. 

Step 2: Fill in the output value for the second row. 
Step 3: Fill in the output value for the third row. 
Step 4: Fill in the output value for the fourth row. 
Each step can be performed in exactly two ways: either a | or a 0 can be filled in. Hence, 
by the multiplication rule, there are 
2:2:2:2 = 16 


ways to perform the entire operation. It follows that there are 2f = 16 distinct input/output 
tables for a circuit with two input signals P and Q. This means that such a circuit can func- 
tion in only 16 distinct ways. E 
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Observe that in Example 9.2.2, the set of all PINs of length 4 is the same as the set of all 
strings of length 4 over the set 


S = {x|x is a letter of the uppercase Roman alphabet or x is a digit}. 


Also observe that another way to think of Example 9.2.4 is to realize that there are as many 
input/output tables for a circuit with two input signals as there are bit strings of length 4 
(written vertically) that can be used to fill in the output values. As another example, here is 
a listing of all bit strings of length 3: 


000, 001, O10, 100, 011, 101, 110, 1. 


Seu eee Counting the Number of Iterations of a Nested Loop 
Consider the following nested loop: 


for i := 1 to4 
for j := 1 to3 
[Statements in body of inner loop. 


None contain branching statements 
that lead out of the inner loop.] 


next j 
next i 


How many times will the inner loop be iterated when the algorithm is implemented 
and run? 


Solution The outer loop is iterated four times, and during each iteration of the outer 
loop, there are three iterations of the inner loop. Hence by the multiplication rule, the 
total number of iterations of the inner loop is 4:3 = 12. This is illustrated by the trace 
table below. 


When the Multiplication Rule Is Difficult or Impossible to Apply 
Consider the following problem: 


Three officers—a president, a treasurer, and a secretary—are to be chosen from 
among four people: Ann, Bob, Cyd, and Dan. Suppose that, for various reasons, 
Ann cannot be president and either Cyd or Dan must be secretary. How many 
ways can the officers be chosen? 


It is natural to try to solve this problem using the multiplication rule. A person might an- 
swer as follows: 


There are three choices for president (all except Ann), three choices for treasurer 
(all except the one chosen as president), and two choices for secretary (Cyd or 
Dan). Therefore, by the multiplication rule, there are 3-3-2 = 18 choices in all. 


Unfortunately, this analysis is incorrect. The number of ways to choose the secretary var- 
ies depending on who is chosen for president and treasurer. For instance, if Bob is chosen 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.2 POSSIBILITY TREES AND THE MULTIPLICATION RULE 579 


for president and Ann for treasurer, then there are two choices for secretary: Cyd and Dan. 
But if Bob is chosen for president and Cyd for treasurer, then there is just one choice for 
secretary: Dan. The clearest way to see all the possible choices is to construct the possibil- 
ity tree, as is shown in Figure 9.2.3. 


Step 1: Choose Step 2: Choose Step 3: Choose 
the president. the treasurer. the secretary. 


Cyd 


Ann 


Dan 
Dan 
Cyd 
Start Dan 


Dan 


Cyd 


Cyd 
FIGURE 9.2.3 


From the tree it is easy to see that there are only eight ways to choose a president, treasurer, 
and secretary so as to satisfy the given conditions. 

Another way to solve this problem is somewhat surprising. It turns out that the steps can 
be reordered in a slightly different way so that the number of ways to perform each step is 
constant regardless of the way previous steps were performed. 


A More Subtle Use of the Multiplication Rule 


Reorder the steps for choosing the officers in the previous example so that the total number 
of ways to choose officers can be computed using the multiplication rule. 


Solution 


Step 1: Choose Step 2: Choose Step 3: Choose 
the secretary. the president. the treasurer. 


Step 1: Choose the secretary. 
Step 2: Choose the president. 
Step 3: Choose the treasurer. 


There are exactly two ways to perform step 1 
(either Cyd or Dan may be chosen), two ways 
to perform step 2 (neither Ann nor the person 
chosen in step 1 may be chosen but either of the Start 
other two may), and two ways to perform step 3 
(either of the two people not chosen as secretary 
or president may be chosen as treasurer). Thus, 
by the multiplication rule, the total number of 
ways to choose officers is 2-2-2 = 8. A possi- 
bility tree illustrating this sequence of choices is 
shown in Figure 9.2.4. Note how balanced this 
tree is compared with the one in Figure 9.2.3. FIGURE 9.2.4 
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Permutations 


A permutation of a set of objects is an ordering of the objects in a row. For example, the 
set of elements a, b, and c has six permutations. 


abc acb cba bac bca cab 


In general, given a set of n objects, how many permutations does the set have? Imagine 
forming a permutation as an n-step operation: 


Step 1: Choose an element to write first. 


Step 2: Choose an element to write second. 


Step n: Choose an element to write nth. 


Any element of the set can be chosen in step 1, so there are n ways to perform step 1. 
Any element except that chosen in step 1 can be chosen in step 2, so there are n — 1 ways to 
perform step 2. In general, the number of ways to perform each successive step is one less 
than the number of ways to perform the preceding step. At the point when the nth element 
is chosen, there is only one element left, so there is only one way to perform step n. Hence, 
by the multiplication rule, there are 


n(n—1)\(n—2) =: 2-1=n! 


ways to perform the entire operation. In other words, there are n! permutations of a set of 
n elements. This reasoning is summarized in the following theorem. A formal proof uses 
mathematical induction and is left as an exercise. 


Theorem 9.2.2 


For any integer n with n = 1, the number of permutations of a set with n elements 
is n!. 


sel wa Permutations of the Letters in a Word 


a. How many ways can the letters in the word COMPUTER be arranged in a row? 


b. How many ways can the letters in the word COMPUTER be arranged if the letters CO 
must remain next to each other (in order) as a unit? 


c. If letters of the word COMPUTER are randomly arranged in a row, what is the prob- 
ability that the letters CO remain next to each other (in order) as a unit? 


Solution 


a. All eight letters in the word COMPUTER are distinct, so the number of ways in which 
you can arrange the letters equals the number of permutations of a set of eight ele- 
ments. This equals 8! = 40,320. 


b. If the letter group CO is treated as a unit, then there are effectively only seven objects 
that are to be arranged in a row. 


CO} [M| P||U|IT||E||R 
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Hence there are as many ways to write the letters as there are permutations of a set of 
seven elements, namely, 7! = 5,040. 


c. When the letters are arranged randomly in a row, the total number of arrangements is 
40,320 by part (a), and the number of arrangements with the letters CO next to each 
other (in order) as a unit is 5,040. Thus the probability is 


5,040 1 
= -= 12.5%. E 
40,320 8 aM 


Permutations of Objects Around a Circle 


At a meeting of diplomats, the six participants are to be seated around a circular table. 
Since the table has no ends to confer particular status, it doesn’t matter who sits in which 
chair. But it does matter how the diplomats are seated relative to each other. In other words, 
two seatings are considered the same if one is a rotation of the other. How many different 
ways can the diplomats be seated? 


Solution Call the diplomats by the letters A, B, C, D, E, and F. Since only relative posi- 
tion matters, you can start with any diplomat (say, A), place that diplomat anywhere (say, 
in the top seat of the diagram shown in Figure 9.2.5), and then consider all arrangements 
of the other diplomats around that one. The five diplomats B through F can be arranged 
in the seats around diplomat A in all possible orders. So there are 5! = 120 ways to seat 
the group. 


Five other 
diplomats 
to be seated: 

B, C, D, E, F 


FIGURE 9.2.5 a 


Permutations of Selected Elements 


Given the set {a, b, c}, there are six ways to select two letters from the set and write them 
in order. 


ab ac ba bc ca cb 


Each such ordering of two elements of {a, b, c} is called a 2-permutation of {a, b, c}. 


Definition 


An r-permutation of a set of n elements is an ordered selection of r elements taken 
from the set of n elements. The number of r-permutations of a set of n elements is 
denoted P(n, r). 
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Theorem 9.2.3 


If n and r are integers and 1 = r <n, then the number of r-permutations of a set of 
n elements is given by the formula 


P(n, r) = n(n— 1)\(n—2)°::(1—r+1) first version 


or, equivalently, 


n 
BUP = second version. 
kd 


A formal proof of this theorem uses mathematical induction and is based on the multi- 
plication rule. The idea of the proof is the following. 

Suppose a set of n elements is given. Formation of an r-permutation can be thought of as 
an r-step process. Step 1 is to choose the element to be first. Since the set has n elements, 
there are n ways to perform step 1. Step 2 is to choose the element to be second. Since the 
element chosen in step 1 is no longer available, there are n— 1 ways to perform step 2. 
Step 3 is to choose the element to be third. Since neither of the two elements chosen in the 
first two steps is available, there are n — 2 choices for step 3. This process is repeated r 
times, as shown below. 


Pool of available 
elements: x), Xoe «+53 


n choices 


n — 1 choices 


n — 2 choices 
n — (r — 1) choices 


Position 1 Position 2 Position 3 tee Position r 


The number of ways to perform each successive step is one less than the number of ways 
to perform the preceding step. Step r is to choose the element to be rth. At the point just 
before step r is performed, r— 1 elements have already been chosen, and so there are 


n—(r—l)=n-r+1 


left to choose from. Hence there are n—r+ 1 ways to perform step r. It follows by the 
multiplication rule that the number of ways to form an r-permutation is 


P(n, r) = n(n—-1)(n—-2)-+-(n-r+1). 
Note that 
n! — na-in- n- r+ Danar 
a=! mnr 


= n(n— 1)\(n— 2): (n=r+1). 


Thus the formula can also be written as 


n! 


P(n, r) = u-A 
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The second version of the formula is easier to remember, but if you use it with a calcula- 
tor, don’t first compute n! and (n — r)! and then divide the first by the second. Because fac- 
torials become so large so fast, using this method can overload a calculator’s capacity for 
exact arithmetic even when n and r are quite small. For instance, if n = 15 and r = 2, then 


n! 15! 1,307,674,368,000 


(n—r)! 13! 6,227,020,800 


On the other hand, if you cancel (n— r)! = 13! from the numerator and denominator 
before multiplying out, you reduce the expression to the first version of the formula for 
P(n, r), which is much easier to compute: 


n! 15! 15-14-4131 
(n=! 13! BI 


In fact, many scientific calculators allow you to compute P(n, r) simply by entering the 
values of n and r and pressing a key or making a menu choice. Alternative notations for 
P(n, r) are „ P,, Py, and "P,. 


CT ewe” Evaluating r-Permutations 
a. Evaluate P(5, 2). 


b. How many 4-permutations are there of a set of seven objects? 


= 15-14 = 210. 


c. How many 5-permutations are there of a set of five objects? 


Solution 
5! 5-4-34 
. PS, 2) = = = 20 

a PODS Soy A 
b. The number of 4-permutations of a set of seven objects is 

7! 7:-6-5-4- 

P(7, 4) = = = 7:6°5:4 = 840. 
= (7-4)! Ww 
c. The number of 5-permutations of a set of five objects is 
5! 5! +5! 
P(S, 5) = 5! = 120. 


(5-5! Of 1 


Note that the definition of 0! as 1 makes this calculation come out as it should, for the 
number of 5-permutations of a set of five objects is certainly equal to the number of 
permutations of the set. a 


Permutations of Selected Letters of a Word 


a. How many different ways can three of the letters of the word BYTES be chosen and 
written in a row? 


b. How many different ways can this be done if the first letter must be B? 

Solution 

a. The answer equals the number of 3-permutations of a set of five elements. This equals 
5! 5:4:3-24 

(5-3)! wn 


P(3, 3) = = 5-4-3 = 60. 
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b. Since the first letter must be B, there are effectively only two letters to be chosen and 
placed in the other two positions. And since the B is used in the first position, there 
are four letters available to fill the remaining two positions. 


Pool of available 
letters: Y, T, E, S 


B 


Position 1 


Position 2 Position 3 


Hence the answer is the number of 2-permutations of a set of four elements, which is 
4! 4-3-21 
(4-2)! M 


P(4, 2) = =4-3 = 12. E 


In many applications of the mathematics of counting, it is necessary to be skillful in 
working algebraically with quantities of the form P(n, r). The next example shows a kind 
of problem that gives practice in developing such skill. 


Proving a Property of P(n, r) 


Prove that for every integer n = 2, 


P(n, 2) + P(n, 1) = n°. 


Solution Suppose n is any integer that is greater than or equal to 2. By Theorem 9.2.3, 


n! n(n — 1)\(n—2)! 


P(n,2) = (n—2)! = aD = n(n- 1) 
and 
! ; l 
Pon, D= = = re Si; 
Hence 
P(n,2)+P(n,1)= n(n Dtn=r ntn=n, 
which is what we needed to show. a 


TEST YOURSELF 


1. The multiplication rule says that if an opera- 4. Anr-permutation of a set of n elements 


tion can be performed in k steps and, for each i 
with 1 =i < k, the ith step can be performed in 
n; ways (regardless of how previous steps were 
performed), then the operation as a whole can be 
performed in 


2. A permutation of a set of elements is 


3. The number of permutations of a set of n elements 
equals 
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5. The number of r-permutations of a set of n ele- 
ments is denoted 


6. One formula for the number of r-permutations of 
a set of n elements is and another formula 
is 


EXERCISESET9.2 


9.2 POSSIBILITY TREES AND THE MULTIPLICATION RULE 


585 


In 1-4, use the fact that in baseball’s World Series, the 
first team to win four games wins the series. 


1. 


Suppose team A wins the first three games. How 
many ways can the World Series be completed? 
(Draw a tree.) 


. Suppose team A wins the first two games. How 


many ways can the World Series be completed? 
(Draw a tree.) 


. How many ways can a World Series be played if 


team A wins four games in a row? 


How many ways can a World Series be played if 
no team wins two games in a row? 


In a competition between players X and Y, the first 
player to win three games in a row or a total of 
four games wins. How many ways can the com- 
petition be played if X wins the first game and Y 
wins the second and third games? (Draw a tree.) 


One urn contains two black balls (labeled B, and 

B,) and one white ball. A second urn contains one 

black ball and two white balls (labeled W, and 

W,). Suppose the following experiment is per- 

formed: One of the two urns is chosen at random. 

Next a ball is randomly chosen from the urn. Then 

a second ball is chosen at random from the same 

urn without replacing the first ball. 

a. Construct the possibility tree showing all pos- 
sible outcomes of this experiment. 

b. What is the total number of outcomes of this 
experiment? 

c. What is the probability that two black balls are 
chosen? 

d. What is the probability that two balls of op- 
posite color are chosen? 


One urn contains one blue ball (labeled B,) and 
three red balls (labeled R}, R}, and R3). A second 
urn contains two red balls (R4 and R;) and two 
blue balls (B, and B3). An experiment is per- 
formed in which one of the two urns is chosen at 
random and then two balls are randomly chosen 
from it, one after the other without replacement. 
a. Construct the possibility tree showing all pos- 
sible outcomes of this experiment. 
b. What is the total number of outcomes of this 
experiment? 
c. What is the probability that two red balls are 
chosen? 


10. 


Tl. 


12. 


13. 


. A person buying a personal computer system is 


offered a choice of three models of the basic unit, 
two models of keyboard, and two models of print- 
er. How many distinct systems can be purchased? 


Suppose there are three roads from city A to city B 

and five roads from city B to city C. 

a. How many ways is it possible to travel from 
city A to city C via city B? 

b. How many different round-trip routes are there 
from city A to B to C to B and back to A? 

c. How many different routes are there from cit- 
ies A to B to C to B and back to A in which no 
road is traversed twice? 


Suppose there are three routes from North Point to 

Boulder Creek, two routes from Boulder Creek to 

Beaver Dam, two routes from Beaver Dam to Star 

Lake, and four routes directly from Boulder Creek 

to Star Lake. (Draw a sketch.) 

a. How many routes from North Point to Star 
Lake pass through Beaver Dam? 

b. How many routes from North Point to Star 
Lake bypass Beaver Dam? 


a. A bit string is a finite sequence of 0’s and ?’s. 
How many bit strings have length 8? 

b. How many bit strings of length 8 begin with 
three 0’s? 

c. How many bit strings of length 8 begin and 
end with a 1? 


Hexadecimal numbers are made using the sixteen 

hexadecimal digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, 

C, D, E, F and are denoted using the subscript 16. 

For example, 9A2D,, and BC54 are hexadecimal 

numbers. 

a. How many hexadecimal numbers begin with 
one of the digits 3 through B, end with one of 
the digits 5 through F, and are 5 digits long? 

b. How many hexadecimal numbers begin with 
one of the digits 4 through D, end with one of 
the digits 2 through E, and are 6 digits long? 


A coin is tossed four times. Each time the result H 
for heads or T for tails is recorded. An outcome of 
HHTT means that heads were obtained on the first 
two tosses and tails on the second two. Assume 
that heads and tails are equally likely on each toss. 
a. How many distinct outcomes are possible? 
b. What is the probability that exactly two heads 
occur? 
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14. 


15. 


16. 


17. 


18. 
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c. What is the probability that exactly one head 
occurs? 


Suppose that in a certain state, all automobile 

license plates have four uppercase letters followed 

by three digits. 

a. How many different license plates are 
possible? 

b. How many license plates could begin with A 
and end in 0? 

c. How many license plates could begin with 
TGIF? 

d. How many license plates are possible in which 
all the letters and digits are distinct? 

e. How many license plates could begin with AB 
and have all letters and digits distinct? 


A combination lock requires three selections of 

numbers, each from | through 30. 

a. How many different combinations are 
possible? 

b. Suppose the locks are constructed in such a 
way that no number may be used twice. 
How many different combinations are 
possible? 


a. How many integers are there from 10 
through 99? 

b. How many odd integers are there from 10 
through 99? 

c. How many integers from 10 through 99 have 
distinct digits? 

d. How many odd integers from 10 through 99 
have distinct digits? 

e. What is the probability that a randomly chosen 
two-digit integer has distinct digits? has dis- 
tinct digits and is odd? 


a. How many integers are there from 1000 
through 9999? 

b. How many odd integers are there from 1000 
through 9999? 

c. How many integers from 1000 through 9999 
have distinct digits? 

d. How many odd integers from 1000 through 
9999 have distinct digits? 

e. What is the probability that a randomly chosen 
four-digit integer has distinct digits? has dis- 
tinct digits and is odd? 


The following diagram shows the keypad for an 
automatic teller machine. As you can see, the 
same sequence of keys represents a variety of 
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19. 


20. 


different PINs. For instance, 2133, AZDE, and 
BQ3F are all keyed in exactly the same way. 


1 2 3 
QZ ABC || | DEF 
4 5 6 
GHI JKL || | MNO 
7 8 9 
PRS || | TUV | |Wxy 

0 | 


a. How many different PINs are represented by 
the same sequence of keys as 2133? 

b. How many different PINs are represented by 
the same sequence of keys as 5031? 

c. How many different numeric sequences on the 
machine contain no repeated digit? 


Three officers—a president, a treasurer, and a sec- 
retary—are to be chosen from among four people: 
Ann, Bob, Cyd, and Dan. Suppose that Bob is not 
qualified to be treasurer and Cyd’s other commit- 
ments make it impossible for her to be secretary. 
How many ways can the officers be chosen? 

Can the multiplication rule be used to solve this 
problem? 


Modify Example 9.2.4 by supposing that a 

PIN must not begin with any of the letters A-M 
and must end with a digit. Continue to assume 
that no symbol may be used more than once 
and that the total number of PINs is to be 
determined. 

a. Find the error in the following “solution.” 


“Constructing a PIN is a four-step process. 
Step 1: Choose the left-most symbol. 

Step 2: Choose the second symbol from the left. 
Step 3: Choose the third symbol from the left. 


Step 4: Choose the right-most symbol. 
Because none of the thirteen letters from A 
through M may be chosen in step 1, there are 
36 — 13 = 23 ways to perform step 1. There 
are 35 ways to perform step 2 and 34 ways to 
perform step 3 because previously used sym- 
bols may not be used. Since the symbol chosen 
in step 4 must be a previously unused digit, 


there are 10 — 3 = 7 ways to perform step 4. 
Thus there are 23 -35-34-7 = 191,590 differ- 
ent PINs that satisfy the given conditions.” 

b. Reorder steps 1—4 in part (a) as follows: 


Step 1: Choose the right-most symbol. 
Step 2: Choose the left-most symbol. 
Step 3: Choose the second symbol from 
the left. 
Step 4: Choose the third symbol from the left. 


Use the multiplication rule to find the number 
of PINs that satisfy the given conditions. 


H 21. Suppose A is a set with m elements and B is a set 

with n elements. 

a. How many relations are there from A to B? 
Explain. 

b. How many functions are there from A to B? 
Explain. 

c. What fraction of the relations from A to B are 
functions? 


22. a. How many functions are there from a set with 

three elements to a set with four elements? 

b. How many functions are there from a set with 
five elements to a set with two elements? 

c. How many functions are there from a set with 
m elements to a set with n elements, where m 


and n are positive integers? 


23. In Section 2.5 we showed how integers can be 
represented by strings of 0’s and 1’s inside a 
digital computer. In fact, through various cod- 
ing schemes, strings of 0’s and l’s can be used 
to represent all kinds of symbols. One com- 
monly used code is the Extended Binary-Coded 
Decimal Interchange Code (EBCDIC) in which 
each symbol has an 8-bit representation. How 
many distinct symbols can be represented by 
this code? 

In each of 24-28, determine how many times the inner- 

most loop will be iterated when the algorithm segment is 

implemented and run. (Assume that m, n, p, a, b, c, and d 

are all positive integers.) 


24. fori := 1 to 30 
for j:= 1 to 15 


[Statements in body of inner loop. 
None contain branching statements that 
lead outside the loop.] 


next j 


next i 
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25. for j := 1 tom 
fork := 1 ton 


[Statements in body of inner loop. 
None contain branching statements that 
lead outside the loop.] 


next k 
next j 


26. fori := 1 tom 
for j:= 1 ton 
for k:= | top 


[Statements in body of inner loop. 
None contain branching statements 
that lead outside the loop.] 
next k 
next j 
next i 


27. for i := 5 to 50 
for j := 10 to 20 


[Statements in body of inner loop. 
None contain branching statements that 
lead outside the loop.] 


next j 
next i 


28. Assume a £ b and c =£ d. 


for i := a to b 
for j:= ctod 
[Statements in body of inner loop. 


None contain branching statements that 
lead outside the loop.] 


next j 


next i 


H* 29. Consider the numbers 1 through 99,999 in their 
ordinary decimal representations. How many con- 


tain exactly one of each of the digits 2, 3, 4, and 5? 


*30. Letn = ph p% ss - phn where Pj, Po,--- , Pm are 

distinct prime numbers and ky, k2, . . . , km are 

positive integers. How many ways can n be written 

as a product of two positive integers that have no 

common factors, assuming the following? 

a. Order matters (that is, 8-15 and 15-8 are 
regarded as different). 

b. Order does not matter (that is, 8-15 and 15-8 


are regarded as the same). 
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a. Ifp is a prime number and a is a positive inte- 
ger, how many distinct positive divisors does 
p° have? 

b. Ifp and q are distinct prime numbers and a 
and b are positive integers, how many distinct 
positive divisors does pg have? 

c. Ifp, q, and rare distinct prime numbers and 
a, b, and c are positive integers, how many 
distinct positive divisors does p“gq’r* have? 

d. If p), po,..., Pm are distinct prime num- 
bers and aj, a2, . . . , Am are positive integers, 
how many distinct positive divisors does 
PiP? Pw’ have? 

e. What is the smallest positive integer with 
exactly 12 divisors? 


a. How many ways can the letters of the word 
ALGORITHM be arranged in a row? 

b. How many ways can the letters of the word 
ALGORITHM be arranged in a row if A and L 
must remain together (in order) as a unit? 

c. How many ways can the letters of the word 
ALGORITHM be arranged in a row if the let- 
ters GOR must remain together (in order) as 
a unit? 


Six people attend the theater together and sit in a 

row with exactly six seats. 

a. How many ways can they be seated together in 
the row? 

b. Suppose one of the six is a doctor who must sit 
on the aisle in case she is paged. How many 
ways can the people be seated together in the 
row with the doctor in an aisle seat? 

c. Suppose the six people consist of three mar- 
ried couples and each couple wants to sit 
together with the older partner on the left. How 
many ways can the six be seated together in 
the row? 


Five people are to be seated around a circular 
table. Two seatings are considered the same if one 
is a rotation of the other. How many different seat- 
ings are possible? 


Write all the 2-permutations of {W, X, Y, Z}. 
Write all the 3-permutations of {s, t, u, v}. 


Evaluate the following quantities. 


a. P(6,4) b. P(6,6) c. P(6,3) d. P(6,1) 


a. How many 3-permutations are there of a set of 
five objects? 
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39. 


40. 


41. 


42. 


43. 


44. 


H 45. 


*46. 


47. 


b. How many 2-permutations are there of a set of 
eight objects? 


a. How many ways can three of the letters of the 
word ALGORITHM be selected and written in 
arow? 

b. How many ways can six of the letters of the 
word ALGORITHM be selected and written in 
arow? 

c. How many ways can six of the letters of the 
word ALGORITHM be selected and written in 
a row if the first letter must be A? 

d. How many ways can six of the letters of the 
word ALGORITHM be selected and written in 
a row if the first two letters must be OR? 


Prove that for every integer n = 2, 
P(in+1,3) = nn. 


Prove that for every integer n = 2, 
P(n+1,2)— P(n, 2) = 2P(n, 1). 
Prove that for every integer n = 3, 
P(n+ 1,3)— P(n, 3) = 3P(n, 2). 
Prove that for every integer n = 2, 
P(n, n) = P(n, n— 1). 
Prove Theorem 9.2.1 by mathematical induction. 
Prove Theorem 9.2.2 by mathematical induction. 
Prove Theorem 9.2.3 by mathematical induction. 


A permutation on a set can be regarded as a 
function from the set to itself. For instance, one 
permutation of {1, 2, 3, 4} is 2341. It can be iden- 
tified with the function that sends each position 
number to the number occupying that position. 
Since position 1 is occupied by 2, 1 is sent to 2 or 
1 — 2; since position 2 is occupied by 3, 2 is sent 
to 3 or 2 — 3; and so forth. The entire permuta- 
tion can be written using arrows as follows: 


1 2 3 A 
t 4 4d 
2 3 4 4 


a. Use arrows to write each of the six permuta- 
tions of {1, 2, 3}. 

b. Use arrows to write each of the permutations 
of {1, 2, 3, 4} that keep 2 and 4 fixed. 

c. Which permutations of {1, 2, 3} keep no ele- 
ments fixed? 

d. Use arrows to write all permutations of 
{1, 2, 3, 4} that keep no elements fixed. 
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ANSWERS FOR TEST YOURSELF 


1. njny**+n ways 2. an ordering of the elements of the setinarow 3.n! 4. an ordered selection of r of the elements of the 


set 5.P(n,r) 6.n(n—1)(n—2):-:(n-r+1); 4 
(n—r)! 


EEJ Counting Elements of Disjoint Sets: 
The Addition Rule 


The whole of science is nothing more than a refinement of everyday thinking. 
—Albert Einstein, 1879-1955 


In the last section we discussed counting problems that can be solved using possibility trees. 
In this section we look at counting problems that can be solved by counting the number of 
elements in the union of two sets, the difference of two sets, or the intersection of two sets. 

The basic rule underlying the calculation of the number of elements in a union or dif- 
ference or intersection is the addition rule. This rule states that the number of elements in 
a union of mutually disjoint finite sets equals the sum of the number of elements in each 
of the component sets. 


Theorem 9.3.1 The Addition Rule 


Suppose a finite set A equals the union of k distinct mutually disjoint subsets Aj, 
A>, pees Ax. Then 


N(A) = N(A,) + M(A2) +: -+ N(Ap). 


A formal proof of this theorem uses mathematical induction and is left to the exercises. 


SecA Counting the Number of Integers Divisible by 5 
How many three-digit integers (integers from 100 to 999 inclusive) are divisible by 5? 


Solution One solution to this problem was discussed in Example 9.1.4. Another approach 
uses the addition rule. Integers that are divisible by 5 end either in 5 or in 0. Thus the set of 
all three-digit integers that are divisible by 5 can be split into two mutually disjoint subsets 
A, and A, as shown in Figure 9.3.1. 


Three-Digit Integers That Are Divisible by 5 


A, UA, = the set of all three-digit integers 
three-digit integers three-digit integers that are divisible by 5 
that end in 0 that end in 5 A, NA, = 


Ay 


> 
ne) 


FIGURE 9.3.1 


Now there are as many three-digit integers that end in 0 as there are possible choices 
for the left-most and middle digits (because the right-most digit must be a 0). As illustrated 
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below, there are nine choices for the left-most digit (the digits 1 through 9) and ten choices 
for the middle digit (the digits 0 through 9). Hence M(A,) = 9-10 = 90. 


t | 


9 choices 10 choices number ends in 0 
1, 2,354.5, 6, 7,89 0, 1,2,3,4, 5,6, 7,8, 9 
Similar reasoning shows that there are as many three-digit integers that end in 5 as 
there are possible choices for the left-most and middle digits, which are the same as for the 
integers that end in 0. Hence, N(Az) = 90. So 


the number of 


three-digit integers = N(A,) + N(A2) = 90+ 90 = 180. E 
that are divisible by 5 


The Difference Rule 


An important consequence of the addition rule is the fact that if the number of elements in 
a set A and the number in a subset B of A are both known, then the number of elements that 
are in A and not in B can be computed. 


Theorem 9.3.2 The Difference Rule 
If A is a finite set and B is a subset of A, then 


N(A — B) = N(A) — N(B). 


The difference rule is illustrated in Figure 9.3.2. 


A (n elements) 


B (k elements) A-B (n-— k elements) 


FIGURE 9.3.2 The Difference Rule 


The difference rule holds for the following reason: If B is a subset of A, then the two 
sets B and A — B have no elements in common and B U (A — B) = A. Hence, by the addi- 
tion rule, 


N(B) + N(A — B) = NA). 
Subtracting N(B) from both sides gives the equation 
N(A — B) = N(A) — N(B). 


Counting PINs with Repeated Symbols 


Consider again the PINs discussed in Example 9.2.2. These are made from exactly four 
symbols chosen from the 26 uppercase letters of the Roman alphabet and the ten digits. 
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Example 9.2.2 showed that there are 1,679,616 PINs with repetition allowed and 265,896 
PINs with no repeated symbol. 


a. How many PINs contain at least one repeated symbol? 

b. If all PINs are equally likely, what is the probability that a randomly chosen PIN con- 
tains at least one repeated symbol? 

Solution 


a. Let S be the set of all the PINs with repetition allowed, and let A be the set of PINs 
with no repeated symbol. Then S — A is the set of PINs with at least one repeated sym- 
bol, and, by the difference rule, 


NMS —A) = NS) — NA) 
= 1,679,616 — 1,413,720 
= 265,896. 


Hence, there are 265,896 PINs that contain at least one repeated symbol. 


b. Solution 1: Because there are 1,679,616 PINs in all and 265,896 of these contain at 
least one repeated symbol, by the equally likely probability formula, the probability 
that a randomly chosen PIN contains a repeated symbol is 


265,896 
1,679,616 
Solution 2: P(A) is the probability that a randomly chosen PIN has no repeated sym- 


bol, and so P(S — A) is the probability that a randomly chosen PIN has at least one 
repeated symbol. Then 


= 0.158 = 15.8%. 


N(S— A) 
P(S—A) = NG) by definition of probability in the equally likely case 
N(S) — N(A) 
osama —. by the difference rule 
NS) 
NS) NA) 
= a by the laws of fractions 
MS) NS) 
= 1— P(A) by definition of probability in the equally likely case 
= 1—-0.842 by Example 9.2.2 
= 0.158 = 15.8%. a 


Solution 2 illustrates a general property of probabilities: that the probability of the 
complement of an event is obtained by subtracting the probability of the event from the 
number 1. In Section 9.8 we derive this formula from the axioms for probability. 


Formula for the Probability of the Complement of an Event 
If S is a finite sample space and A is an event in S, then 


P(A‘) = 1—P(A), 


where A° = S—A, the complement of A in S. 
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Passwords with 3-5 Letters 


A certain computer access password consists of 3 through 5 uppercase letters chosen from 
the 26 letters in the Roman alphabet, with repetitions allowed. 


a. How many different passwords are possible? 
b. How many different passwords have no repeated letter? 
c. How many different passwords have at least one repeated letter? 


d. If all passwords are equally likely, what is the probability that a randomly chosen 
password has at least one repeated letter? 


Solution 


a. The set of all passwords can be partitioned into three subsets consisting of passwords 
with lengths 3, 4, and 5, as shown in Figure 9.3.3. 


Set of All Passwords of Lengths 3, 4, or 5 


passwords passwords passwords 
of length 3 of length 4 of length 5 
FIGURE 9.3.3 


By the addition rule, the total number of passwords equals the number with length 3, 
plus the number with length 4, plus the number with length 5. The multiplication rule 
can be used to compute the number of passwords of each length. Thus the 


number of passwords with length 3 = 26° because forming such a password can 
be thought of as a three-step process 
with 26 ways to perform each step 


number of passwords with length 4 = 26* because forming such a password can 
be thought of as a four-step process 
with 26 ways to perform each step 


number of passwords with length 5 = 26° because forming such a password can 
be thought of as a five-step process with 
26 ways to perform each step. 


Hence the total number of passwords is 


26° + 264 + 26° = 12,355,928. 


b. Constructing a password with length 3 and no repeated letter is a three-step process 
with 26 choices for step 1, 25 choices for step 2, and 24 choices for step 3. Thus there 
are 26-25-24 passwords with length three and no repeated letter. Similarly, there are 
26:-25-+24-23 passwords with length 4 and no repeated letter and 26-25-24-23-22 
passwords with length 5 and no repeated letter. Hence the total number of passwords 
with no repeated letter is 


26-25-24 + 26-25-24-23 + 26-25-24-23-22 = 8,268,000. 
c. By part (a) the total number of passwords is 12,355,928, and by part (b) 8,268,000 of 


these passwords do not have a repeated letter. Thus, by difference rule, the number of 
passwords with at least one repeated letter is 4,087,928. 
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d. Given the assumption that all passwords are equally likely, the equally likely prob- 
ability formula can be used. So the probability that a randomly chosen password has at 
least one repeated letter is 


# of passwords with 
no repeated letter 4,087,928 
total # of passwords ~ 12,355,928 


= 33.1%. a 


Number of Python Identifiers of Eight or Fewer Characters 


In the computer language Python, identifiers must start with one of 53 symbols: either 
one of the 52 letters of the upper- and lower-case Roman alphabet or an underscore 
(_). The initial character may stand alone, or it may be followed by any number of 
additional characters chosen from a set of 63 symbols: the 53 symbols allowed as an 
initial character plus the ten digits. Certain keywords, however, such as and, if, print, 
and so forth, are set aside and may not be used as identifiers. In one implementation 
of Python there are 31 such reserved keywords, none of which has more than eight 
characters. How many Python identifiers are there that are less than or equal to eight 
characters in length? 


Solution The set of all Python identifiers with eight or fewer characters can be parti- 
tioned into eight subsets—identifiers of length 1, identifiers of length 2, and so on—as 
shown in Figure 9.3.4. The reserved words have various lengths (all less than or equal to 
8), so the set of reserved words is shown overlapping the various subsets. 


Set of Python Identifiers with Eight or Fewer Characters 


length | length | length | length | length | length | length) length 
1 2 3 4 5 6 7 8 


Reserved words 


FIGURE 9.3.4 


According to the rules for creating Python identifiers, there are 


53 potential identifiers of length 1 because there are 53 choices for the first char- 
acter 


53-63 potential identifiers of length 2 because the first character can be any one of 
53 symbols, and the second character can be 
any one of 63 symbols 

53-63 potential identifiers of length 3 because the first character can be any one of 


53 symbols, and each of the next two characters 
can be any one of 63 symbols 


53-63 potential identifiers of length 8 because the first character can be any one of 
53 symbols, and each of the next seven charac- 
ters can be any one of 63 symbols. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


594 CHAPTER9 COUNTING AND PROBABILITY 


Thus, by the addition rule, the number of potential Python identifiers with eight or fewer 
characters is 


53 + 53-63 + 53-63? + 53 -63° + 53-631 + 53 -635 + 53-636 + 53-63! 


63°—1 
= 53 (S) by Theorem 5.2.2 


= 212,133,167,002,880. 


Now 31 of these potential identifiers are reserved, so by the difference rule, the actual 
number of Python identifiers with eight or fewer characters is 


212,133,167,002,880 — 31 = 212,133,167,002,849. | 


Sek eee Internet Addresses 


In order to communicate effectively, each computer in a network needs a distinguish- 
ing name called an address. For the Internet this address is currently a 32-bit number 
called the Internet Protocol (IP) address (although 128-bit addresses are being phased 
in to accommodate the growth of the Internet). For technical reasons some computers 
have more than one address, whereas other sets of computers, which use the Internet only 
sporadically, may share a pool of addresses that are assigned on a temporary basis. Like 
telephone numbers, IP addresses are divided into parts: one, the network ID, specifies the 
local network to which a given computer belongs, and the other, the host ID, specifies the 
particular computer. 

An example of an IP address is 10001100 11000000 00100000 10001000, where the 
32 bits have been divided into four groups of 8 for easier reading. To make the reading even 
easier, IP addresses are normally written as “dotted decimals,” in which each group of 8 bits 
is converted into a decimal number between 0 and 255. For instance, the IP address above 
converts into 140.192.32.136. 

In order to accommodate the various sizes of the local networks connected through the 
Internet, the network IDs are divided into several classes, the most important of which are 
called A, B, and C. In every class, a host ID may not consist of either all 0’s or all I’s. 

Class A network IDs are used for very large local networks. The left-most bit is set to 
0, and the left-most 8 bits give the full network ID. The remaining 24 bits are used for 
individual host IDs. However, neither 00000000 nor 01111111 is allowed as a network ID 
for a class A IP address. 


[ Network ID [ Host ID 


Class B network IDs are used for medium to large local networks. The two left-most 
bits are set to 10, and the left-most 16 bits give the full network ID. The remaining 16 bits 
are used for individual host IDs. 


[ Network ID [ Host ID 


Class C network IDs are used for small local networks. The three left-most bits are set 
to 110, and the left-most 24 bits give the full network ID. The remaining 8 bits are used for 
individual host IDs. 
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[ Network ID [ Host ID 


a. Check that the dotted decimal form of 10001100 11000000 00100000 10001000 is 
140.192.32.136. 


b. How many Class B networks can there be? 


c. What is the dotted decimal form of the IP address for a computer in a Class B 
network? 


d. How many host IDs can there be for a Class B network? 


Solution 

a. 10001100 = 1-2’+ 1-2? + 1-2? = 128+8+4 = 140 
11000000 = 1-27 + 1:2 = 128 + 64 = 192 
00100000 = 1-2° = 32 
10001000 = 1-2’ + 1-2° = 128 +8 = 136 

b. The network ID for a Class B network consists of 16 bits and begins with 10. Because 


there are two choices for each of the remaining 14 positions (either 0 or 1), the total 
number of possible network IDs is ga or 16,384. 


c. The network ID part of a Class B IP address goes from 
10000000 00000000 to 10111111 11111111. 


As dotted decimals, these numbers range from 128.0 to 191.255 because 10000000, = 
12810, 00000000; = Ojo, 10111111; = 191), 11111111, = 255). Thus the dotted deci- 
mal form of the IP address of a computer in a Class B network is w.x.y.z, where 

128 = w = 191, 0 =£ x = 255, 0 = y = 255, and 0 = z = 255. However, y and z are 
not allowed both to be 0 or both to be 255 because host IDs may not consist of either 
all 0’s or all V’s. 


d. For a class B network, 16 bits are used for host IDs. Having two choices (either 0 or 1) for 
each of 16 positions gives a potential total of 2'°, or 65,536, host IDs. But because two of 
these are not allowed (all 0’s and all 1’s), the total number of host IDs is 65,534. E 


The Inclusion/Exclusion Rule 


The addition rule says how many elements are in a union of sets if the sets are mutually 
disjoint. Now consider the question of how to determine the number of elements in a union 
of sets when some of the sets overlap. For simplicity, begin by looking at a union of two 
sets A and B, as shown in Figure 9.3.5. 


AUB 


FIGURE 9.3.5 
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First observe that the number of elements in A U B varies according to the number 
of elements the two sets have in common. If A and B have no elements in common, then 
N(A U B) = MA)+ N(B). If A and B coincide, then N(A U B) = MA). Thus any general 
formula for N(A U B) must contain a reference to the number of elements the two sets have 
in common, M(A N B), as well as to N(A) and M(B). 

The simplest way to derive a formula for N(A U B) is to reason as follows: The number 
N(A) counts the elements that are in A and not in B as well as the elements that are in both 
A and B. Similarly, the number N(B) counts the elements that are in B and not in A as well 
as the elements that are in both A and B. Hence when the two numbers N(A) and N(B) are 
added, the elements that are in both A and B are counted twice. To get an accurate count of 
the elements in A U B, it is necessary to subtract the number of elements that are in both A 
and B. Because these are exactly the elements that are in A N B, 


Note An alternative MA U B) = N(A) + N(B) — MA N B). 

proof is outlined in exer- 

cise 46 at the end of this A similar analysis gives a formula for the number of elements in a union of three sets, 
section. as shown in Theorem 9.3.3. 


Theorem 9.3.3 The Inclusion/Exclusion Rule for Two or Three Sets 


If A, B, and C are any finite sets, then 
N(A U B) = N(A) + N(B)— MAN B) 
and 


N(A U B U C) = MA) N(B) FNO- MAN B)- NAN C) 
NBN ORNAMEN E): 


It can be shown using mathematical induction (see exercise 48 at the end of this sec- 
tion) that formulas analogous to those of Theorem 9.3.3 hold for unions of any finite 
number of sets. 


Counting Elements of a General Union 
a. How many integers from 1 through 1,000 are multiples of 3 or multiples of 5? 
b. How many integers from 1 through 1,000 are neither multiples of 3 nor multiples of 5? 
Solution 


a. Let A = the set of all integers from 1 through 1,000 that are multiples of 3. 
Let B = the set of all integers from 1 through 1,000 that are multiples of 5. 


Then 


A U B = the set of all integers from 1 through 1,000 that are multiples of 3 
or multiples of 5 


and 


A N B = the set of all integers from 1 through 1,000 that are multiples 
of both 3 and 5 
= the set of all integers from 1 through 1,000 that are multiples of 15. 


[Now calculate N(A), N(B), and N(A N B) and use the inclusion/exclusion rule to solve for 
N(A U B).] 
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Because every third integer from 3 through 999 is a multiple of 3, each can be rep- 
resented in the form 3k, for some integer k from | through 333. Hence there are 333 
multiples of 3 from 1 through 1,000, and so MA) = 333. 


123 4 5 6 ... 996 997 998 999 
t t t t 
3-1 3-2 3:332 3:333 


Similarly, each multiple of 5 from | through 1,000 has the form 5k, for some integer k 
from 1 through 200. 


123 45 678 9 10 ... 995 996 997 998 999 1,000 
t t t t 
5-1 5-2 5-199 5-200 


Thus there are 200 multiples of 5 from 1 through 1,000 and M(B) = 200. Finally, 
each multiple of 15 from 1 through 1,000 has the form 15k, for some integer k from 1 
through 66 (since 990 = 66: 15). 


L 2 eed sas BO ee 975- cee “990 22. 999° 1,000 


t t t 
15-1 15:2 15:65 15:66 


Hence there are 66 multiples of 15 from 1 through 1,000, and N(A N B) = 66. It fol- 
lows by the inclusion/exclusion rule that 
N(A U B) = N(A) + N(B) — NAN B) 
= 333 + 200 — 66 
= 467. 


Thus, 467 integers from 1 through 1,000 are multiples of 3 or multiples of 5. 


b. There are 1,000 integers from 1 through 1,000, and by part (a), 467 of these are multi- 
ples of 3 or multiples of 5. Thus, by the set difference rule, there are 1,000 — 467 = 533 
that are neither multiples of 3 nor multiples of 5. E 


Note that the solution to part (b) of Example 9.3.6 hid a use of De Morgan’s law. The 
number of elements that are neither in A nor in B is N(A° N B°), and by De Morgan’s 
law, A° N BS = (A UB). So N((A U B)‘) was calculated using the set difference rule: 
N((A U B)® = N(U)— N(A U B), where the universe U was the set of all integers from 
1 through 1,000. Exercises 37-39 at the end of this section explore this technique further. 


See Counting the Number of Elements in an Intersection 


A professor in a discrete mathematics class passes out a form asking students to check all 
the mathematics and computer science courses they have recently taken. She found that, 
out of a total of 50 students in the class, 


30 took precalculus; 16 took both precalculus and Python; 
18 took calculus; 8 took both calculus and Python; 
26 took Python; 47 took at least one of the three courses. 


9 took both precalculus and calculus; 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


598 CHAPTER9 COUNTING AND PROBABILITY 


Note that when we write “30 students took precalculus,” we mean that the total number 
of students who took precalculus is 30, and we allow for the possibility that some of these 
students may have taken one or both of the other courses. If we want to say that 30 students 
took precalculus only (and not either of the other courses), we will say so explicitly. 


a. How many students did not take any of the three courses? 

b. How many students took all three courses? 

c. How many students took precalculus and calculus but not Python? How many students 
took precalculus but neither calculus nor Python? 

Solution 


a. By the difference rule, the number of students who did not take any of the three 
courses equals the number in the class minus the number who took at least one course. 
Thus the number of students who did not take any of the three courses is 


50-47 = 3. 


b. Let 


P = the set of students who took precalculus 
C = the set of students who took calculus 


Y = the set of students who took Python. 
Then, by the inclusion/exclusion rule, 
MPUCUY)=MP)+MOC)+NY)- MPN C)- MPN Y) 
—-NMCNY)+MPNCN Y) 
Substituting known values, we get 
47 = 30+ 18+26—-—9-16-8+MPNCNY). 
Solving for N(P N C N Y) gives 
NPACAY=6. 


Hence there are six students who took all three courses. In general, if you know any 
seven of the eight terms in the inclusion/exclusion formula for three sets, you can solve 
for the eighth term. 


c. To answer the questions of part (c), look at the diagram in Figure 9.3.6. 


The numter of 
students who 
took all three 
courses 


fo 
l N 


students who 
took both 
precalculus and 
calculus 

but not Python 


FIGURE 9.3.6 


Since N(P N C N Y) = 6, put the number 6 inside the innermost region. Then work 
outward to find the numbers of students represented by the other regions of the diagram. 
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For example, since nine students took both precalculus and calculus and six took all three 
courses, 9 — 6 = 3 students took precalculus and calculus but not Python. Similarly, 
since 16 students took precalculus and Python and six took all three courses, 16 — 6 = 10 
students took precalculus and Python but not calculus. Now the total number of students 
who took precalculus is 30. Of these 30, three also took calculus but not Python, ten took 
Python but not calculus, and six took both calculus and Python. That leaves 11 students 
who took precalculus but neither of the other two courses. 

A similar analysis can be used to fill in the numbers for the other regions of the 


diagram. 


1. The addition rule says that if a finite set A equals 4. The inclusion/exclusion rule for two sets says that 
the union of k distinct mutually disjoint subsets if A and B are any finite sets, then 
Ahed Ag hgh i 5. The inclusion/exclusion rule for three sets says that 
2. The difference rule says that if A is a finite set and if A, B, and C are any finite sets, then 
B is a subset of A, then 
3. If Sis a finite sample space and A is an event in S, 
then the probability of A° equals — 
EXERCISE SET 9.3 
1. a. How many bit strings consist of from one 5. a. How many five-digit integers (integers from 
through four digits? (Strings of different 10,000 through 99,999) are divisible by 5? 
lengths are considered distinct. Thus 10 and b. What is the probability that a five-digit integer 
0010 are distinct strings.) chosen at random is divisible by 5? 
b. ne 7 2 Sere ane 6. Inacertain state, all license plates consist of from 
8a eig gits: four to six symbols chosen from the 26 uppercase 
2. a. How many strings of hexadecimal digits con- letters of the Roman alphabet together with the ten 


sist of from one through three digits? (Recall 
that hexadecimal numbers are constructed 
using the 16 digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, 
B, C, D, E, F) 

b. How many strings of hexadecimal digits con- 
sist of from two through five digits? 


a. How many integers from 1 through 999 do not 
have any repeated digits? 

b. How many integers from 1 through 999 have at 
least one repeated digit? 

c. What is the probability that an integer chosen 
at random from | through 999 has at least one 
repeated digit? 


How many arrangements in a row of no more than 
three letters can be formed using the letters of the 
word NETWORK (with no repetitions allowed)? 


digits 0-9. 

a. How many license plates are possible if repeti- 
tion of symbols is allowed? 

b. How many license plates do not contain any 
repeated symbols? 

c. How many license plates have at least one 
repeated symbol? 

d. What is the probability that a license plate cho- 
sen at random has at least one repeated symbol? 


At a certain company, passwords must be from 
3-5 symbols long and composed from the 26 
uppercase letters of the Roman alphabet, the ten 
digits 0-9, and the 14 symbols !, @, #, $, %, ^, &, 
Thk ayy 4,and 4}, 


Ha. How many passwords are possible if repetition 


of symbols is allowed? 
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b. How many passwords contain no repeated 
symbols? 

c. How many passwords have at least one re- 
peated symbol? 

d. What is the probability that a password chosen 
at random has at least one repeated symbol? 


In a certain country license plates consist of zero 

or one digit followed by four or five uppercase let- 

ters from the Roman alphabet. 

a. How many different license plates can the 
country produce? 

b. How many license plates have no repeated letter? 

c. How many license plates have at least one 
repeated letter? 

d. What is the probability that a license plate has 
a repeated letter? 


. a. Consider the following algorithm segment: 


for i:= 1 to4 
for j:= 1 toi 
[Statements in body of inner loop. 


None contain branching statements 
that lead outside the loop.] 


next j 


next i 


How many times will the inner loop be iterated 
when the algorithm is implemented and run? 

b. Letn be a positive integer, and consider the 
following algorithm segment: 


for i:= 1 ton 
for j:= 1 toi 
[Statements in body of inner loop. 


None contain branching statements 
that lead outside the loop.] 


next j 


next i 
How many times will the inner loop be iterated 
when the algorithm is implemented and run? 


A calculator has an eight-digit display and a 
decimal point that is located at the extreme right 
of the number displayed, or at the extreme left, 
or between any pair of digits. The calculator can 
also display a minus sign at the extreme left of 
the number. How many distinct numbers can the 
calculator display? (Note that certain numbers 
are equal, such as 1.9, 1.90, and 01.900, and 
should, therefore, not be counted twice.) 
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16. 


a. How many ways can the letters of the word 
QUICK be arranged in a row? 

b. How many ways can the letters of the word 
QUICK be arranged in a row if the Q and 
the U must remain next to each other in the 
order QU? 

c. How many ways can the letters of the word 
QUICK be arranged in a row if the letters QU 
must remain together but may be in either the 
order QU or the order UQ? 


a. How many ways can the letters of the word 
THEORY be arranged in a row? 

b. How many ways can the letters of the word 
THEORY be arranged in a row if T and H must 
remain next to each other as either TH or HT? 


A group of eight people are attending the movies 

together. 

a. Two of the eight insist on sitting side-by-side. 
In how many ways can the eight be seated 
together in a row? 

b. Two of the people do not like each other and 
do not want to sit side-by-side. Now how 
many ways can the eight be seated together in 
a row? 


An early compiler recognized variable names 
according to the following rules: Numeric vari- 
able names had to begin with a letter, and then 
the letter could be followed by another letter or a 
digit or by nothing at all. String variable names 
had to begin with the symbol $ followed by a 
letter, which could then be followed by another 
letter or a digit or by nothing at all. How many 
distinct variable names were recognized by this 
compiler? 


. Identifiers in a certain database language must 


begin with a letter, and then the letter may be fol- 
lowed by other characters, which can be letters, 
digits, or underscores (_). However, 82 keywords 
(all consisting of 15 or fewer characters) are 
reserved and cannot be used as identifiers. How 
many identifiers with 30 or fewer characters are 
possible? (Write the answer using summation 
notation and evaluate it using a formula from 
Section 5.2.) 


a. If any seven digits could be used to form a 
telephone number, how many seven-digit tele- 
phone numbers would not have any repeated 
digits? 


17. 


18. 


* 20. 


H* 21. 


* 22. 
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b. How many seven-digit telephone numbers 
would have at least one repeated digit? 

c. What is the probability that a randomly chosen 
seven-digit telephone number would have at 
least one repeated digit? 


a. How many strings of four hexadecimal digits 
do not have any repeated digits? 

b. How many strings of four hexadecimal digits 
have at least one repeated digit? 

c. What is the probability that a randomly chosen 
string of four hexadecimal digits has at least 
one repeated digit? 


Just as the difference rule gives rise to a formula 
for the probability of the complement of an event, 
so the addition and inclusion/exclusion rules give 
rise to formulas for the probability of the union of 
mutually disjoint events and for a general union of 
(not necessarily mutually exclusive) events. 

a. Prove that for mutually disjoint events A and B, 


P(A U B) = P(A) + P(B). 
b. Prove that for any events A and B, 
P(A U B) = P(A) + P(B)— P(A A B). 


. A combination lock requires three selections of 


numbers, each from | through 39. Suppose the 
lock is constructed in such a way that no number 
may be used twice in a row but the same number 
may occur both first and third. For example, 20 13 
20 would be acceptable, but 20 20 13 would not. 
How many different combinations are possible? 


a. How many integers from | through 100,000 
contain the digit 6 exactly once? 

b. How many integers from | through 100,000 
contain the digit 6 at least once? 

c. If an integer is chosen at random from 1 
through 100,000, what is the probability that it 
contains two or more occurrences of the digit 6? 


Six new employees, two of whom are married to 
each other, are to be assigned six desks that are 
lined up in a row. If the assignment of employees 
to desks is made randomly, what is the probabil- 
ity that the married couple will have nonadjacent 
desks? (Hint: The event that the couple have non- 
adjacent desks is the complement of the event that 
they have adjacent desks.) 


Consider strings of length n over the set {a, b, c, d}. 
a. How many such strings contain at least one 
pair of adjacent characters that are the same? 


23. 


24. 


25. 


26. 


Hc. 


H e. 


27. 
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b. Ifa string of length ten over {a, b, c, d} is cho- 
sen at random, what is the probability that it 
contains at least one pair of adjacent characters 
that are the same? 


a. How many integers from 1 through 1,000 are 
multiples of 4 or multiples of 7? 

b. Suppose an integer from 1 through 1,000 is 
chosen at random. Use the result of part (a) to 
find the probability that the integer is a mul- 
tiple of 4 or a multiple of 7. 

c. How many integers from 1 through 1,000 are 
neither multiples of 4 nor multiples of 7? 


a. How many integers from 1 through 1,000 are 
multiples of 2 or multiples of 9? 

b. Suppose an integer from 1 through 1,000 is 
chosen at random. Use the result of part (a) to 
find the probability that the integer is a mul- 
tiple of 2 or a multiple of 9. 

c. How many integers from 1 through 1,000 are 
neither multiples of 2 nor multiples of 9? 


Counting Strings: 

a. Make a list of all bit strings of lengths 0, 1, 2, 
3, and 4 that do not contain the bit pattern 111. 

b. For each integer n = 0, let d, = the number of 
bit strings of length n that do not contain the 
bit pattern 111. Find dọ, dj, do, d3, and d4. 

c. Find a recurrence relation for do, di, do, .... 

d. Use the results of parts (b) and (c) to find the 
number of bit strings of length 5 that do not 
contain the pattern 111. 


Counting Strings: Consider the set of all strings of 

a’s, b’s, and c’s. 

a. Make a list of all of these strings of lengths 0, 
1, 2, and 3 that do not contain the pattern aa. 

b. For each integer n = 0, let s,, = the number of 

strings of a’s, b’s, and c’s of length n that do not 

contain the pattern aa. Find so, 51, 52, and s3. 

Find a recurrence relation for So, 51, S2,.... 

d. Use the results of parts (b) and (c) to find the 

number of strings of a’s, b’s, and c’s of length 

four that do not contain the pattern aa. 

Use the technique described in Section 5.8 to 

find an explicit formula for so, 5), So,.... 


For each integer n = 0, let a, be the number of bit 
strings of length n that do not contain the pattern 101. 
a. Show that a, = ay) + ag-3 tag- tote + 
ao + 2, for every integer k = 3. 
b. Use the result of part (a) to show that if k = 3, 
then a, = 2a,—) — dy—2 + Ay_3. 
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For each integer n = 2 let a, be the number of per- 
mutations of {1, 2, 3,..., n} in which no number 
is more than one place removed from its “natural” 
position. Thus a, = 1 since the one permutation 
of {1}, namely, 1, does not move | from its natural 
position. Also, a = 2 since neither of the two 
permutations of {1, 2}, namely, 12 and 21, moves 
either number more than one place from its natu- 
ral position. 

a. Find a3. 

b. Find a recurrence relation for a4, a2, a3,.... 


Refer to Example 9.3.5. 
a. Write the following IP address in dotted deci- 
mal form: 


11001010 00111000 01101011 11101110 


b. How many Class A networks can there be? 

c. What is the dotted decimal form of the IP ad- 
dress for a computer in a Class A network? 

d. How many host IDs can there be for a Class A 
network? 

e. How many Class C networks can there be? 

f. What is the dotted decimal form of the IP ad- 
dress for a computer in a Class C network? 

g. How many host IDs can there be for a Class 
C network? 

h. How can you tell, by looking at the first of the 
four numbers in the dotted decimal form of an 
IP address, what kind of network the address is 
from? Explain. 

i. An IP address is 140.192.32.136. What class of 
network does it come from? 

j. An IP address is 202.56.107.238. What class of 
network does it come from? 


A row in a classroom has n seats. Let s„ be the num- 
ber of ways nonempty sets of students can sit in the 
row so that no student is seated directly adjacent to 
any other student. (For instance, a row of three seats 
could contain a single student in any of the seats or a 
pair of students in the two outer seats. Thus s3 = 4.) 
Find a recurrence relation for 5), 53, 53,.... 


Assume that birthdays are equally likely to occur 

in any one of the 12 months of the year. 

a. Given a group of four people, A, B, C, and D, 
what is the total number of ways in which birth 
months could be associated with A, B, C, and 
D? (For instance, A and B might have been born 
in May, C in September, and D in February. As 
another example, A might have been born in Jan- 
uary, B in June, C in March, and D in October.) 
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b. How many ways could birth months be associ- 
ated with A, B, C, and D so that no two people 
would share the same birth month? 

c. How many ways could birth months be associ- 
ated with A, B, C, and D so that at least two 
people would share the same birth month? 

d. What is the probability that at least two people 
out of A, B, C, and D share the same birth 
month? 

e. How large must n be so that in any group of n 
people, the probability that two or more share 
the same birth month is at least 50%? 


Assuming that all years have 365 days and all birth- 
days occur with equal probability, how large must n 
be so that in any randomly chosen group of n people, 
the probability that two or more have the same 
birthday is at least 1/2? (This is called the birthday 
problem. Many people find the answer surprising.) 


A college conducted a survey to explore the aca- 
demic interests and achievements of its students. It 
asked students to place checks beside the num- 
bers of all the statements that were true of them. 
Statement #1 was “I was on the Dean’s list last 
term,” statement #2 was “I belong to an academic 
club, such as the math club or the Spanish club,” 
and statement #3 was “I am majoring in at least 
two subjects.” Out of a sample of 100 students, 28 
checked #1, 26 checked #2, and 14 checked #3, 8 
checked both #1 and #2, 4 checked both #1 and 
#3, 3 checked both #2 and #3, and 2 checked all 
three statements. 

a. How many students checked at least one of the 
statements? 

b. How many students checked none of 
the statements? 

c. Let H be the set of students who checked #1, 
C the set of students who checked #2, and D 
the set of students who checked #3. Fill in the 
numbers for all eight regions of the diagram. 


Sample of Students 
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d. How many students checked #1 and #2 but 
not #3? 

e. How many students checked #2 and #3 but 
not #1? 

f. How many students checked #2 but neither of 
the other two? 


34. A study was done to determine the efficacy of 
three different drugs—A, B, and C—in relieving 
headache pain. Over the period covered by the 
study, 50 subjects were given the chance to use all 
three drugs. The following results were obtained: 


21 reported relief from drug A 

21 reported relief from drug B 

31 reported relief from drug C 

9 reported relief from both drugs A and B 

14 reported relief from both drugs A and C 

15 reported relief from both drugs B and C 

41 reported relief from at least one of the drugs. 


Note that some of the 21 subjects who reported 

relief from drug A may also have reported relief 

from drugs B or C. A similar occurrence may be 
true for the other data. 

a. How many people got relief from none of the 
drugs? 

b. How many people got relief from all three 
drugs? 

c. Let A be the set of all subjects who got relief 
from drug A, B the set of all subjects who got 
relief from drug B, and C the set of all subjects 
who got relief from drug C. Fill in the numbers 
for all eight regions of the following diagram. 


Sample of Subjects 


d. How many subjects got relief from A only? 


35. An interesting use of the inclusion/exclusion rule 
is to check survey numbers for consistency. For 
example, suppose a public opinion polltaker re- 
ports that out of a national sample of 1,200 adults, 
675 are married, 682 are from 20 to 30 years old, 
684 are female, 195 are married and are from 


20 to 30 years old, 467 are married females, 318 
are females from 20 to 30 years old, and 165 are 
married females from 20 to 30 years old. Are the 
polltaker’s figures consistent? Could they have oc- 
curred as a result of an actual sample survey? 


36. Fill in the reasons for each step below. If A and B 
are sets in a finite universe U, then 


MA N B) = NU) -NA N BY’) a 
= NC) NAP U B°) b) 
= N(U) — (N(A°) + N(B®) — MAS NB’) ©). 


For each of exercises 37-39, the number of elements 
in a certain set can be found by computing the num- 
ber in a larger universe that are not in the set and 
subtracting this from the total in the larger universe. 
In each of these, as was the case for the solution to 
Example 9.3.6(b), De Morgan’s laws and the inclusion/ 
exclusion rule can be used. 


37. How many positive integers less than 1,000 have 
no common factors with 1,000? 


38. How many permutations of abcde are there in 
which the first character is a, b, or c and the last 
character is c, d, or e? 


39. How many integers from | through 999,999 
contain each of the digits 1, 2, and 3 at least once? 
(Hint: For each i = 1, 2, and 3, let A; be the set of 
all integers from 1 through 999,999 that do not 
contain the digit i.) 

For 40 and 41, use the definition of the Euler phi function 

¢ from Section 7.1, exercises 51-53. 


H 40. Use the inclusion/exclusion principle to prove the 


following: If n = pq, where p and q are distinct 
prime numbers, then y(n) = (p— 1)(q- 1). 


41. Use the inclusion/exclusion principle to prove the 
following: If n = pqr, where p, q, and r are distinct 
prime numbers, then g(7) = (p — 1)(¢g-— 1)(r- 1). 


42. A gambler decides to play successive games of 
blackjack until he loses three times in a row. (Thus 
the gambler could play five games by losing the 
first, winning the second, and losing the final 
three or by winning the first two and losing the 
final three. These possibilities can be symbolized 
as LWLLL and WWLLL.) Let g,, be the number of 
ways the gambler can play n games. 

a. Find g3, g4, and gs. 
b. Find ge. 
H c. Finda recurrence relation for g3, g4, g5,-.-- 
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* 43. A derangement of the set {1, 2,..., n} is a permu- 


44 


45. 


46. 


ANSWERS FOR TEST YOURSELF 


tation that moves every element of the set away 
from its “natural” position. Thus 21 is a derange- 
ment of {1, 2}, and 231 and 312 are derangements 
of {1, 2, 3}. For each positive integer n, let d„ be the 
number of derangements of the set {1, 2,..., n}. 

a. Find dj, d, and d3. 

b. Find d4. 

c. Find a recurrence relation for d4, do, d3,.... 


Note that a product x;x x3 may be parenthesized 
in two different ways: (x1x2)x3 and xı(x2x3). Simi- 
larly, there are several different ways to paren- 
thesize xı x2x3x4. Two such ways are (x1x2)(x3x4) 
and x,((x2x3)x4). Let P,, be the number of different 
ways to parenthesize the product x).x2...x4. Show 
that if P; = 1, then 


n-1 
P,= X PPa- for every integer n = 2. 
k=1 
(It turns out that the sequence P4, P2, P3,... 
is the same as the sequence of Catalan num- 
bers: P, = C,— for every integer n = 1. 
See Example 5.6.4.) 


Use mathematical induction to prove Theorem 9.3.1. 


Prove the inclusion/exclusion rule for two sets A 
and B by showing that A U B can be partitioned 


47. 
H* 48. 


* 49. 


into A N B, A—(A N B), and B — (A N B), and 
then using the addition and difference rules. (See 
the hint for exercise 39 in Section 6.2.) 


Prove the inclusion/exclusion rule for three sets. 


Use mathematical induction to prove the general 
inclusion/exclusion rule: 


If A), A2, . . . , A, are finite sets, then 
N(A,; UA, U ... UA,) 
= > MAj) — > NA; N Aj) 
1Si=n 1<i<jsn 
+ >` N(A; N A; Ay) 
1<i<j<k<n 


=e (DTNA NA N NA,). 


(The notation Disi<j<n N(A; N Aj) means that 
quantities of the form N(A; N Aj) are to be added 
together for all integers i andj with 1 S i < j =n) 


A circular disk is cut into n distinct sectors, each 

shaped like a piece of pie and all meeting at the 

center point of the disk. Each sector is to be paint- 

ed red, green, yellow, or blue in such a way that no 

two adjacent sectors are painted the same color. 

Let S,, be the number of ways to paint the disk. 

a. Find a recurrence relation for S, in terms of 
S;,—; and S;,_» for each integer k = 4. 

b. Find an explicit formula for S, for n = 2. 


1. the number of elements in A equals N(A,) + N(A) +*+: + N(A„) 2. the number of elements in A — B is the difference 


between the number of elements in A and the number of elements in B, that is, N(A — B) = N(A) — N(B). 


3. 1— P(A) 


4. N(A U B) = N(A) + N(B) —- N(A N B) 5.N(A U B U ©) = N(A)+ N(B) + N(C)- MAN B)- MAN C)— 
NBACHNANABNAO 


The Pigeonhole Principle 


The shrewd guess, the fertile hypothesis, the courageous leap to a tentative 


conclusion—these are the most valuable coin of the thinker at work. 


—Jerome S. Bruner, 1960 


The pigeonhole principle states that if n pigeons fly into m pigeonholes and n > m, 
then at least one hole must contain two or more pigeons. This principle is illustrated in 
Figure 9.4.1 for n = 5 and m = 4. Illustration (a) shows the pigeons perched next to their 
holes, and (b) shows the correspondence from pigeons to pigeonholes. The pigeonhole 
principle is sometimes called the Dirichlet box principle because it was first stated for- 
mally by J. P. G. L. Dirichlet (1805-1859). 
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Pigeons Pigeonholes 


(a) 
FIGURE 9.4.1 


Illustration (b) suggests the following mathematical way to phrase the principle. 


Pigeonhole Principle 


A function from one finite set to a smaller finite set cannot be one-to-one: There must 
be at least two elements in the domain that have the same image in the co-domain. 


Thus an arrow diagram for a function from a finite set to a smaller finite set must have 
at least two arrows from the domain that point to the same element of the co-domain. In 
Figure 9.4.1(b), arrows from pigeons | and 4 both point to pigeonhole 3. 

Since the truth of the pigeonhole principle is easy to accept intuitively, we move imme- 
diately to applications, leaving a formal proof to the end of the section. Applications of the 
pigeonhole principle range from the totally obvious to the extremely subtle. A representa- 
tive sample is given in the examples and exercises that follow. 


Applying the Pigeonhole Principle 


a. In a group of six people, must there be at least two who were born in the same month? 
In a group of thirteen people, must there be at least two who were born in the same 
month? Why? 


b. Among the residents of New York City, must there be at least two people with the 
same number of hairs on their heads? Why? 


Solution 

a. A group of six people need not contain two who were born in the same month. For in- 
stance, the six people could have birthdays in each of the six months January through 
June. 

A group of 13 people, however, must contain at least two who were born in the 
same month, for there are only 12 months in a year and 13 > 12. To get at the essence 
of this reasoning, think of the thirteen people as the pigeons and the twelve months of 
the year as the pigeonholes. Denote the thirteen people by the symbols x), x2,..., X13 
and define a function B from the set of people to the set of twelve months as shown in 
the following arrow diagram. 
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13 people (pigeons) 12 months (pigeonholes) 


B 
— 


B(x;) = birth month of x; 


The pigeonhole principle says that no matter what the particular assignment of months 
to people, there must be at least two arrows pointing to the same month. Thus at least 
two people must have been born in the same month. 


b. The answer is yes. In this example the pigeons are the people of New York City and 
the pigeonholes are all possible numbers of hairs on any individual’s head. Call the 
population of New York City P. It is known that P is at least 8,000,000. Also, the 
maximum number of hairs on any person’s head is known to be less than 300,000. 
Define a function H from the set of people in New York City {x), x2, . . - , Xp} to the set 
{0, 1, 2, 3,..., 300,000}, as shown in the arrow diagram. 


People in New York City Possible number of hairs on 
(pigeons) a person’s head (pigeonholes) 
H 
— 
H(x;) = the number of 
hairs on x;’s head 


e 300,000 


Since the number of people in New York City is larger than the number of possible 
hairs on their heads, the function H is not one-to-one; at least two arrows point to the 
same number. And this means that at least two people have the same number of hairs 
on their heads. E 


Finding the Number to Pick to Ensure a Result 


A drawer contains ten black and ten white socks. You reach in and pull some out without 
looking at them. What is the least number of socks you must pull out to be sure to get a 
matched pair? Explain how the answer follows from the pigeonhole principle. 


Solution If you pick just two socks, they may have different colors. But when you pick a 
third sock, it must be the same color as one of the socks already chosen. Hence the answer 
is three. 

This answer could be phrased more formally as follows: Let the socks pulled out be 
denoted sj, S2, 53,..., Sn and consider the function C that sends each sock to its color, as 
shown on the next page. 
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Socks pulled out (pigeons) Colors (pigeonholes) 
C 
es 
C(s;) = color of s; ë white 
e black 


If n = 2, C could be a one-to-one correspondence (if the two socks pulled out were of dif- 
ferent colors). But if n > 2, then the number of elements in the domain of C is larger than 
the number of elements in the co-domain of C. Thus by the pigeonhole principle, C is not 
one-to-one: C(s;) = C(s;) for some s; # s;. This means that if at least three socks are pulled 
out, then at least two of them have the same color. E 


ZEWG Selecting a Pair of Integers with a Certain Sum 
Let A = {1, 2, 3, 4, 5, 6, 7, 8}. 
a. If five integers are selected from A, must at least one pair of the integers have a sum of 9? 


b. If four integers are selected from A, must at least one pair of the integers have a sum of 9? 


Solution 
a. Yes. Partition the set A into the following four disjoint subsets: 


{1,8} {2,7}, {3,6}, and {4, 5}. 


Observe that each of the integers in A occurs in exactly one of the four subsets and 
that the sum of the integers in each subset is 9. Thus if five integers from A are chosen, 
then by the pigeonhole principle, two must be from the same subset. It follows that the 
sum of these two integers is 9. 

To see precisely how the pigeonhole principle applies, let the pigeons be the five 
selected integers (call them ay, a>, a3, a4, and as) and let the pigeonholes be the subsets 
of the partition. The function P from pigeons to pigeonholes is defined by letting P(a,;) 
be the subset that contains qj. 


The 5 selected integers The 4 subsets in the partition of A 
(pigeons) (pigeonholes) 


P 
— 


P(a;) = the subset that 
contains a; 


The function P is well defined because for each integer a; in the domain, a; belongs 
to one of the subsets (since the union of the subsets is A) and a; does not belong to 
more than one subset (since the subsets are disjoint). 
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Because there are more pigeons than pigeonholes, at least two pigeons must go to 
the same hole. Thus two distinct integers are sent to the same set. But that implies 
that those two integers are the two distinct elements of the set, so their sum is 9. More 
formally, by the pigeonhole principle, since P is not one-to-one, there are integers a; 
and a; such that 


P(aqj;) E P(a;) and di # aj. 


But then, by definition of P, a; and a; belong to the same subset. Since the elements in 
each subset add up to 9, a; + aj = 9. 


b. The answer is no. This is a case where the pigeonhole principle does not apply; the 
number of pigeons is not larger than the number of pigeonholes. For instance, if you 
select the integers 1, 2, 3, and 4, then since the largest sum of any two of these integers 
is 7, no two of them add up to 9. E 


Application to Decimal Expansions of Fractions 
One important consequence of the pigeonhole principle is the fact that 
the decimal expansion of any rational number either terminates or repeats. 


A terminating decimal is one like 


3.625, 
and a repeating decimal is one like 
Note Strictly speaking, a -F 
terminating decimal like 2.38246, 
3.625 can be regarded where the bar over the digits 246 means that these digits are repeated forever. 
as a repeating decimal 
by adding trailing zeros: Recall that a rational number is one that can be written as a ratio of integers—in other 


3.625 = 3.6250. This can words, as a fraction. Recall also that the decimal expansion of a fraction is obtained by 
also be written as 3.6249. dividing its numerator by its denominator using long division. For example, the decimal 
expansion of 4/33 is obtained as follows: 


A 2IQ21T2 12... 


‘These are the same number. 


Because the number 4 reappears as a remainder in the long-division process, the sequence 
of quotients and remainders that give the digits of the decimal expansion repeats forever; 
hence the digits of the decimal expansion repeat forever. 

In general, when one integer is divided by another, it is the pigeonhole principle (to- 
gether with the quotient-remainder theorem) that guarantees that such a repetition of re- 
mainders and hence decimal digits must always occur. This is explained in the following 
example. The analysis in the example uses an obvious generalization of the pigeonhole 
principle, namely, that a function from an infinite set to a finite set cannot be one-to-one. 
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The Decimal Expansion of a Fraction 


Let a and b be integers and consider a fraction a/b, where for simplicity a and b are both 
assumed to be positive. The decimal expansion of a/b is obtained by dividing a by b as 
illustrated here for a = 3 and b = 14. 

2142857142857... 


14 Bo 000000000000000 


28 — 179 = 3 
@0 — n =2 
14 

@0 >2=6 
56 
o > r3 =4 
28 
IDo > rg = 12 
112 
0 >r5=8 
70 
100 > 76 = 10 
98 
@0 $= 2—7] 
Ae 
©0 —— rs 6 r2 
56 
Do 


> rn =4=7Fr3 


Let rọ = a and let r1, r2, r3, . . . be the successive remainders obtained in the long division 
of a by b. By the quotient-remainder theorem, each remainder must be between 0 and 
b — 1. (In this example, a is 3 and b is 14, and so the remainders are from 0 to 13.) If some 
remainder r; = 0, then the division terminates and a/b has a terminating decimal expan- 
sion. If no r; = 0, then the division process and hence the sequence of remainders contin- 
ues forever. By the pigeonhole principle, since there are more remainders than values that 
the remainders can take, some remainder value must repeat: r; = rg, for some indices j and 
k with j < k. This is illustrated below for a = 3 and b = 14. 


Sequence of remainders Values of remainders when b = 14 


F(r;) = value of r, 
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If follows that the decimal digits obtained from the divisions between r; and r,_, repeat 
forever. In the case of 3/14, the repetition begins with r} = 2 = r; and the decimal ex- 
pansion repeats the quotients obtained from the divisions from rı through rę forever: 
3/14 = 0.2142857. a 


Note that since the decimal expansion of any rational number either terminates or 
repeats, if a number has a decimal expansion that neither terminates nor repeats, then 
it cannot be rational. Thus, for example, the following number cannot be rational: 
0.01011011101111011111... (where each string of 1’s is one longer than the previous 
string). 


Generalized Pigeonhole Principle 


A generalization of the pigeonhole principle states that if n pigeons fly into m pigeon- 
holes and, for some positive integer k, km < n, then at least one pigeonhole contains k + 1 
or more pigeons. This is illustrated in Figure 9.4.2 for m = 4, n = 9, and k = 2. Since 
2-4 < 9, at least one pigeonhole contains three (2 + 1) or more pigeons. (In this example, 
pigeonhole 3 contains three pigeons.) 


Pigeons Pigeonholes 


FIGURE 9.4.2 


Generalized Pigeonhole Principle 


For any function ffrom a finite set X with n elements to a finite set Y with m elements 
and for any positive integer k, if km < n, then there is some y € Y such that y is the 
image of at least k + 1 distinct elements of X. 


Applying the Generalized Pigeonhole Principle 


Show how the generalized pigeonhole principle implies that in a group of 85 people, at 
least 4 must have the same last initial. 


Solution In this example the pigeons are the 85 people and the pigeonholes are the 26 
possible last initials of their names. 
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Consider the function L from people to initials defined by the following arrow diagram. 


85 people (pigeons) 26 initials (pigeonholes) 
L 
— 
L(x;) = the initial of 
xps last name 


Since 3-26 = 78 < 85, the generalized pigeonhole principle states that some initial must 
be the image of at least four (3 + 1) people. Thus at least four people have the same last 
initial. E 


Consider the following contrapositive form of the generalized pigeonhole principle. 


Generalized Pigeonhole Principle (Contrapositive Form) 


For any function f from a finite set X with n elements to a finite set Y with m elements 
and for any positive integer k, if for each y € Y, f m y) has at most k elements, then 
X has at most km elements; in other words, n S km. 


You may find it natural to use the contrapositive form of the generalized pigeonhole prin- 
ciple in certain situations. For instance, the result of Example 9.4.5 can be explained as 
follows: 


Suppose no 4 people out of the 85 had the same last initial. Then at most 3 would 
share any particular one. By the generalized pigeonhole principle (contrapositive 
form), this would imply that the total number of people is at most 3:26 = 78. But 
this contradicts the fact that there are 85 people in all. Hence at least 4 people share 
a last initial. 


Using the Contrapositive Form of the Generalized Pigeonhole Principle 


There are 42 students who are to share 12 computers. Each student uses exactly 1 com- 
puter, and no computer is used by more than 6 students. Show that at least 5 computers are 
used by 3 or more students. 


Solution 

a. Using an Argument by Contradiction: Suppose not. Suppose that 4 or fewer comput- 
ers are used by 3 or more students. /A contradiction will be derived.] Then 12— 4 = 8 
or more computers are used by 2 or fewer students. Divide the set of computers into 
two subsets: C, and C}. Into C, place 8 of the computers used by 2 or fewer students; 
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into C, place the computers used by 3 or more students plus any remaining computers 
(to make a total of 4 computers in C3). (See Figure 9.4.3.) 


The Set of 12 Computers 


Each of these computers Some or all of these computers serve 
serves at most 2 students. 3 or more students. Each computer 


So the maximum number serves at most 6 students. So the 
served by these computers is maximum number served by these 
2 85G computers is 6 - 4 = 24. 


FIGURE 9.4.3 


Since at most 6 students are served by any one computer, by the contrapositive 
form of the generalized pigeonhole principle, the computers in set C, serve at most 
6:4 = 24 students. Since at most 2 students are served by any one computer in C}, 
by the generalized pigeonhole principle (contrapositive form), the computers in set 
C; serve at most 2:8 = 16 students. Hence the total number of students served by the 
computers is 24 + 16 = 40. But this contradicts the fact that each of the 42 students 
is served by a computer. Therefore, the supposition is false: At least 5 computers are 
used by 3 or more students. 


b. Using a Direct Argument: Let k be the number of computers used by 3 or more 
students. [We must show that k = 5.] Because each computer is used by at most 6 
students, these computers are used by at most 6k students (by the contrapositive 
form of the generalized pigeonhole principle). Each of the remaining 12 — k comput- 
ers is used by at most 2 students. Hence, taken together, they are used by at most 
2(12 — k) = 24 — 2k students (again, by the contrapositive form of the generalized 
pigeonhole principle). Thus the maximum number of students served by the comput- 
ers is 6k + (24 — 2k) = 4k + 24. Because 42 students are served by the computers, 
4k + 24 = 42. Solving for k gives that k > 4.5, and since k is an integer, this implies 
that k = 5 [as was to be shown]. | 


Proof of the Pigeonhole Principle 


The truth of the pigeonhole principle depends essentially on the sets involved being finite. 
Recall from Section 7.4 that a set is called finite if, and only if, it is the empty set or there 
is a one-to-one correspondence from {1, 2,..., n} to it, where n is a positive integer. In the 
first case the number of elements in the set is said to be 0, and in the second case it is said 
to be n. A set that is not finite is called infinite. 

Thus any finite set is either empty or can be written in the form {x), X2, . . - , x,} where 
n is a positive integer. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.4 THE PIGEONHOLE PRINCIPLE 613 


Theorem 9.4.1 The Pigeonhole Principle 


For any function f from a finite set X with n elements to a finite set Y with m ele- 
ments, if n > m, then fis not one-to-one. 


Proof: Suppose f is any function from a finite set X with n elements to a finite set Y 
with m elements where n > m. Denote the elements of Y by y1, yo, ..., Ym. Recall that 
for each y; in Y, the inverse image set fo = {x E X| f(x) = y,}. Now consider 
the collection of all the inverse image sets for all the elements of Y: 


FODS OD -f Om. 


By definition of function, each element of X is sent by f to some element of Y. Hence 
each element of X is in one of the inverse image sets, and so the union of all these 
sets equals X. But also, by definition of function, no element of X is sent by f to 
more than one element of Y. Thus each element of X is in only one of the inverse 
image sets, and so the inverse image sets are mutually disjoint. By the addition rule, 
therefore, 


NX = NF ODEN ODE = ENE On). 9.4.1 


Now suppose that fis one-to-one /which is the opposite of what we want to prove]. 
Then each set f lo) has at most one element, and so 


NF OD) +N ODE +N On) S L414 +1 =m 942 


m terms 
Putting equations (9.4.1) and (9.4.2) together gives that 
n= NX) =m = N(Y). 


This contradicts the fact that n > m, and so the supposition that fis one-to-one must 
be false. Hence fis not one-to-one fas was to be shown]. 


An important theorem that follows from the pigeonhole principle states that a function 
from one finite set to another finite set of the same size is one-to-one if, and only if, it is 
onto. As shown in Section 7.4, this result does not hold for infinite sets. 


Theorem 9.4.2 One-to-One and Onto for Finite Sets 


Let X and Y be finite sets with the same number of elements and suppose fis a func- 
tion from X to Y. Then fis one-to-one if, and only if, fis onto. 


Proof: Suppose fis a function from X to Y, where X and Y are finite sets each with m 
elements. Let X = {x1, X2,...,X,} and Y = {y, yo,..-, Yn}. 


If fis one-to-one, then f is onto: Suppose fis one-to-one. Then f(x), (2), <, fm) 
are all distinct. Consider the set S of all elements of Y that are not the image of any 
element of X. 

Then the sets 


ED, SO) -Afam and S 
(continued on page 614) 
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are mutually disjoint. By the addition rule, 


MY) = NFED + NAF) + FNC SEn) + MS) 
=1+1+---+1+WMS) because each { f(x,)} 


is a singleton set 


m terms 
=m+N(S). 
Thus 
m = m-+N(S) because N(Y) = m, 
and so 
NS) = 0 by subtracting m from both sides. 


Hence S is empty, and so there is no element of Y that is not the image of some ele- 
ment of X. Consequently, fis onto. 


If f is onto, then f is one-to-one: Suppose f is onto. Then, for each i= 1, 
Digeavva-y MM f ‘od # Ø and so Nf 'O)) = 1. As in the proof of the pigeon- 
hole principle (Theorem 9.4.1), X is the union of the mutually disjoint sets 


F ODF OD,- -f Om). By the addition principle, 
NX) = NE 'ODENE ODE ANS Om) =m 9.4.3 


m terms, each = 1 


Now if any one of the sets f KOA) has more than one element, then the sum of the m 
terms in equation (9.4.3) is greater than m. But we know this is not the case because 
N(X) = m. Hence each set f ly) has exactly one element, and thus fis one-to-one 
[as was to be shown]. 


Note that Theorem 9.4.2 applies in particular to the case X = Y. Thus a one-to-one 


> A function from a finite set to itself is onto, and an onto function from a finite set to itself 
=| is one-to-one. Such functions are permutations of the sets on which they are defined. For 


instance, the function defined by the diagram on the left is another representation for the 
permutation cdba obtained by listing the images of a, b, c, and d in order. 


TEST YOURSELF 


1. The pigeonhole principle states that 


: 3. If X and Y are finite sets and fis a function from X 
to Y then fis one-to-one if, and only if, 


2. The generalized pigeonhole principle states that 


EXERCISE SET 9.4 


1. a. If 4 cards are selected from a standard 52- 2. a. If 13 cards are selected from a standard 
card deck, must at least 2 be of the same suit? 52-card deck, must at least 2 be of the same 
Why? denomination? Why? 

b. If 5 cards are selected from a standard 52- b. If 20 cards are selected from a standard 
card deck, must at least 2 be of the same suit? 52-card deck, must at least 2 be of the same 
Why? denomination? Why? 
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H7. 


10. 


Tl. 


12. 


13. 


. A small town has only 500 residents. Must there 


be 2 residents who have the same birthday? Why? 


In a group of 700 people, must there be 2 who 
have the same first and last initials? Why? 


a. Given any set of four integers, must there 
be two that have the same remainder when 
divided by 3? Why? 

b. Given any set of three integers, must there 
be two that have the same remainder when 
divided by 3? Why? 


a. Given any set of seven integers, must there 
be two that have the same remainder when 
divided by 6? Why? 

b. Given any set of seven integers, must there 
be two that have the same remainder when 
divided by 8? Why? 


Let S = {3, 4, 5, 6, 7, 8, 9, 10, 11, 12}. Suppose 
six integers are chosen from S. Must there be two 
integers whose sum is 15? Why? 


. Let T = {1, 2, 3, 4, 5, 6, 7, 8, 9}. Suppose five inte- 


gers are chosen from 7: Must there be two integers 
whose sum is 10? Why? 


a. If seven integers are chosen from between 1 
and 12 inclusive, must at least one of them be 
odd? Why? 

b. If ten integers are chosen from between 1 and 
20 inclusive, must at least one of them be even? 
Why? 


If n + | integers are chosen from the set 
(1,2, 3,+:5. 27}, 


where n is a positive integer, must at least one of 
them be odd? Why? 


If n + | integers are chosen from the set 
{;.2, 3,225,527}, 


where n is a positive integer, must at least one of 
them be even? Why? 


How many cards must you pick from a standard 
52-card deck to be sure of getting at least 1 red 
card? Why? 


Suppose six pairs of similar-looking boots are 
thrown together in a pile. How many individ- 
ual boots must you pick to be sure of getting a 
matched pair? Why? 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 
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How many integers from 0 through 60 must you 
pick in order to be sure of getting at least one that 
is odd? at least one that is even? 


If n is a positive integer, how many integers from 0 
through 2n must you pick in order to be sure of get- 
ting at least one that is odd? at least one that is even? 


How many integers from | through 100 must you 
pick in order to be sure of getting one that is divis- 
ible by 5? 


How many integers must you pick in order to 
be sure that at least two of them have the same 
remainder when divided by 7? 


How many integers must you pick in order to 
be sure that at least two of them have the same 
remainder when divided by 15? 


How many integers from 100 through 999 must 
you pick in order to be sure that at least two of 
them have a digit in common? (For example, 256 
and 530 have the digit 5 in common.) 


a. If repeated divisions by 20,483 are performed, 
how many distinct remainders can be obtained? 

b. When 5/20483 is written as a decimal, what is 
the maximum length of the repeating section 
of the representation? 


When 683/1493 is written as a decimal, what is 
the maximum length of the repeating section of 
the representation? 


Is 0.101001000100001000001 ... (where each 
string of 0’s is one longer than the previous one) 
rational or irrational? 


Is 56.556655566655556666. . . (where the strings 
of 5’s and 6’s become longer in each repetition) 
rational or irrational? 


Show that within any set of thirteen integers 
chosen from 2 through 40, there are at least two 
integers with a common divisor greater than 1. 


In a group of 30 people, must at least 3 have been 
born in the same month? Why? 


In a group of 30 people, must at least 4 have been 
born in the same month? Why? 


In a group of 2,000 people, must at least 5 have 
the same birthday? Why? 


A programmer writes 500 lines of computer code 
in 17 days. Must there have been at least 1 day 
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when the programmer wrote 30 or more lines of 
code? Why? 


H*35. 


Given a set of 52 distinct integers, show that there 
must be 2 whose sum or difference is divisible by 
100. 


29. A certain college class has 40 students. All the stu- 
dents in the class are known to be from 17 through H*36. Show that if 101 integers are chosen from | to 200 
34 years of age. You want to make a bet that the inclusive, there must be 2 with the property that 
class contains at least x students of the same age. one is divisible by the other. 
How large can you make x and yet be sure to win : i 
our het? * 37. a. Suppose dj, do,..., Ap is a sequence of n inte- 
y f gers none of which is divisible by n. Show that 
30. A penny collection contains twelve 1967 pennies, at least one of the differences a; — a; (for i # j) 
seven 1968 pennies, and eleven 1971 pennies. If must be divisible by n. 
you are to pick some pennies without looking at H b. Show that every finite sequence x), X2,..., Xn 
the dates, how many must you pick to be sure of of n integers has a consecutive subsequence 
getting at least five pennies from the same year? Xj+1, Xi+2 - - -, X; Whose sum is divisible by n. 
H 31. A group of 15 executives are to share 5 assistants. (For Istance; the sequence 3,4 1/-1 10 Hag 
ne j f the consecutive subsequence 17, 7, 16 whose 
Each executive is assigned exactly 1 assistant, and Soy gist 
: : ? : sum is divisible by 5.) (From: James E. Schultz 
no assistant is assigned to more than 4 executives. ate a 
: ; and William F. Burger, “An Approach to 
Show that at least 3 assistants are assigned to 3 or : À : 
more executes Problem-Solving Using Equivalence Classes 
i Modulo n,” College Mathematics Journal (15), 
H*32. Let A be a set of six positive integers each of No. 5, 1984, 401—405.) 
whioh i less than T3. Show that there mist be two: pike. Observe that the sequence 12, 15,8; 13,7, 18,19, 
distinct subsets of A whose elements when added 3 i 
up give the same sum. (For example, if A = {5, 12 11, 14, 10 has three increasing subsequences of 
PE i pe, oo length four: 12, 15, 18, 19; 12, 13, 18, 19; and 8, 
10, 1, 3, 4}, then the elements of the subsets ; 
S, = {1, 4, 10} and S> = {5, 10} both add up to 15) 13, 18, 19. It also has one decreasing subsequence 
l Sii 2 > p ` of length four: 15, 13, 11, 10. Show that in any se- 
H 33. Let A be a set of six positive integers each of quence of n? + 1 distinct real numbers, there must 
which is less than 15. Show that there must be be a sequence of length n + 1 that is either strictly 
two distinct subsets of A whose elements when increasing or strictly decreasing. 
AE ae er ei A mals dO Tanakan * 39. What is the largest number of elements that a set 
P ` of integers from 1 through 100 can have so that no 
34. Let S be a set of ten integers chosen from 1 one integer in the set is divisible by another? (Hint: 
through 50. Show that the set contains at least two Imagine writing all the integers from 1 through 
different (but not necessarily disjoint) subsets of 100 in the form 2k. m, where k = 0 and m is odd.) 
four integers that add up to the same number. (For 40. Sippose Xand Yare Mnie sels, X tasmow de 


ANSWERS FOR TEST YOURSELF 


instance, if the ten numbers are {3, 8, 9, 18, 24, 
34, 35, 41, 44, 50}, the subsets can be taken to be 
{8, 24, 34, 35} and {9, 18, 24, 50}. The numbers in 
both of these add up to 101.) 


ments than Y, and F: X — Y is a function. By the pi- 
geonhole principle, there exist elements a and b in X 
such that a # b and F(a) = F(b). Write a computer 
algorithm to find such a pair of elements a and b. 


contains k + 1 or more pigeons Or: for any function f 


from a finite set X with n elements to a finite set Y with 


1. if n pigeons fly into m pigeonholes and n > m, then at 
Or: a 
function from one finite set to a smaller finite set cannot be 


least two pigeons fly into the same pigeonhole 
m elements and for any positive integer k, if km < n, then 
one-to-one 2. if n pigeons fly into m pigeonholes and, for there is some y € Y such that y is the image of at least k + 1 


some positive integer k, km < n, then at least one pigeonhole distinct elements of Y 3. fis onto 
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EH Counting Subsets of a Set: Combinations 


“But ‘glory’ doesn’t mean ‘a nice knock-down argument,’” Alice objected. “When 
I use a word,” Humpty Dumpty said, in rather a scornful tone, “it means just what 
I choose it to mean—neither more nor less.” —Lewis Carroll, Through the Looking 
Glass, 1872 


Consider the following question: 


Suppose 5 members of a group of 12 are to be chosen to work as a team on a spe- 
cial project. How many distinct 5-person teams can be selected? 


This question is answered in Example 9.5.4. It is a special case of the following more gen- 
eral question: 


Given a set S with n elements, how many subsets of size r can be chosen from S? 


The number of subsets of size r that can be chosen from S equals the number of subsets 
of size r that S has. Each individual subset of size r is called an r-combination of the set. 


Note Although there is 
a convenient formula for 
computing values of (”), Let n and r be nonnegative integers with r = n. Anr-combination of a set of n ele- 
it is helpful to think of (n) ments is a subset of r of the n elements. 

simply as a way to count 
the number of subsets of 
size r that can be formed 
from a set of n elements. Notation (") 


The symbol ("), read “n choose r,’ denotes the number of subsets of size r (or r- 
combinations) that can be formed from a set of n elements. 


Definition r-combination 


As noted in Section 5.1, calculators generally use symbols like C(n, r), „C, Cn,» Or "C, 
instead of ("). 


3-Combinations 


Consider a set consisting of 4 people: Ann, Bob, Cyd, and Dan. 


a. Given the set {Ann, Bob, Cyd, and Dan}, each subset of size 3 is a 3-combination of 
its elements. List all the 3-combinations of the set. 


b. Use the result of part (a) to find (3). 
c. In how many ways can the people in the set form a committee of size 3? 


Solution 


a. Each 3-combination (subset of size 3) can be obtained by leaving out one of the ele- 
ments of the set: 
{Bob, Cyd, Dan} leave out Ann 
{Ann, Cyd, Dan} leave out Bob 
{Ann, Bob, Dan} leave out Cyd 
{Ann, Bob, Cyd} leave out Dan. 
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b. There are 4 items in the list of 3-combinations in part (a). So, (3) = 4, 


c. The number of ways for the people in the set to form a committee of size 3 is the num- 
ber of distinct such committees, which is the same as the number of subsets of size 3 
and equals the number of 3-combinations of elements in the set. Thus there are 4 ways 
the people in the set can form a committee of size 3. E 


There are two distinct methods that can be used to select r objects from a set of n ele- 
ments. In an ordered selection, it is not only what elements are chosen but also the order 
in which they are chosen that matters. Two ordered selections are said to be the same if 
the elements chosen are the same and also if the elements are chosen in the same order. 
An ordered selection of r elements from a set of n elements is an r-permutation of the set. 

In an unordered selection, on the other hand, it is only the identity of the chosen ele- 
ments that matters. Two unordered selections are said to be the same if they consist of the 
same elements, regardless of the order in which the elements are chosen. An unordered 
selection of r elements from a set of n elements is the same as a subset of size r or an 
r-combination of the set. 


SCUWGEAYA Unordered Selections 


How many unordered selections of two elements can be made from the set {0, 1, 2, 3}? 


Solution An unordered selection of two elements from {0, 1, 2, 3} is the same as a 
2-combination, or subset of size 2, taken from the set. These can be listed systematically: 
{0, 1}, {0, 2}, {0,3} subsets containing 0 
{1, 2}, {1, 3} subsets containing 1 but not already listed 
{2, 3} subsets containing 2 but not already listed. 


Since this listing exhausts all possibilities, there are six subsets in all. Thus (3) = 6, which 
is the number of unordered selections of two elements from a set of four. E 


When the values of n and r are small, it is reasonable to calculate values of (") using 
the method of complete enumeration (listing all possibilities), which was illustrated in 
Examples 9.5.1 and 9.5.2. But when n and r are large, it is not feasible to compute these 
numbers by listing and counting all possibilities. 

The general values of (") can be found by a somewhat indirect but simple method. An 
equation is derived that contains (") as a factor. Then this equation is solved to obtain a 


n 
r 


formula for ( ). The method is illustrated by Example 9.5.3. 
seule Relation between Permutations and Combinations 


Write all 2-permutations of the set {0, 1, 2, 3}. Find an equation relating the number of 
2-permutations, P (4, 2), and the number of 2-combinations, (3), and solve this equation for (3). 
Solution According to Theorem 9.2.3, the number of 2-permutations of the set {0, 1, 2, 3} 
is P(4, 2), which equals 
4! 4-3-2 
(4-2) Ao 
Now the act of constructing a 2-permutation of {0, 1, 2, 3} can be thought of as a two- 
step process: 


12. 


Step 1: Choose a subset of two elements from {0, 1, 2, 3}. 


Step 2: Choose an ordering for the two-element subset. 
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This process can be illustrated by the possibility tree shown in Figure 9.5.1. 


Step 1: Write the 2-combinations Step 2: Order the 2-combinations 
of {0, 1, 2, 3}. to obtain 2-permutations. 
{0, 1 01 
10 
{0,2 02 
20 
{0,3 03 
30 
Start ` 
{1,2 12 
21 
{1,3 13 
31 
{2,3 23 
32 


FIGURE 9.5.1 Relation between Permutations and Combinations 


The number of ways to perform step 1 is (3), the same as the number of subsets of size 
2 that can be chosen from {0, 1, 2, 3}. The number of ways to perform step 2 is 2!, the 
number of ways to order the elements in a subset of size 2. Because the number of ways of 
performing the whole process is the number of 2-permutations of the set {0, 1, 2, 3}, which 
equals P(4, 2), it follows from the product rule that 


4 
P(4, 2) = () *2!. This is an equation that relates P(4, 2) and (4). 


4 
Solving the equation for () gives 
() _ P(4, 2) 


2 2! 
Recall that P(4, 2) = A So, substituting yields 
4! 
4 4-2)! 4! 
_ 4-2)! _ B P 
2 2! 2'(4—2)! 


The reasoning used in Example 9.5.3 applies in the general case as well. To form an 
r-permutation of a set of n elements, first choose a subset of r of the n elements (there are (") 
ways to perform this step), and then choose an ordering for the r elements (there are r! ways 


to perform this step). Thus the number of r-permutations is 


n 
P(n, r) = o)» 


Now solve for (") to obtain the formula 


; ! Ske ; 
Since P(n, r) = Gao substitution gives 


n! 


n\ (=r! | n! 
r r! rn- rl 
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The result of this discussion is summarized and extended in Theorem 9.5.1. 


Theorem 9.5.1 Computational Formula for 


j 
The number of subsets of size r (or r-combinations) that can be chosen from a set of 
n elements, ("), is given by the formula 


n P(n, r) 
= first version 
r r! 


or, equivalently, 


n n! 
ae a OT second version 
ri(n=r)! 


where n and r are nonnegative integers with r = n. 


Note that the analysis presented before the theorem proves the theorem in all cases 
where n and r are positive. If r is zero and n is any nonnegative integer, then (") is the 
number of subsets of size zero of a set with n elements. But you know from Section 6.2 that 
there is only one set that does not have any elements. Consequently, (r) = 1. Also 


n! i! 
Oln—O0)! lent 


1 


since 0! = 1 by definition. (Remember we said that definition would turn out to be conve- 
nient!) Hence the formula 

n\ n! 

0 Ol — 0)! 


holds for every integer n = 0, and so the theorem is true for all nonnegative integers n and 
rwithr =n. 


Calculating the Number of Teams 


Consider again the problem of choosing five members from a group of twelve to work as a 
team on a special project. How many distinct five-person teams can be chosen? 

Solution The number of distinct five-person teams is the same as the number of subsets 
of size 5 (or 5-combinations) that can be chosen from the set of 12. This number is (ZI, 


By Theorem 9.5.1, 


12 12! -11-W-9-8-H 
( )- = 16 H _ 11.9.8 = 792. 
5) 5'12—-5)! Qpr 
Thus there are 792 distinct five-person teams. E 


The formula for the number of r-combinations of a set can be applied in a wide variety 
of situations. Some of these are illustrated in the following examples. 


Teams That Contain Both or Neither 


Suppose two members of the group of 12 insist on working as a pair—any team must con- 
tain either both or neither. How many five-person teams can be formed? 
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Solution Call the two members of the group that insist on working as a pair A and B. 
Then any team formed must contain both A and B or neither A nor B. The set of all possible 
teams can be partitioned into two subsets as shown in Figure 9.5.2 below. 

A team that contains both A and B looks like 


A B ? ? ? 


where the three question marks are replaced by three people from the remaining ten in the 
group. Hence there are as many such teams as there are subsets of three people that can be 
chosen from the remaining ten, and by Theorem 9.5.1, this number is 


ged 
10 10! 10-9-8-7! 
= = 9-8-7 = 120. 
3 31-7! 2-2-1-7! 
A team that contains neither A nor B looks like 
? ? ? ? ? 


where the five question marks are replaced by five people from the remaining ten. Thus 
there are as many such teams as there are subsets of five people that can be chosen from 
the remaining ten, and by Theorem 9.5.1, this number is 


5 515! 3432-1-51 252. 


Because the set of teams that contain both A and B is disjoint from the set of teams that 
contain neither A nor B, by the addition rule, 


($) — 10! 16:98:76 


the number of teams the number of teams the number of teams 
containing both A and B | = | containing + | containing 
or neither A nor B both A and B neither A nor B 


120 + 252 = 372. 


This reasoning is summarized in Figure 9.5.2. 


All Possible Five-Person Teams 
Containing Both or Neither 


So the total number of teams 
teams with teams with that contain either both A and B 
both A and B neither A nor B or neither A nor B is 

120 + 252 = 372. 


There are There are 


) = 120 of these. (2) = 252 of these. 


FIGURE 9.5.2 B 


Teams That Do Not Contain Both 


Suppose two members of the group don’t get along and refuse to work together on a team. 
How many five-person teams can be formed? 
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Solution Call the two people who refuse to work together C and D. There are two dif- 
ferent ways to answer the given question: One uses the addition rule and the other uses the 
difference rule. 

To use the addition rule, partition the set of all teams that don’t contain both C and D 
into three subsets as shown in Figure 9.5.3 below. 

Because any team that contains C but not D contains exactly four other people from the 
remaining ten in the group, by Theorem 9.5.1 the number of such teams is 


= 210. 


ae 10! _ 10-98-76 
4 4110-4)! 4-3-2-1-6f 


Similarly, there are (7 = 210 teams that contain D but not C. Finally, by the same reason- 
ing as in Example 9.5.5, there are 252 teams that contain neither C nor D. Thus, by the 
addition rule, 


ies number of teams that do 


= 210+ 210+ 252 = 672. 
not contain both C and D l ý j 2 2 


This reasoning is summarized in Figure 9.5.3. 


All Possible Five-Person Teams 
That Do Not Contain Both C and D 


So the total number of teams that 
do not contain both C and D is 
210 + 210 + 252 = 672. 


teams that teams that teams that 
contain C contain D contain neither 
but not D but not C Cnor D 


There are There are There are 

10 10 10 
(y)=210 (%)-20 (5) =252 
of these. of these. of these. 


FIGURE 9.5.3 


The alternative solution by the difference rule is based on the following observation: 
The set of all five-person teams that don’t contain both C and D equals the set difference 
between the set of all five-person teams and the set of all five-person teams that contain 
both C and D. By Example 9.5.4, the total number of five-person teams is (2) = 792. Thus, 
by the difference rule, 


the number of teams that don’t | _ | the total number of E the number of teams that 
contain both C and D contain both C and D 


= p = 792 — 120 = 672 
5 3 l 


This reasoning is summarized in Figure 9.5.4. 


teams of five 
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There are 
All Five-Person Teams 9=.<———————._ /12 
( 5 ) = 792 of these. 


teams that do teams that There are 

not contain contain (2) 

both C and D both C and D 3 ) = 120 of these. 
So there are 


792 — 120 = 672 of these. 
FIGURE 9.5.4 E 


Before we begin the next example, a remark on the phrases at least and at most is in 
order: 


The phrase at least n means “n or more.” 


The phrase at most n means “n or fewer.” 


For instance, if a set consists of three elements and you are to choose at least two, you will 
choose two or three; if you are to choose at most two, you will choose none, or one, or two. 


Teams with Members of Two Types 


Suppose the group of twelve consists of five men and seven women. 

a. How many five-person teams can be chosen that consist of three men and two women? 
b. How many five-person teams contain at least one man? 

c. How many five-person teams contain at most one man? 

Solution 

a. To answer this question, think of forming a team as a two-step process: 


Step 1: Choose the men. 
Step 2: Choose the women. 


There are (3) ways to choose the three men out of the five and (3) ways to choose the 
two women out of the seven. Hence, by the product rule, 


the number of teams of five that _—(5\/7\_ 5! 7! 
contain three men and two women Sy ND 312! 215! 


7 -7-6-5-4 
— A-121 


= 210. 


b. This question can also be answered either by the addition rule or by the difference 
rule. The solution by the difference rule is shorter and is shown first. 
Observe that the set of five-person teams containing at least one man equals the set 
difference between the set of all five-person teams and the set of five-person teams 
that do not contain any men. See Figure 9.5.5 on the next page. 
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Now a team with no men consists entirely of five women chosen from the seven 
women in the group, so there are (2) such teams. Also, by Example 9.5.4, the total num- 
ber of five-person teams is (2) = 792. Hence, by the difference rule, 


the number of teams the total number the number of teams 
with at least = | of teams — | of five that do not 
one man of five contain any men 


2\ A 7! 
= 792 
(2) (7) 51-2! 


3 
THA 


= 792 PERI = 792-21 = 771. 
This reasoning is summarized in Figure 9.5.5. 
There are 
All Five-Person Teams =< /12 

( 5 ) = 792 of these. 

teams that teams that There are 

contain at contain no men (2) 

least one man 5) =21 of these. 


A 
| 


So there are 
792 —21 = 771 of these. 


FIGURE 9.5.5 


Alternatively, to use the addition rule, observe that the set of teams containing 
at least one man can be partitioned as shown in Figure 9.5.6. The number of teams 
in each subset of the partition is calculated using the method illustrated in part (a). 
There are 


5 7 ; 
1 teams with one man and four women 
7 f 
teams with two men and three women 
teams with five men and no women. 


7 
( ) teams with three men and two women 


7 : 
teams with four men and one woman 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.5 COUNTING SUBSETS OF A SET: COMBINATIONS 


Hence, by the addition rule, 


ke number of teams 


with at least one man 


O 


5s! 7! 


GGN 


7! 5! 


625 


HONS 


5s! 7! 5! 7! 


~ 4114! 413! 


213! 314! 


312! 215! 


411! 116! | 5!0! O!7! 


5-41-7:6-5-41 sdai 5-4- -7-6-5 5-4-7-6 A-1 


“TMM3S1 LiAl AALA MA AA 


= 5-7-545-2-7-54+5-7:'64+5-74+1 = 175 +3504 210+ 354+1 = 771. 


This reasoning is summarized in Figure 9.5.6. 


Teams with At Least One Man 


teams with teams with teams with teams with teams with So the total number of 
one man two men three men four men five men teams with at least 
one man is 
175 + 350 + 210 + 35 + 1 = 771. 
| | | | | 
There are There are There are There are There are 
5X7 5\(7 D\S7 5\(7 S\ fT 
Dans GG) GG) =20 WG-5 G) (0) = 
of these. of these. of these. of these. of these. 
FIGURE 9.5.6 


c. As shown in Figure 9.5.7, the set of teams containing at most one man can be parti- 
tioned into the set without any men and the set with exactly one man. Hence, by the 
addition rule, 

the number of the number of 

teams without | + | teams with 

one man 


the number of teams 
with at 
most one man 


any men 


OOOO- 


This reasoning is summarized in Figure 9.5.7. 


Teams with At Most One Man 


So the total number of 


teams without teams with : 
any men one man teams with at most one 
man is 21 + 175 = 196. 
| | 
There are There are 
5\(7 5\(7 
(o) (5) =21 (1) G) =175 
of these. of these. 
FIGURE 9.5.7 a 
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An Application to Graphs 


a. Use the multiplication rule to show that K,,,,, a complete bipartite graph on (m, n) 


vertices, has mn edges. 


i i : =i 
b. Use 2-combinations to show that K,,, a complete graph on n vertices, has mn : edges. 


Solution 


a. A complete bipartite graph on (m, n) vertices, Knn, is a simple graph whose vertices 
can be divided into two distinct, nonoverlapping sets—say, V with m vertices and W 
with n vertices—in such a way that there is exactly one edge from each vertex of V 
to each vertex of W, there is no edge from any one vertex of V to any other vertex of V, 
and there is no edge from any one vertex of W to any other vertex of W. Label the m 
vertices of V as vj, Vo,..., Vm. Think of constructing the edges between the vertices 
of V and the vertices of W as an m-step process: for each integer k from 1 through m, 
draw exactly one edge from v; to each of the n edges of W. Because each step can be 
performed in n ways and there are m steps, by the multiplication rule there are mn ways 
to construct all the edges. Thus Km, n has mn edges. 


b. A complete graph on n vertices, K„, is a simple graph with n vertices and exactly one 

edge between each pair of vertices. If n = 1, then K, has one vertex and 0 edges, and 

; -1) 10-1 =i ‘ 

since aa yi 7 ) = 0, then K,, has oe” edges. If n = 2, then, since any two 
distinct vertices of K, are connected by exactly one edge, there are as many edges 


in K,, as there are subsets of size two in the set of n vertices. By Theorem 9.5.1, there 


n n! n(n — 1) 
2 2'\(n—2)! 2 
n(n — 1) 


2 
Hence K, has —z— edges. E 


eke Poker Hand Problems 


The game of poker is played with an ordinary deck of 52 cards (see Example 9.1.1). Various 
five-card holdings are given special names, and certain holdings beat certain other hold- 
ings. The named holdings are listed from highest to lowest below. 


Royal flush: 10, J, Q, K, A of the same suit 


Straight flush: five adjacent denominations of the same suit but not a royal flush— 
aces can be high or low, so A, 2, 3, 4, 5 of the same suit is a straight flush 


are ( ) such sets, and 


Four of a kind: four cards of one denomination—the fifth card can be any other in the deck 
Full house: three cards of one denomination, two cards of another denomination 
Flush: five cards of the same suit but not a straight or a royal flush 


Straight: five cards of adjacent denominations but not all of the same suit—aces can 
be high or low 


Three of a kind: three cards of the same denomination and two other cards of different 
denominations 


Two pairs: two cards of one denomination, two cards of a second denomination, and a 
fifth card of a third denomination 


One pair: two cards of one denomination and three other cards all of different 
denominations 


No pairs: all cards of different denominations but not a straight, or straight flush, or 
flush, or royal flush 
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a. How many five-card poker hands contain two pairs? 


b. If a five-card hand is dealt at random from an ordinary deck of cards, what is the 
probability that the hand contains two pairs? 


Solution 

a. Consider forming a hand with two pairs as a four-step process: 
Step 1: Choose the two denominations for the pairs. 
Step 2: Choose two cards from the smaller denomination. 
Step 3: Choose two cards from the larger denomination. 


Step 4: Choose one card from those remaining. 


(13 a ae 
The number of ways to perform step | is ( 5 ) because there are 13 denominations in 


all. The number of ways to perform each of steps 2 and 3 is (3) because there are four 
cards of each denomination, one in each suit. The number of ways to perform step 4 is 
($ ) because the fifth card is chosen from the eleven denominations not included in the 
pair and there are four cards of each denomination. Thus 


the total number of | _ { 13)(4)/(4)\( 44 
hands with two pairs 2/\2/\2/\ 1 
13! A! Al “A! 
21(13— 2)! 21(4-— 2)! 244-2)! 1{4-1)! 
13-12-11! 4-3-2! 4-3-2! 44-43! 
(2-1)-11! (2-1)-2! (2-1):-2! 1-43! 
= 78:6:6:44 = 123,552. 


b. The total number of five-card hands from an ordinary deck of cards is ear which 


equals 2,598,960. Thus if all hands are equally likely, the probability of obtaining a 


: ¿z 123,552 
hand with two pairs is 759g 960 = 4.75%. 


eee Number of Bit Strings with Fixed Number of 1’s 
How many eight-bit strings have exactly three |’s? 


Solution To solve this problem, imagine eight empty positions into which the 0’s and 1’s 
of the bit string will be placed. In step 1, choose positions for the three 1’s, and in step 2, 
put the 0’s into place. 


Three 1’s and 
five 0’s to be 
put into the 
positions 


1 2 3 4 5 6 7 8 


Once a subset of three positions has been chosen from the eight to contain 1’s, then the 
remaining five positions must all contain 0’s (since the string is to have exactly three 1’s). 
It follows that the number of ways to construct an eight-bit string with exactly three 1’s is 
the same as the number of subsets of three positions that can be chosen from the eight into 
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which to place the 1s. By Theorem 9.5.1, this equals 


(5) 7 aS F ree = 56. = 


Permutations of a Set with Repeated Elements 
Consider various ways of ordering the letters in the word MISSISSIPPI: 
IIMSSPISSIP, ISSSPMIIPIS, PIMISSSSIIP, and so on. 


How many distinguishable orderings are there? 


Solution This example generalizes Example 9.5.10. Imagine placing the 11 letters of 
MISSISSIPPI one after another into 11 positions. 


Letters of 
MISSISSIPPI 
to be placed 
into the 
positions 


1 2 3 4 5 6 7 8 9 10 11 


Because copies of the same letter cannot be distinguished from one another, once the posi- 
tions for a certain letter are known, then all copies of the letter can go into the positions 
in any order. It follows that constructing an ordering for the letters can be thought of as a 
four-step process: 


Step 1: Choose a subset of four positions for the S’s. 
Step 2: Choose a subset of four positions for the Ps. 
Step 3: Choose a subset of two positions for the P’s. 
Step 4: Choose a subset of one position for the M. 


11 
$ 7 
Once the four S’s are in place, there are seven positions that remain empty, so there are ( ‘) 


Since there are 11 positions in all, there are ( ) subsets of four positions for the S’s. 


subsets of four positions for the /’s. After the /’s are in place, there are three positions left 


2 
the M. But 1 = (). Hence by the multiplication rule, 


number of ways to aq il \W7\W3\! 
position all the letters 4 I\4/\2/\1 
1 Me ae 
41-7 41-34 21-7 1!-0f 


od 
~ ANeAL-2ted! 


empty, so there are ( ) subsets of two positions for the P’s. That leaves just one position for 


= 34,650. a 


In exercise 18 at the end of the section, you are asked to show that changing the order in 
which the letters are placed into the positions does not change the answer to this example. 

The same reasoning used in this example can be used to derive the following general 
theorem. 
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Theorem 9.5.2 Permutations with Sets of Indistinguishable Objects 


Suppose a collection consists of n objects of which 


n; are of type | and are indistinguishable from each other 


m are of type 2 and are indistinguishable from each other 


n, are of type k and are indistinguishable from each other, 


and suppose that nı +n. + ++: +n, = n. Then the number of distinguishable permu- 
tations of the n objects is 


n n— ny, Mo mi Ny no n n e 
ni np n3 nk 


n! 


Some Advice about Counting 


Students learning counting techniques often ask, “How do I know what to multiply and 
what to add? When do I use the multiplication rule and when do I use the addition rule?” 
Unfortunately, these questions have no easy answers. You need to imagine, as vividly as 
possible, the objects you are to count. In fact, it is helpful to start making a list of the items 
you need to count to get a sense for how to obtain them in a systematic way. You should 
then construct a model that would allow you to continue counting the objects one by one 
if you had enough time. If you can imagine the elements to be counted as being obtained 
through a multistep process (in which each step is performed in a fixed number of ways 
regardless of how preceding steps were performed), then you can use the multiplication 
rule. The total number of elements will be the product of the number of ways to perform 
each step. If, however, you can imagine the set of elements to be counted as being broken 
up into disjoint subsets, then you can use the addition rule. The total number of elements 
in the set will be the sum of the number of elements in each subset. 

One of the most common mistakes students make is to count certain possibilities more 


than once. 
elu eee Double Counting 

Consider again the problem of Example 9.5.7(b). A group consists of five men and seven 
women. How many teams of five contain at least one man? 

Caution! Be careful Incorrect Solution 

to avoid counting items Imagine constructing the team as a two-step process: 

twice when using the mul- 

tiplication rule. Step 1: Choose a subset of one man from the five men. 


Step 2: Choose a subset of four others from the remaining eleven people. 


11 


1) = 1,650 five-person teams that contain 


Gea 5 
Hence, by the multiplication rule, there are C) 
at least one man. 


Analysis of the Incorrect Solution The problem with the solution above is that some 
teams are counted more than once. Suppose the men are Anwar, Ben, Carlos, Dwayne, and 
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Ed and the women are Fumiko, Gail, Hui-Fan, Inez, Jill, Kim, and Laura. According to the 
method described previously, one possible outcome of the two-step process is as follows: 


Outcome of step 1: Anwar 
Outcome of step 2: Ben, Gail, Inez, and Jill. 


In this case the team would be {Anwar, Ben, Gail, Inez, Jill}. But another possible 
outcome is 


Outcome of step 1: Ben 
Outcome of step 2: Anwar, Gail, Inez, and Jill, 


which also gives the team {Anwar, Ben, Gail, Inez, Jill}. Thus this one team is given by 
two different branches of the possibility tree, and so it is counted twice. E 


The best way to avoid mistakes such as the one just described is to visualize the pos- 
sibility tree that corresponds to any use of the multiplication rule and the set partition that 
corresponds to a use of the addition rule. Check how your division into steps works by ap- 
plying it to some actual data—as was done in the analysis above—and try to pick data that 
are as typical or generic as possible. 

It often helps to ask yourself (1) “Am I counting everything?” and (2) “Am I counting 
anything twice?” When using the multiplication rule, these questions become (1) “Does 
every outcome appear as some branch of the tree?” and (2) “Does any outcome appear on 
more than one branch of the tree?” When using the addition rule, the questions become 
(1) “Does every outcome appear in some subset of the diagram?” and (2) “Do any two 
subsets in the diagram share common elements?” 


TEST YOURSELF 


1. The number of subsets of size r that can be formed 3. Two unordered selections are said to be the same 
from a set with n elements is denoted ; if the elements chosen are the same, regardless 


which is read as “ o of 
2. The number of r-combinations of a set of n ele- 4. A formula relating (") and p(n, r)is—— 
eee 5. The phrase “at least n” means , and the 
phrase “at most n” means ____. 
EXERCISE SET 9.5 
8 
1. a. List all 2-combinations for the set {x1, x2, x3}. 4. Write an equation relating P(8, 3) and (5). 
Deduce the value of G). 
b. List all unordered selections of four elements 5. Use Theorem 9.5.1 to compute each of the following. 
from the set {a, b, c, d, e}. Deduce the value of á ($) b ($) c (°) 
5 ” No PN 2 


i af) ef G 


2. a. List all 3-combinations for the set {x,, x2, x3, 


() 


X4, X5}. Deduce the value of £). 6. A student council consists of 15 students. 

b. List all unordered selections of two elements a. In how many ways can a committee of six be 
from the set {x1, X2, X3, X4, X5, X6}. Deduce the selected from the membership of the council? 
value of {,). b. Two council members have the same major 


and are not permitted to serve together on a 


7 
3. Write an equation relating P(7, 2) and : } ; 
2 committee. How many ways can a committee 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


of six be selected from the membership of 
the council? 

c. Two council members always insist on serving 
on committees together. If they can’t serve 
together, they won’t serve at all. How many 
ways can a committee of six be selected from 
the council membership? 

d. Suppose the council contains eight men and 
seven women. 

(i) How many committees of six contain 
three men and three women? 

(ii) How many committees of six contain at 
least one woman? 

e. Suppose the council consists of three fresh- 
men, four sophomores, three juniors, and five 
seniors. How many committees of eight con- 
tain two representatives from each class? 


7. A computer programming team has 13 members. 


a. How many ways can a group of seven be cho- 
sen to work on a project? 

b. Suppose seven team members are women and 
six are men. 

(i) How many groups of seven can be chosen 
that contain four women and three men? 
Gi) How many groups of seven can be chosen 
that contain at least one man? 
(iii) How many groups of seven can be chosen 
that contain at most three women? 

c. Suppose two team members refuse to work to- 
gether on projects. How many groups of seven 
can be chosen to work on a project? 

d. Suppose two team members insist on either 
working together or not at all on projects. How 
many groups of seven can be chosen to work 
on a project? 


H 8. An instructor gives an exam with fourteen questions. 


Students are allowed to choose any ten to answer. 

a. How many different choices of ten questions 
are there? 

b. Suppose six questions require proof and eight 
do not. 

(i) How many groups of ten questions con- 
tain four that require proof and six that 
do not? 

(ii) How many groups of ten questions con- 
tain at least one that requires proof? 
(iii) How many groups of ten questions con- 
tain at most three that require proof? 
c. Suppose the exam instructions specify that at 
most one of questions | and 2 may be included 
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10. 


Tl. 


12. 


13. 


among the ten. How many different choices of 
ten questions are there? 

d. Suppose the exam instructions specify that 
either both questions 1 and 2 are to be included 
among the ten or neither is to be included. 
How many different choices of ten questions 
are there? 


A club is considering changing its bylaws. In an 
initial straw vote on the issue, 24 of the 40 mem- 
bers of the club favored the change and 16 did not. 
A committee of six is to be chosen from the 40 club 
members to devote further study to the issue. 
a. What is the total number of committees of six 
that can be formed from the club membership? 
b. How many of the total number of committees 
will contain at least three club members who, 
in the preliminary survey, favored the change 
in the bylaws? 


Two new drugs are to be tested using a group of 
60 laboratory mice, each tagged with a number 
for identification purposes. Drug A is to be given 
to 22 mice, drug B is to be given to another 22 
mice, and the remaining 16 mice are to be used 
as controls. How many ways can the assignment 
of treatments to mice be made? (A single assign- 
ment involves specifying the treatment for each 
mouse—whether drug A, drug B, or no drug.) 


Refer to Example 9.5.9. For each poker holding be- 

low, (1) find the number of five-card poker hands 

with that holding; (2) find the probability that a 

randomly chosen set of five cards has that holding. 

a. royal flush b. straight flush 

c. four ofakind d. full house e. flush 

f. straight (including a straight flush and a royal 
flush) 

g- three ofakind h. one pair 

neither a repeated denomination nor five of the 

same suit nor five adjacent denominations 


How many pairs of two distinct integers chosen from 
the set {1, 2, 3,..., 101} have a sum that is even? 


A coin is tossed ten times. In each case the 

outcome H (for heads) or T (for tails) is recorded. 

(One possible outcome of the ten tosses is denoted 

THHTTTHTTHĦ) 

a. What is the total number of possible outcomes 
of the coin-tossing experiment? 

b. In how many of the possible outcomes are 
exactly five heads obtained? 
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14. 


15. 


16. 


17. 


18. 
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c. In how many of the possible outcomes are at 
least eight heads obtained? 

d. In how many of the possible outcomes is at 
least one head obtained? 

e. In how many of the possible outcomes is at 
most one head obtained? 


a. How many 16-bit strings contain exactly seven 
I’s? 

b. How many 16-bit strings contain at least thir- 
teen |’s? 

c. How many 16-bit strings contain at least one 1? 

d. How many 16-bit strings contain at most 
one 1? 


a. How many even integers are in the set 
{1, 2, 3,..., LOO}? 


b. How many odd integers are in the set 
{1, 2, 3,..., LOO}? 


c. How many ways can two integers be selected 
from the set {1, 2, 3,..., 100} so that their sum 
is even? 

d. How many ways can two integers be selected 
from the set {1, 2, 3,..., 100} so that their sum 
is odd? 


Suppose that three microchips in a production run 

of forty are defective. A sample of five is to be 

selected to be checked for defects. 

a. How many different samples can be chosen? 

b. How many samples will contain at least one 
defective chip? 

c. What is the probability that a randomly chosen 
sample of five contains at least one defective 
chip? 


Ten points labeled A, B, C, D, E, F, G, H, I, J are 

arranged in a plane in such a way that no three lie 

on the same straight line. 

a. How many straight lines are determined by the 
ten points? 

b. How many of these straight lines do not pass 
through point A? 

c. How many triangles have three of the ten 
points as vertices? 

d. How many of these triangles do not have point 
A as a vertex? 


Suppose that you placed the letters in Example 
9.5.11 into positions in the following order: first 
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19. 


20. 


21. 


22. 


23. 


24. 


the M, then the Ps, then the S’s, and then the P’s. 
Show that you would obtain the same answer for 
the number of distinguishable orderings. 


a. How many distinguishable ways can the letters 
of the word HULLABALOO be arranged in 
order? 

b. How many distinguishable orderings of the let- 
ters of HULLABALOO begin with U and end 
with L? 

c. How many distinguishable orderings of the let- 
ters of HULLABALOO contain the two letters 
HU next to each other in order? 


a. How many distinguishable ways can the letters 
of the word MILLIMICRON be arranged in 
order? 

b. How many distinguishable orderings of the let- 
ters of MILLIMICRON begin with M and end 
with N? 

c. How many distinguishable orderings of the let- 
ters of MILLIMICRON contain the letters CR 
next to each other in order and also the letters 
ON next to each other in order? 


In Morse code, symbols are represented by vari- 
able-length sequences of dots and dashes. (For ex- 
ample, A = -—-,1=- ,? 5 e se) 
How many different symbols can be represented by 
sequences of seven or fewer dots and dashes? 


Each symbol in the Braille code is represented 
by a rectangular arrangement of six dots, each 
of which may be raised or flat against a smooth 
background. For instance, when the word Braille 
is spelled out, it looks like this: 


Given that at least one of the six dots must be 
raised, how many symbols can be represented in 
the Braille code? 


On an 8 X 8 chessboard, a rook is allowed to 
move any number of squares either horizontally 
or vertically. How many different paths can a rook 
follow from the bottom-left square of the board to 
the top-right square of the board if all moves are 
to the right or upward? 


The number 42 has the prime factorization 2:3-7. 
Thus 42 can be written in four ways as a product 
of two positive integer factors (without regard to 


25. 


the order of the factors): 1-42, 2-21, 3-14, and 

6-7. Answer a—d below without regard to the 

order of the factors. 

a. List the distinct ways the number 210 can be 
written as a product of two positive integer 
factors. 

b. Ifn = pı p2p3p4, where the p; are distinct 
prime numbers, how many ways can n be writ- 
ten as a product of two positive integer factors? 

c. Ifn = pı p2p3p4Ps, where the p; are distinct 
prime numbers, how many ways can n be writ- 
ten as a product of two positive integer factors? 

d. Ifn =p\p2... pg, where the p; are distinct 
prime numbers, how many ways can n be 
written as a product of two positive integer 
factors? 


a. How many one-to-one functions are there from 
a set with three elements to a set with four ele- 
ments? 

b. How many one-to-one functions are there from 
a set with three elements to a set with two ele- 
ments? 

c. How many one-to-one functions are there from 
a set with three elements to a set with three 
elements? 

d. How many one-to-one functions are there 
from a set with three elements to a set with five 
elements? 


He. How many one-to-one functions are there from 
a set with m elements to a set with n elements, 
where m = n? 
26. a. How many onto functions are there from a set 


ANSWERS FOR TEST YOURSELF 


1. ("); nchooser 2. (") (Or: n choose r) 3. the order in which they are chosen 4. (" 
P 


with three elements to a set with two elements? 
b. How many onto functions are there from a set 
with three elements to a set with five elements? 
How many onto functions are there from a set 
with three elements to a set with three elements? 
d. How many onto functions are there from a set 

with four elements to a set with two elements? 
e. How many onto functions are there from 

a set with four elements to a set with three 

elements? 
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H* f. Let Cmn be the number of onto functions from 


27. 


H*28. 


* 29. 


30. 


a set of m elements to a set of n elements, 
where m = n = 1. Find a formula relating c,, , 
to Cm—1,n and Cm—1,n—1+ 


Let A be a set with eight elements. 

a. How many relations are there on A? 

b. How many relations on A are reflexive? 

c. How many relations on A are symmetric? 

d. How many relations on A are both reflexive 
and symmetric? 


A student council consists of three freshmen, four 
sophomores, four juniors, and five seniors. How 
many committees of eight members of the council 
contain at least one member from each class? 


An alternative way to derive Theorem 9.5.1 uses 
the following division rule: Let n and k be integers 
so that k divides n. If a set consisting of n ele- 
ments is divided into subsets that each contain k 
elements, then the number of such subsets is n/k. 
Explain how Theorem 9.5.1 can be derived using 
the division rule. 


Find the error in the following reasoning: “Con- 
sider forming a poker hand with two pairs as a 
five-step process. 


Step |: Choose the denomination of one of the 
pairs. 

Step 2: Choose the two cards of that denomination. 

Step 3: Choose the denomination of the other of 
the pairs. 

Step 4: Choose the two cards of that second de- 
nomination. 

Step 5: Choose the fifth card from the remaining 
denominations. 


There are (5) ways to perform step 1, (3) ways 


1 
to perform step 2, | | ) ways to perform step 

3, (3) ways to perform step 4, and (+) ways 

to perform step 5. Therefore, the total num- 
ber of five-card poker hands with two pairs is 
13-6-12-6-44 = 247,104.” 


_ P(n,r) 


r! 


5. n or more; n or fewer 
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EXJ r-Combinations with Repetition Allowed 


The value of mathematics in any science lies more in disciplined analysis and 
abstract thinking than in particular theories and techniques. —Alan Tucker, 1982 


In Section 9.5 we showed that there are (") r-combinations, or subsets of size r, of a set of n 
elements. In other words, there are (") ways to choose r distinct elements without regard to 
order from a set of n elements. For instance, there are (3) = 4 ways to choose three elements 
out of a set of four: {1, 2, 3}, {1, 2, 4}, {1, 3, 4}, {2, 3, 4}. 

In this section we ask: How many ways are there to choose r elements without regard to 
order from a set of n elements if repetition is allowed? A good way to imagine this is to vi- 
sualize the n elements as categories of objects from which multiple selections may be made. 
For instance, if the categories are labeled 1, 2, 3, and 4 and three elements are chosen, it is 
possible to choose two elements of type 3 and one of type 1, or all three of type 2, or one 
each of types 1, 2, and 4, and so forth. We denote such choices by [3, 3, 1], [2, 2, 2], and [1, 
2, 4], respectively. Note that because order does not matter, [3, 3, 1] = [3, 1, 3] = [1, 3, 3], 
for example. 


Definition and Notation 


An r-combination with repetition allowed, or multiset of size r, chosen from a set 
X of n elements is an unordered selection of elements taken from X with repetition 
allowed. If X = {x1, X2, . . . , Xn}, we write an r-combination with repetition allowed, 
or multiset of size r, as Lx;,, x;,,..., x;] where each x;, is in X and some of the x;, may 
equal each other. 


r-Combinations with Repetition Allowed 


Write a complete list to find the number of 3-combinations with repetition allowed, or 
multisets of size 3, that can be selected from {1, 2, 3, 4}. Observe that because the order 
in which the elements are chosen does not matter, the elements of each selection may be 
written in increasing order, and writing the elements in increasing order will ensure that 
no combinations are overlooked. 


Solution 


{1, 1, 1]; [1, 1, 2); 1, 1, 3]; 0, 1, 4] all combinations that start with 1, 1 


[1, 2, 2]; [1, 2, 3]; [1, 2, 4]; all additional combinations that start with 1, 2 

[1, 3, 3]; (1, 3, 4]; D, 4 4]; all additional combinations that start with 1, 3 or 1, 4 

(2, 2, 2]; B, 2, 3]; B, 2, 4]; all additional combinations that start with 2, 2 

[2, 3, 3]; (2, 3, 4]; [2; 4, 4]; all additional combinations that start with 2, 3 or 2, 4 

(3, 3, 3]; [3, 3, 4]; B, 4, 4]; all additional combinations that start with 3, 3 or 3, 4 

[4, 4, 4] the only additional combination that starts with 4, 4. 

Thus there are twenty 3-combinations with repetition allowed. E 


How could the number twenty have been predicted other than by making a complete 
list? Consider the numbers 1, 2, 3, and 4 as categories and imagine choosing a total of three 
numbers from the categories with multiple selections from any category allowed. The re- 
sults of several such selections are represented by the table on the next page. 
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Category 1 Category 2 Category 3 Category 4 | Result of the Selection 


| x | | x x one from category 2 
two from category 4 


x | | x | x one each from 
categories 1, 3, and 4 


xxx | | | three from category 1 


As you can see, each selection of three numbers from the four categories can be repre- 
sented by a string of vertical bars and crosses. Three vertical bars are used to separate the 
four categories, and three crosses are used to indicate how many items from each category 
are chosen. Each distinct string of three vertical bars and three crosses represents a distinct 
selection. For instance, the string 


x x||x| 


represents the selection: two from category 1, none from category 2, one from category 3, 
and none from category 4. Thus the number of distinct selections of three elements that can 
be formed from the set {1, 2, 3, 4} with repetition allowed equals the number of distinct 
strings of six symbols consisting of three |’s and three <’s. And this equals the number of 
ways to select three positions out of six because once three positions have been chosen for 
the X’s, the |’s are placed in the remaining three positions. Thus the answer is 


() E T 3) A 7 


as was obtained earlier by a careful listing. 

The analysis of this example extends to the general case. To count the number of r- 
combinations with repetition allowed, or multisets of size r, that can be selected from a set 
of n elements, think of the elements of the set as categories. Then each r-combination with 
repetition allowed can be represented as a string of n— 1 vertical bars (to separate the n 
categories) and r crosses (to represent the r elements to be chosen). The number of X’s in 
each category represents the number of times the element represented by that category is 
repeated. 


20, 


Category 1 Category 2 Category 3 tee Category n — 1 Category n 


r x’s to be placed in categories 


The number of strings of n — 1 vertical bars and r crosses is the number of ways to choose 
r positions, into which to place the r crosses, out of a total of r+ (n — 1) positions, leav- 


ing the remaining positions for the vertical bars. And by Theorem 9.5.1, this number is 
r+n— 1); 


This discussion proves the following theorem. 
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Theorem 9.6.1 


The number of r-combinations with repetition allowed (or multisets of size r) that 
can be selected from a set of n elements is 


(or 


This equals the number of ways r objects can be selected from n categories of objects 
with repetition allowed. 


Selecting 15 Cans of Soft Drinks of Five Different Types 


A person giving a party wants to set out 15 assorted cans of soft drinks for his guests. He 
shops at a store that sells five different types of soft drinks. 


a. How many different selections of cans of 15 soft drinks can he make? 


b. If root beer is one of the types of soft drink, how many different selections include at 
least six cans of root beer? 


c. If the store has only five cans of root beer but at least 15 cans of each other type of 
soft drink, how many different selections are there? 


Solution 


a. Think of the five different types of soft drinks as the n categories and the 15 cans of 
soft drinks to be chosen as the r objects (so n = 5 and r = 15). Each selection of cans 
of soft drinks is represented by a string of 5 — 1 = 4 vertical bars (to separate the 
categories of soft drinks) and 15 crosses (to represent the cans selected). For instance, 
the string 


x x xX|xX xX xX xX xX xX xXi||x xX xX|x x 


represents a selection of three cans of soft drinks of type 1, seven of type 2, none of 
type 3, three of type 4, and two of type 5. The total number of selections of 15 cans of 
soft drinks of the five types is the number of strings of 19 symbols, 5 — 1 = 4 of them | 
and 15 of them X: 


6 2 
15+5=1 19! 1 19-18-17-16- 1! 
(S5-1) iat (0) BIEN IE rg 


15 1519-15) 15 51-A- 3-2-1 


b. If at least six cans of root beer are to be included in the selection, you can imagine 
choosing six such cans first and then choosing nine additional cans. The choice of the 
nine additional cans can be represented as a string of 9 X’s and 4 |’s. For example, if 
root beer is type 1, then the string X X X || X X |X X X x | represents a selection 
of three cans of root beer (in addition to the six chosen initially), none of type 2, two 
of type 3, four of type 4, and none of type 5. Thus the total number of selections of 
15 cans of soft drinks of the five types, including at least six cans of root beer, is the 
number of strings of 13 symbols, 4 (=5 — 1) of them | and 9 of them x: 


= 715. 


5 
°5") g 5103-9 7 (5) E veri 
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c. If the store has only five cans of root beer, then the number of different selections of 
15 cans of soft drinks of the five types is the same as the number of different selec- 
tions that contain five or fewer cans of root beer. Let T be the set of all possible 
selections assuming that there are at least 15 cans of each type, let R=; be the set of 
selections from T that contain five or fewer cans of root beer, and let R= be the set of 
selections from T that contain six or more cans of root beer. Then 


T= Res U Rz=6 and Res N R=6 = Ø. 
By part (a) N(T) = 3,876 and by part (b) N(R=6) = 715. Thus, by the difference rule, 
N(Res) = N(T) — N(R=6) = 3,876 — 715 = 3,161. 


So, when there are only five or fewer cans of root beer, the number of different selec- 
tions of soft drinks is 3,161. E 


Counting Triples (i, j, k) with1 < i £ j S5 k = n 


If n is a positive integer, how many triples of integers from 1 through n can be formed in 
which the elements of the triple are written in increasing order but are not necessarily dis- 
tinct? In other words, how many triples of integers (i, j, k) are there with 1 S i Sj S k S n? 


Solution Any triple of integers (i, j, k) with 1 S i S j S k Sn can be represented as a 
string of n — 1 vertical bars and three crosses, with the positions of the crosses indicating 
which three integers from 1 to n are included in the triple. The table below illustrates this 


forn = 5. 
Category denis T 
1 2 3 4 5 esult of the Selection 
| | xx | | x (3, 3, 5) 
x | x | | | (1,2, 4) 
Thus the number of such triples is the same as the number of strings of (7 — 1) |’s and 
3 X’s, which is 
3+(n=1)\_/n+2\_ (a+?! 
3 3 3'(n +2 —3)! 
(nt+2)\(nt+D)na—t!  n(nt+1)(n+2) = 


3\(n—T)! 6 


Note that in Examples 9.6.2 and 9.6.3 the reasoning behind Theorem 9.6.1 was used 
rather than the statement of the theorem itself. Alternatively, in either example we could 
invoke Theorem 9.6.1 directly by recognizing that the items to be counted either are 
r-combinations with repetition allowed or are the same in number as such combinations. 
For instance, in Example 9.6.3 we might observe that there are exactly as many triples 
of integers (i, j, k) with 1 <i <j =k <n as there are 3-combinations of integers from 1 
through n with repetition allowed because the elements of any such 3-combination can be 
written in increasing order in only one way. 


EU eee Counting Iterations of a Loop 


How many times will the innermost loop be iterated when the algorithm segment below is 
implemented and run? (Assume n is a positive integer.) 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


638 CHAPTER9 COUNTING AND PROBABILITY 


for k:= 1 ton 
for j := 1tok 
for i := 1 toj 


[Statements in the body of the inner loop, 
none containing branching statements that lead 
outside the loop] 


next i 
next j 
next k 


Solution Construct a trace table for the values of k, j, and i for which the statements in 
the body of the innermost loop are executed. 


k 1 2 el) “3 >| ote n $ 
j 1 1 2== 1 2] 3) i 1 p E a er n 
i 1 1 1 2 1 1 2 1 2 3 tee 1 1 2 tee 1 tee n 


Because i goes from | to j, it is always the case that i S j. Similarly, because j goes 
from 1 to k, it is always the case that j = k. To focus on the details of the table construc- 
tion, consider what happens when k = 3. In this case, j takes each value 1, 2, and 3. When 
j= 1, ican only take the value 1 (because į = j). When j = 2, i takes each value 1 and 2 
(again because i = j). When j = 3, i takes each value 1, 2, and 3 (yet again because i < j). 

Observe that there is one iteration of the innermost loop for each column of the table, and 
there is one column of the table for each triple of integers (i, j, k) with | =isSjSk=n. 
Now Example 9.6.3 showed that the number of such triples is [n(n + 1)(n + 2)]/6. Thus 
there are [n(n + 1)(n + 2)]/6 iterations of the innermost loop. E 


The solution in Example 9.6.4 is elegant and generalizable. (See exercises 8 and 9.) An 
alternative solution using summations is outlined in exercise 21. 


The Number of Integral Solutions of an Equation 


How many solutions are there to the equation x, + x + x3 + x4 = 10 if x, x5, x3, and x; are 
nonnegative integers? 


Solution Think of the number 10 as divided into ten individual units and the variables 
Xi, X2, X3, and x4 as four categories into which these units are placed. The number of units 
in each category x; indicates the value of x; in a solution of the equation. Each solution can, 
then, be represented by a string of three vertical bars (to separate the four categories) and 
ten crosses (to represent the ten individual units). For example, in the following table, the 
two crosses under x4, five crosses under xz, and three crosses under x, represent the solu- 
tion x; = 2, x = 5, x; = 0, and x4 = 3. 


Categories r : 
xı xX x3 X4 Solution to the Equation x, +x, + x3 + x4 = 10 
x xX | XXXXX | | XXX] x, =2, x2=5, x,=0, and x4=3 
xxxx | XXXXxXXxXX | | x =4, x2=6, x =0, and x=0 
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Therefore, there are as many solutions to the equation as there are strings of ten crosses 
and three vertical bars, namely, 


10+3\ /13\_ 13! 13-12-11-40 
10 10) 10113-10!  10f-3-2-1 


286. B 
Example 9.6.6 illustrates a variation on Example 9.6.5. 


See AR Additional Constraints on the Number of Solutions 
How many integer solutions are there to the equation x, + x2 + x3 + x4 = 10 if each x; = 1? 


Solution In this case imagine starting by putting one cross in each of the four categories. 
Then distribute the remaining six crosses among the categories. Such a distribution can be 
represented by a string of three vertical bars and six crosses. For example, the string 


xxx| |x x|x 


indicates that there are three more crosses in category x, in addition to the one cross al- 
ready there (so x; = 4), no more crosses in category x, in addition to the one already there 
(so x2 = 1), two more crosses in category x3 in addition to the one already there (so x3 = 3), 
and one more cross in category x4 in addition to the one already there (so x4 = 2). It follows 
that the number of solutions to the equation that satisfy the given condition is the same as 
the number of strings of three vertical bars and six crosses, namely, 


(5°) 4 9! 9-8-7-64 si 
6 | \6/ 69-6)! 63-21 `” 


An alternative solution to this example is based on the observation that since each x; = 1, 
we may introduce new variables y; = x;— 1 for each i = 1, 2, 3, 4. Then each y; = 0, and 
yı ty. + y3 + y4 = 6. Thus the number of solutions of yı + y2 + y3 + y4 = 6 in nonnega- 
tive integers is the same as the number of solutions of x; + x2 + x3 + x4 = 10 in positive 
integers. E 


Remark: Deciding Which Formula to Use 


In Sections 9.2, 9.3, 9.5, and 9.6 we discussed four different ways of choosing k elements 
from n. The order in which the choices are made may or may not matter, and repetition 
may or may not be allowed. The following table summarizes which formula to use in 
which situation. 


Order Matters Order Does Not 
Matter 
ae k k+n-1 

Repetition Is Allowed n k 

Repetition Is Not Allowed | P(n, k) (") 

TEST YOURSELF 
1. Given a set X = {x),X,...,x,}, an r-combination 2. If X = {xj,X,...,x,}, the number of 
with repetition allowed, or a multiset of size r, r-combinations with repetition allowed (or 


chosen from X is , which is denoted multisets of size r) chosen from X is 
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3. 


EXERCISE SET 9.6 


1. 


When choosing k elements from a set of n ele- 
ments, order may or may not matter and repetition 
may or may not be allowed. 


e The number of ways to choose the k elements 
when repetition is allowed and order matters 
is 


e The number of ways to choose the k elements 
when repetition is not allowed and order mat- 
ters is ___ 


e The number of ways to choose the k elements 
when repetition is not allowed and order does 
not matter is ___. 


e The number of ways to choose the k elements 
when repetition is allowed and order does not 
matter is 


a. According to Theorem 9.6.1, how many 
5-combinations with repetition allowed can be 
chosen from a set of three elements? 

b. List all of the 5-combinations that can be 
chosen with repetition allowed from the set 
{1, 2, 3}. 


. a. According to Theorem 9.6.1, how many 


multisets of size four can be chosen from a set 
of three elements? 

b. List all of the multisets of size four that can be 
chosen from the set {x, y, z}. 


A bakery produces six different kinds of pastry, 

one of which is éclairs. Assume there are at least 

20 pastries of each kind. 

a. How many different selections of twenty 
pastries are there? 

b. How many different selections of twenty pas- 
tries are there if at least three must be éclairs? 

c. How many different selections of twenty 
pastries contain at most two éclairs? 


. A camera shop stocks eight different types of bat- 


teries, one of which is type A76. Assume there are 

at least 30 batteries of each type. 

a. How many ways can a total inventory of 30 
batteries be distributed among the eight differ- 
ent types? 

b. How many ways can a total inventory of 30 
batteries be distributed among the eight differ- 
ent types if the inventory must include at least 
four A76 batteries? 

c. How many ways can a total inventory of 30 
batteries be distributed among the eight dif- 
ferent types if the inventory includes at most 
three A76 batteries? 


. Ifnis a positive integer, how many 4-tuples of 


integers from 1 through n can be formed in which 
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the elements of the 4-tuple are written in increas- 
ing order but are not necessarily distinct? In other 
words, how many 4-tuples of integers (i, j, k, m) 
are there with | =isSjsSksSm=n? 


. Ifnis a positive integer, how many 5-tuples of 


integers from 1 through n can be formed in which 
the elements of the 5-tuple are written in decreas- 
ing order but are not necessarily distinct? In other 
words, how many 5-tuples of integers (h, i, j, k, m) 
are there with n =h2>iz=jzk=m=21? 


. Another way to count the number of nonnega- 


tive integral solutions to an equation of the form 
xX; +x, +: +x, = mis to reduce the problem to 
one of finding the number of n-tuples (y1, yo,..., 
Yp) with 0 Sy, Sy. S 4 Sy, =m. The reduc- 
tion results from letting y; = xı +x) +--+: +x; for 
each į = 1,2,...,n. Use this approach to derive 
a general formula for the number of nonnegative 
integral solutions to x, +x, ++: +x, =m. 


In 8 and 9, how many times will the innermost loop be 
iterated when the algorithm segment is implemented and 
run? Assume n, m, k, and j are positive integers. 


8. for m:= 1 ton 


for k := 1 tom 
for j := 1 tok 
for i := 1 toj 
[Statements in the body of the inner 
loop, none containing branching 
statements that lead outside the 
loop] 
next i 
next j 
next k 
next m 


9. 


for k:= 1 ton 
for j:= kton 
for i := j ton 
[Statements in the body of the inner loop, 


none containing branching statements 
that lead outside the loop] 


next i 
next j 
next k 


In 10-14, find how many solutions there are to the given 
equation that satisfy the given condition. 


10. 
11. 
12. 


13. 


14. 


*15; 


16. 


ANSWERS FOR TEST YOURSELF 


1. an unordered selection of elements taken from X with repetition allowed; [x;, X}, . . 


xı +X. + x3 = 20, each x; is a nonnegative integer. 
xı +x + x3 = 20, each x; is a positive integer. 


yı + y2 + y3 + y4 = 30, each y; is a nonnegative 
integer. 


yı + y2 + y3 + y4 = 30, each y; is an integer that is 
at least 2. 


a+b+c+d+e = 500, each of a, b, c, d, and e is 
an integer that is at least 10. 


For how many integers from 1 through 99,999 is 
the sum of their digits equal to 10? 


Consider the situation in Example 9.6.2. 

a. Suppose the store has only six cans of lem- 
onade but at least 15 cans of each of the other 
four types of soft drink. In how many dif- 
ferent ways can fifteen cans of soft drink be 
selected? 

b. Suppose that the store has only five cans of 
root beer and only six cans of lemonade but at 
least 15 cans of each of the other three types 
of soft drink. In how many different ways can 
fifteen cans of soft drink be selected? 


. a. A store sells 8 colors of balloons with at least 


30 of each color. How many different combina- 
tions of 30 balloons can be chosen? 

b. If the store has only 12 red balloons but at least 
30 of each other color of balloon, how many 
combinations of balloons can be chosen? 

c. Ifthe store has only 8 blue balloons but at least 
30 of each other color of balloon, how many 
combinations of balloons can be chosen? 
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d. If the store has only 12 red balloons and only 
8 blue balloons but at least 30 of each other 
color of balloon, how many combinations of 
balloons can be chosen? 


18. A large pile of coins consists of pennies, nickels, 


20. 


21. 


dimes, and quarters. 

a. How many different collections of 30 coins 
can be chosen if there are at least 30 of each 
kind of coin? 

b. If the pile contains only 15 quarters but at least 
30 of each other kind of coin, how many col- 
lections of 30 coins can be chosen? 

c. Ifthe pile contains only 20 dimes but at least 
30 of each other kind of coin, how many col- 
lections of 30 coins can be chosen? 

d. If the pile contains only 15 quarters and only 
20 dimes but at least 30 of each other kind of coin, 
how many collections of 30 coins can be chosen? 


Suppose the bakery in exercise 3 has only ten 

éclairs but has at least twenty of each of the other 

kinds of pastry. 

a. How many different selections of twenty pas- 
tries are there? 

b. Suppose in addition to having only ten éclairs, the 
bakery has only eight napoleon slices. How many 
different selections of twenty pastries are there? 


Suppose the camera shop in exercise 4 can obtain 
at most ten A76 batteries but can get at least 30 of 
each of the other types. 

a. How many ways can a total inventory of 30 
batteries be distributed among the eight differ- 
ent types? 

b. Suppose that in addition to being able to obtain 
only ten A76 batteries, the store can get only 
six of type D303. How many ways can a total 
inventory of 30 batteries be distributed among 
the eight different types? 


Observe that the number of columns in the trace 
table for Example 9.6.4 can be expressed as the sum 


L+(1+2)+(1 +243) + 00° +0+2+ +n). 


Explain why this is so, and show how this sum 
simplifies to the same expression given in the 
solution of Example 9.6.4. (Hint: A formula from 
exercise 13 in Section 5.2 will be helpful.) 


. , X; ] where each x; is in X and some of 


the x; may equal each other 2. aaa 1) 3. ns n(n —1X(n— 2): (n—k+ 1) (Or: P(n, K); (o); (m 1) 


r 
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Pascal’s Formula and the Binomial Theorem 


I’m very well acquainted, too, with matters mathematical, I understand equations both 
the simple and quadratical. About binomial theorem I am teaming with a lot of news. 
—William S. Gilbert, The Pirates of Penzance, 1880 

In this section we derive several formulas for values of () The most important is Pascal’s 
formula, which is the basis for Pascal’s triangle and is a crucial component of one of the 
proofs of the binomial theorem. We offer two distinct proofs for both Pascal’s formula and 
the binomial theorem. One of them is called “algebraic” because it relies to a great extent 
on algebraic manipulation, and the other is called “combinatorial,” because it is based on 
the kind of counting arguments we have been discussing in this chapter. 


n n n 
Scam Values of ( h ( ), ( ) 
n)’ \n n 


e =2 


Think of Theorem 9.5.1 as a general template: Regardless of what nonnegative integers are 
placed in the boxes, if the integer in the lower box is no greater than the integer in the top 


box, then 
O ! 
S) ovO-oy 


Use Theorem 9.5.1 to show that for every integer n = 0, 


() 
=1 9.7.1 
n 
n 
Ge =n, ifn=2al 9.7.2 
(e ; = ae ifn =2. 9.7.3 
Solution 
n ní 1 ‘jaca = 
i = ee = 0! =1 since 0! = 1 by definition 
n n! 
n=1) (n-=1DNn-(n-1))! 
— (nijn -n+1) 1 
n _ n! 
n-2) (n—2)\(n—(n—2))! 
_a(n—t): !_ atn—1) = 


(n—2)!2! 2 


Note that the result derived algebraically above, that (") equals 1, agrees with the fact 
that a set with n elements has just one subset of size n, namely, itself. Similarly, exercise 1 
at the end of the section asks you to show algebraically that (") = 1, which agrees with the fact 
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that a set with n elements has one subset, the empty set, of size 0. In exercise 2 you are also 
asked to show algebraically that (") =n. This result agrees with the fact that there are n 
subsets of size 1 that can be chosen from a set with n elements, namely the subsets consist- 


ing of each element taken alone. 

n n 
Example 9.7.2 ( = ( ) 
r n-r 


In exercise 5 at the end of the section you are asked to verify algebraically that 


(=l 


for all nonnegative integers n and r with r S n. 

An alternative way to deduce this formula is to interpret it as saying that a set A with 
n elements has exactly as many subsets of size r as it has subsets of size n — r. Derive the 
formula using this reasoning. 


Solution Observe that any subset of size r can be specified either by saying which r ele- 
ments lie in the subset or by saying which n — r elements lie outside the subset. 


A, a Set with n Elements 


Any subset B with r 


D OR a = i i subse elements completely 
with r withn-r determines a subset, 
elements elements 


A — B, with n — r elements. 


Suppose A has k subsets of size r: B4, By,..., B,. Then each B; can be paired up with 
exactly one set of size n — r, namely, its complement A — B;, as shown below. 
Subsets of Sizer Subsets of Size n -r 
B; <————_> A-B; 


B, <> A-B, 


B,<——> A-B, 


All subsets of size r are listed in the left-hand column, and all subsets of size n — r are 
listed in the right-hand column. The number of subsets of size r equals the number of sub- 


š n n 
sets of size n — r, and so (") = E 


The type of reasoning used in this example is called combinatorial, because it is ob- 
tained by counting things that are combined in different ways. A number of theorems have 
both combinatorial proofs and proofs that are purely algebraic. 


Pascal’s Formula 


Pascal’s formula, named after the seventeenth-century French mathematician and philoso- 
pher Blaise Pascal, is one of the most famous and useful in combinatorics (which is the 

: Sai n+1 
formal term for the study of counting and listing problems). It relates the value of ( i ) to 
the values of fe :) and ("). Specifically, it says that 


Blaise Pascal nl = n + . 
(1623-1662) r P= r 
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whenever n and r are positive integers with r = n. This formula makes it easy to compute 
higher combinations in terms of lower ones: If all the values of (") are known, then the 
values of ( H 3) can be computed for every integer r such that 0 < r £ n. 

Pascal’s triangle, shown in Table 9.7.1, is a geometric version of Pascal’s formula. Some- 
times it is simply called the arithmetic triangle because it was used centuries before Pascal 
by Chinese and Persian mathematicians. But Pascal discovered it independently, and ever 
since 1654, when he published a treatise that explored many of its features, it has generally 
been known as Pascal’s triangle. 


TABLE 9.7.1 Pascal’s Triangle for Values of (") 


~ 0 1 2 3 4 5 r-1 r 
0 1 

1 1 

2 1 2 1 

3 1 3 3 1 

4 1 4 6 + 4 1 

5 1 5 10 | = 10 5 1 


A () G) 
a) 


O Q @ 
Cele nc, Mae ne Oe 


n 
r 


Each entry in the triangle is a value of ( ). Pascal’s formula translates into the fact that the 
entry in row n + 1, column r equals the sum of the entry in row n, column r — 1 plus the 
entry in row n, column r. That is, the entry in a given interior position equals the sum of 
the two entries directly above and to the above left. The left-most and right-most entries 
in each row are 1 because (") = | by Example 9.7.1 and l) = | by exercise | at the end of 
this section. 


n 
Calculating (") Using Pascal’s Triangle 


Use Pascal’s triangle to compute the values of 


(2) # (3) 


Solution By construction, the value in row n, column r of Pascal’s triangle is the value 
: Che od ; +1 
of ("), for every pair of positive integers n and r with r = n. By Pascal’s formula, (" P ) can 
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be computed by adding together (" ) and ("), which are located directly above and above 


n+] 
6 5 5 
= + =5+ = 
(5) G) (5) 5+10=15 and 


left of (" $ '). Thus, 
6\_ (5\, (5 
=(~}+(+) = 10+ 10 =20. = 


Pascal’s formula can be derived by two entirely different arguments. One is algebraic; 
it uses the formula for the number of r-combinations obtained in Theorem 9.5.1. The other 
is combinatorial; it uses the definition of the number of r-combinations as the number of 
subsets of size r taken from a set with a certain number of elements. We give both proofs 
since both approaches have applications in many other situations. 


Theorem 9.7.1 Pascal’s Formula 


Let n and r be positive integers with r = n. Then 
= Hi), 
r p= iP 


Proof (algebraic version): Let n and r be positive integers with r < n. We will show 
that the right-hand side of Pascal’s formula equals its left-hand side. By Theorem 9.5.1, 


( n (") n! n! 
+ = + 
r= r (r=) m= (r1)! rl@—n)! 


n! n! 
+ 
(r-l!(a-rt+1)! r!(—r)! 


To add these fractions, a common denominator is needed, so multiply the nu- 
merator and denominator of the left-hand fraction by r and multiply the numerator 
and denominator of the right-hand fraction by (n — r+ 1). Then 


( n )+(")- n! Ty n! Marth) 
r-1 r (r-l)!(r-rt+1)! ro ri(a—r)! (n-r+l) 
_ nlr + n'n!—n!r+n! 
m=r+1)r(r=1) (n-rt+l1)m-—r)!-r! 
_nbr+n!n—-nbr+n!  ni(n+!) 
(n—r-+1)!r! (n+1— P)!r! 


(n+ 1)! n+1 
= (n+) ir! = $ . This is what was to be shown. 
n amie Gp etl Za 


Proof (combinatorial version): Let n and r be positive integers with r = n. Suppose 
Sis a set with n + 1 elements. The number of subsets of S of size r can be calculated 
by thinking of S as consisting of two pieces: one with n elements {x), X2,- . - , Xn} and 
the other with one element {x,,+,}. 

Any subset of S with r elements either contains x, or it does not. If it contains 
X,+1, then it contains r—1 elements from the set {x,, x2,..., Xn}. If it does not con- 
tain xX„+1, then it contains r elements from the set {x1, x2,..., Xn}. 


(continued on page 646) 
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Subsets of Size r of {x}, x7, ... 


? Xn+1} 


subsets of size r 
that consist entirely 


subsets of size r 


that contain x41 


of elements from and r — 1 elements 
EEE E from {xi Xe An] 
There are There are 
n\ . n 
( r ) of these. ( Fail Jof these. 


By the addition rule, 


the number of subsets 
+] of {x1, X2,- Xn} 
of size r 


the number of subsets 
of {x1, Xs.. 
of size r— 1 


the number of subsets 
of {x1, X2, » -© s Xm Xari) 
of size r 


«Xn 


1 ; 
By Theorem 9.5.1, the set {x1, X2,- -< Xn, Xn+1} has (0 ) subsets of size r, the set 
{xis Xo,..-, X,} has es ) subsets of size r— 1, and the set {x1, x2,..., Xp} has (") 


subsets of size r. Thus 
4 + ') ( n ) (") 
= + . 
r r-1 r 


as was to be shown. 


Example 9.7.4 


Deriving New Formulas from Pascal’s Formula 

: +2). 
Use Pascal’s formula to derive a formula for (" ; ) in terms of values of ("), he J. and (. : A 
Assume n and r are nonnegative integers and 2 S r S n. 


Solution By Pascal’s formula, 


ma-e) 


n+1 
r= I 


7) Lt) CC) Oh 


Combining the two middle terms gives 


CIRE 


for all nonnegative integers n and r such that 2 S r S n. E 


Now apply Pascal’s formula to ( ) and (" H ') and substitute into the above to obtain 


The Binomial Theorem 


In algebra a sum of two terms, such as a + b, is called a binomial. The binomial theorem 
gives an expression for the powers of a binomial (a + b)”, for each nonnegative integer n 
and all real numbers a and b. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


9.7 PASCAL’S FORMULA AND THE BINOMIAL THEOREM 647 


Consider what happens when you calculate the first few powers of a+b. According 
to the distributive law of algebra, you take the sum of the products of all combinations of 
individual terms: 


(a+b? = (a+b\(at+b) = aa +ab + ba + bb, 


(a+ by = (at+byatb\atb) 
= aaa + aab + aba + abb + baa + bab + bba + bbb, 
(a+b) = (a+b\(a+bXa+b)(a +b) 


lst 2nd 3rd 4th 
factor factor factor factor 


= aaaa + aaab + aaba + aabb + abaa + abab + abba + abbb 
+ baaa + baab + baba + babb + bbaa + bbab + bbba + bbbb. 


Now focus on the expansion of (a + by. (It is concrete, and yet it has all the features of 
the general case.) A typical term of this expansion is obtained by multiplying one of the 
two terms from the first factor times one of the two terms from the second factor times one 
of the two terms from the third factor times one of the two terms from the fourth factor. 
For example, the term abab is obtained by multiplying the a’s and b’s marked with arrows 
below. 


l lJ l 


(a+ b)X(a+ b)\(a+ bat b) 


Since there are two possible values—a or b—for each term selected from one of the four 
factors, there are 2* = 16 terms in the expansion of (a + by’. 

Now some terms in the expansion are “like terms” and can be combined. Consider all 
possible orderings of three a’s and one b, for example. By the techniques of Section 9.5, 
there are ($) = 4 of them. And each of the four occurs as a term in the expansion of (a + b}: 


aaab aaba abaa baaa. 


By the commutative and associative laws of algebra, each such term equals ab, so all 
four are “like terms.” When the like terms are combined, therefore, the coefficient of ab 
equals (): 

Similarly, the expansion of (a + b)* contains the (3) = 6 different orderings of two a’s 
and two b’s, 


aabb abab abba baab baba bbaa, 
all of which equal a&b’, so the coefficient of a*b* equals (6). By a similar analysis, 


4 


the coefficient of ab? equals ( ). Also, since there is only one way to order four a’s, the coef- 


3 
ficient of a* is 1 (which equals (3), and since there is only one way to order four b’s, the 


coefficient of b* is 1 (which equals 6). Thus, when all of the like terms are combined, 


(a+b)* = e + (‘Je + (e + (a + (Je 


= af + 40b + 6a b? + 4ab? + br. 


The binomial theorem generalizes this formula to an arbitrary nonnegative integer n. 
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Theorem 9.7.2 Binomial Theorem 


Given any real numbers a and b and any nonnegative integer n, 
(a+b) = S n ae p 
k 


=e n ad ph ae n d b + E n ab tb" 
1 2 i= Il 


Note that the second expression equals the first because (") = | and (") = 1, for every 
nonnegative integer n, since b? = 1 and a” " = 1. 

It is instructive to see two proofs of the binomial theorem: an algebraic proof and a 
combinatorial proof. Both require a precise definition of integer power. 


Definition 


For any real number a and any nonnegative integer n, the nonnegative integer 
powers of a are defined as follows: 


a \\t ifn = 0 
2 aa tn >. 


In some mathematical contexts, 0° is left undefined. Defining it to be 1, as indicated 
in Section 5.1, makes it possible to write general formulas such as È;=0 z= cs without 
having to exclude values of the variables that result in the expression 0°. 

The algebraic version of the binomial theorem uses mathematical induction and calls 
upon Pascal’s formula at a crucial point. 


Proof of the Binomial Theorem (algebraic version): 


Suppose a and b are real numbers. We use mathematical induction and let the prop- 
erty P(n) be the equation 


a+b => (‘ers <— P(n) 


k=0 
Show that P(0) is true: When n = 0, the binomial theorem states that: 
0 SOY okk 
(a+b) = a ob. <— P(0) 
Dk 


Now the left-hand side is (a +b)? = 1 [by definition of power], and the right-hand 


side is 
5 (a E (e 
Eo \k 0 


0! 1 
= --1=—=1 
0!-(0— 0)! 1-1 


also [since 0! = 1, ad = 1, and b? = 1]. Hence P (0) is true. 
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Show that for each integer m = 0, if P(m) is true then P (m + 1) is true: Let m be 
any integer with m = 0, and suppose P(m) is true. That is, suppose 


m m = 
(a+by” = >» ( k Jer kpk, P(m) inductive hypothesis. 
k=0 


We need to show that P(m + 1) is true: 


m+ fin +1 7 
(a+b)"™! = T i ja» khk Pim +1) 
k=0 


Now, by definition of the (m + 1)st power, 
(at by"*! = (a+b) (a +b)", 


so by substitution from the inductive hypothesis, 


l 
M 


m m _ 
(a+b)"*} = a+b): X, a" kpk 
k=0 k 
by the generalized distributive 


m m = m m = 
: ( Jer oS ( Jer kpk 
k=0 k k=0 k 
law and the facts that 


Afm -kk wal™ kpk+l 
= 5 a”t Tkp + Ss a” *b + ad” * = qitm*k = qt ick 
k a\k 


k=0 and b-bK = plt* = pet! 


We transform the second summation on the right-hand side by making the change 
of variable j = k+ 1. When k = 0, then j = 1. When k = m, then j = m+ 1. And 
since k = j — 1, the general term is 


M \ m—kpkt+1 _ m m-(-)pj — m m+1—j,j 
b = b = b’. 
(") (i) Ui) 


Hence the second summation on the right-hand side above is 
m+ 1 
5 ( = Jeers 
mini 


But the j in this summation is a dummy variable; it can be replaced by the letter k, as 
long as the replacement is made everywhere the j occurs: 


ŞS m atii S m art- 
j=1 (al ak=l l 


Substituting back, we get 


(a+b)"™! = > m aiz S m ati- kpk 
imo \K imi \K-1 


[The reason for the above maneuvers was to make the powers of a and b agree so that 
we can add the summations together term by term, except for the first and the last terms, 
which we must write separately. ] 


(continued on page 650) 
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Thus 


+p m+ _ m m+1—070 1 = m + m m+1—kpk 
or @: > k) \e-1) |" 


4 m qdtti-@t)pmt 
(m+1)-1 


m+1 m m mt+1—kyk m+1 
= gmtly + bk +b 
enik 


since a? = b? = 1 
and (") = (") =1. 


m m m+1 
+ = by Pascal’s formula. 


(a + py"! = qt} $ Z E Janig e 
k=1 


But 


Hence 


m+1 +1 
_ 5 (” : Jaa because ag ') = ("* J =1 


m+ 1 
k=0 


This is what was to be shown. 


It is instructive to write out the product (a+ b): (a + b)” without using the summation 
notation but using the inductive hypothesis about (a + b)”: 


(a+b)"*! = (a+b): e + (r)a ++ ts jee 


M \ m-kyk ne m m—1 m 
+ a G Fns ab +b |. 
k m=l 


You will see that the first and last coefficients are clearly 1 and that the term contain- 
m*1~kpk is obtained from multiplying a” “b* by a and a” “~b*"! by b [because 


m+1—kpk equals the sum of ("") and i A 


ing a 
m+1—k=m-—(k-—1)]. Hence the coefficient of a 
This is the crux of the algebraic proof. 
If n and r are nonnegative integers and r = n, then (") is called a binomial coefficient 
because it is one of the coefficients in the expansion of the binomial expression (a + b)”. 
The combinatorial proof of the binomial theorem follows. 


Proof of Binomial Theorem (combinatorial version): 


[The combinatorial argument used here to prove the binomial theorem works only for 
n = 1. If we were giving only this combinatorial proof, we would have to prove the case 
n = 0 separately. Since we have already given a complete algebraic proof that includes 
the case n = Q, we do not prove it again here.] 
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Let a and b be real numbers and n an integer that is at least 1. The expression 
(a +b)" can be expanded into products of n letters, where each letter is either a or b. 
For each k = 0, 1, 2,..., n, the product 


a’ *b* = a-a-a-*-a:b-b-b--b 
ct A E E 
n — k factors k factors 


occurs as a term in the sum the same number of times as there are orderings of 
(n — k) a’s and k b’s. But this number equals ("), the number of ways to choose k 
positions into which to place the b’s. [The other n—k positions will be filled by a’s.] 
Hence, when like terms are combined, the coefficient of a" *p* in the sum is ‘i ; 
Thus 


(a+b) = 5 (r)a, 


k=0 


as was to be shown. 


Substituting into the Binomial Theorem 


Expand the following expressions using the binomial theorem: 
a. (a+b) b. (x—4y)* 


Solution 


a. (a+b = 


| 
Me 
ae 
=~ U 
NL 
nn 
A 
> 
= 


at 3 atb! + 3 a 7b + a ab? + 3 atb +b? 
1 2 3 4 


=a + 5afb + 10a°b* + 10a°b? + Sab* + b’ 


b. Observe that (x — 4y)* = (x + (—4y))*. So let a = x and b = (—4y), and substitute into 
the binomial theorem. 


_ hae 4 4—kr_ yxk 
G= = SIL Je 4y) 


k=0 
=x'+ (aiea + (aeo + (Gaeo +(—4yy" 
= x4 + 4x3(—4y) + 6x°(16y*) + 4x! (—64y°) + (256y*) 


=x — 16y + 96x y = 256xy° + 256y" E 


Deriving Another Combinatorial Identity from the Binomial Theorem 


Use the binomial theorem to show that 


o A R +Q) e+) 


for each integer n = 0. 
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Solution Since 2 = 1+ 1,2” = (1+ 1)”. Apply the binomial theorem to this expression 
by letting a = 1 and b = 1. Then 


ie € n _qn-k. k ~X nį, 
e-ga 


because 1” * = 1 and 1% = 1. Consequently, 


AO RONONO O, i 


Using a Combinatorial Argument to Derive the Same Identity 


According to Theorem 6.3.1, a set with n elements has 2” subsets. Apply this fact to give a 
combinatorial argument to justify the identity 


FO 


Solution Suppose S is a set with n elements. Then every subset of S has some number 
of elements k, where k is between 0 and n. It follows that the total number of subsets of S, 
N(P(S)), can be expressed as the following sum: 


the number the number the number the number 
of subsets = | of subsets of | + | of subsets |+---+] of subsets 
of S size 0 of size 1 of size n 


Now, for each integer k from | through n, the number of subsets of size k of a set with n 
elements is ("). Hence the 


n n n n 
number of subsets of S = + + E \ 
0 ji 2 n 


By Theorem 6.3.1, S has 2” subsets. Hence 


ORONONO O Ei i 


Using the Binomial Theorem to Simplify a Sum 


Express the following sum in closed form (without using a summation symbol and without 


using an ellipsis ---): 
n n k 
9". 


Solution When the number 1 is raised to any power, the result is still 1. Thus 


SG) 


(1+ 9)" by the binomial theorem with a = 1 and b = 9 


= 10". = 
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1. Ifn andr are nonnegative integers with r = n, 
then the relation between (") and (,,” ) is 


r nr, 


2. Pascal’s formula says that if n and r are positive 
integers with r S n, then____ 


3. The crux of the algebraic proof of Pascal’s formu- 
la is that to add two fractions you need to express 
both of them with a ___. 


4. The crux of the combinatorial proof of Pascal’s 
formula is that the set of subsets of size r of a set 
{X1, X2,- - - , X,+,} can be partitioned into the set of 
subsets of size r that contain and the set of 
subsets of size r that _____ 


EXERCISE SET 9.7 


5. The binomial theorem says that given any 
real numbers a and b and any nonnegative 
integer n, 


6. The crux of the algebraic proof of the bino- 
mial theorem is that, after making a change 
of variable so that two summations have the 
same lower and upper limits and the exponents 
of a and b are the same, you use the fact that 
m m 
Ee ~ ——: 

7. The crux of the combinatorial proof of the 
binomial theorem is that the number of ways to 
arrange k b’s and (n — k) a’s in a row is 


In 1-4, use Theorem 9.5.1 to compute the values of the 
indicated quantities. (Assume n is an integer.) 


1. ("), forn=0 2. ("), forn = 1 


3. 6), forn = 2 4. 6), forn = 3 


5. Use Theorem 9.5.1 to prove algebraically that 
(") = (, " „b for integers n and r with 0 S r S n. 
(This can be done by direct calculation; it is not 
necessary to use mathematical induction.) 
Justify the equations in 6-9 either by deriving them from 
formulas in Example 9.7.1 or by direct computation from 
Theorem 9.5.1. Assume m, n, k, and r are integers. 


m+k 
6. (rE )=mtk, formt+k= 1 


+ +3)\(n+2 
7. (" ee xn ) fone = 


n+l 


8. k) = 1, fork-r=0 


2(n + 1) 
9. ( 2n ) = (n+ 1)(2n + 1), frn = 0 


10. a. Use Pascal’s triangle given in Table 9.7.1 to 


compute the values of (), (3), (9, and I 


b. Use the result of part (a) and Pascal’s formula 
7) (7 7 
to compute (3), (o). and (3). 
c. Complete the row of Pascal’s triangle that 


corresponds to n = 7. 


11. The row of Pascal’s triangle that corresponds to 
n = 8 is as follows: 
1 8 28 56 70 56 28 8 1. 


What is the row that corresponds to n = 9? 


12. Use Pascal’s formula repeatedly to derive 
a formula for (" j °) in terms of values of (") 
with k = r. (Assume n and r are integers with 
n=r=3,) 


13. Use Pascal’s formula to prove by mathematical 
induction that if n is an integer and n = 1, then 


TOA 
(3) 


. Prove that if n is an integer and n = 1, then 


n+2 
[242-340 1) = 2( 3 ) 


15. Prove the following generalization of exercise 13: 
Let r be a fixed nonnegative integer. For every 
integer n with n = r, 


SOC) 


16. Think of a set with m + n elements as composed 
of two parts, one with m elements and the other 
with n elements. Give a combinatorial argument to 
show that 


CO 


where m and n are positive integers and r is an 
integer that is less than or equal to both m and n. 
This identity gives rise to many useful additional 
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identities involving the quantities Ki Because 
Alexander Vandermonde published an influen- 
tial article about it in 1772, it is generally called 
the Vandermonde convolution. However, it was 
known at least in the 1300s in China by Chu 
Shih-chieh. 


H 17. Prove that for every integer n = 0, 
+ +e = A 
0 1 n n 
18. Let m be any nonnegative integer. Use mathemati- 


cal induction and Pascal’s formula to prove that 
for every integer n = 0, 


(r) n ow are 
+ +e = 
0 1 n n 

Use the binomial theorem to expand the expres- 


sions in 19-27. 


19. (1 +x)’ 20. (p+q)° 21. (1—x)® 


23. (p—2q)* 24. (u—3vf 


25. (xt 26. |<= 27. |x +t 
x a 3 x 


28. In Example 9.7.5 it was shown that 
(a+ by = a> + 5afb + 10a°b? + 10a7b? + 5ab* + b5. 


22. (u- vy 


Evaluate (a + b) by substituting the expression 
above into the equation 


(a+b) = (a+ byat+b) 


and then multiplying out and combining like 
terms. 


In 29-34, find the coefficient of the given term when the 
expression is expanded by the binomial theorem. 

29. x°y in (x+y)? 30. x’ in (2x +3)!° 

31. ab’ in (a— 2b)” 32. uv‘ in (u — v^! 
33. p'°q’ in (3p — 2q) 34. xy" in Qx- 3y) 


35. As in the proof of the binomial theorem, trans- 
form the summation 


- m m—kyk+1 
> a `b 
k=0 ( 3 


by making the change of variable j = k+ 1. 
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Use the binomial theorem to prove each statement in 36-41. 


36. For every integer n = 1, 


9-0-0- 


(Hint: Use the fact that 1 + (—1) = 0.) 


H 37. For every integer n = 0, 


Aet) 


, m [m r 
38. For every integer m = 0, Sen "oe i=]. 


i=0 l 


. n [on a 
39. For every integer n = 0, Sen a = 2". 


i=0 l 


40. For every integer n = 0 and for every nonnegative 
real number x, 1 + nx S (1 +x)". 


H 41. For every integer n = 1, 


(9) 32) 20) 


pres + ( iy 1 n = 0 
' gn-l n—-1 1 


gn-l 


if n is even 


if n is odd. 


42. Use mathematical induction to prove that for 
every integer n = 1, if Sis a set with n elements, 
then S has the same number of subsets with an 
even number of elements as with an odd number 
of elements. Use this fact to give a combinatorial 
argument to justify the identity of exercise 36. 

Express each of the sums in 43-54 in closed form (with- 

out using a summation symbol and without using an 


ellipsis ...). 
n n m m . 
43. ( )s 44. ( a 
2 k 2 i 
45. >, C) 46. (x) ark 
=p \! imo \K 
2n : 2n : n n 
4. > CY Jz’ 48. x 
j=0 J r=0 2 
49 S (") m=i 2i 50 a (") a 
` 0 \i m ` Ak 2 
Z i(m)\ 1 L A7 \ ,2n-2ky2k 
51. -1 =a 52. 38 2. 
a ( La A 
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c. Use the result of part (b) to derive the formulas 


k il” n—ini á n 2n— 4 
53. Si-1(")s 2 54. Su") JAE below. 
i=0 k=0 


lj /n n n n 
. n=l = er 
* 55. (For students who have studied calculus.) 2 n (") = 2(") j (3) Hs “(")] 
a. Explain how the equation below follows from E 
the binomial theorem: Gi) Sk (") (-1)k =0 
k 
n n n i k=0 
1+x) = X: 
asats 32) 


. . ; d. Express > k (") 3* in closed form (without using 
b. Write the formula obtained by taking the de- £ \k 


rivative of both sides of the equation in part (a) a summation sign or ellipsis). 
with respect to x. 


ANSWERS FOR TEST YOURSELF 


1. (") = h a J) pa ie ') = (" ) + (") 3. common denominator 4. x„+1; do not contain x,+, 


5.(a+by" = X (Ja 6) 7.(°) 


k=0 


EEJ Probability Axioms and Expected Value 


The theory of probability is at bottom nothing but common sense reduced to a 
calculus. —Pierre-Simon Laplace (1749-1827) 


Up to this point, we have calculated probabilities only for situations, such as tossing a fair 
coin or rolling a pair of balanced dice, where the outcomes in the sample space are all 
equally likely. But coins are not always fair and dice are not always balanced. How is it 
possible to calculate probabilities for these more general situations? 

The following axioms were formulated by A. N. Kolmogorov in 1933 to provide a theo- 
retical foundation for a far-ranging theory of probability. In this section we state the axi- 
oms, derive a few consequences, and introduce the notion of expected value. 

Recall that a sample space is a set of all outcomes of a random process or experiment 
and that an event is a subset of a sample space. 


ITAR-TASS News Agency/Alamy 


Andrei Nikolaevich 
Kolmogorov (1903—1987) Probability Axioms 


Let S be a sample space. A probability function P from the set of all events in S to the 

set of real numbers satisfies the following three axioms: For all events A and B in S: 

1. 0s P(A) =1. 

2. P(@) = Oand P(S) = 1. 

3. If A and B are disjoint (that is, if A N B = ©), then the probability of the union 
of A and B is 


P(A UB) = P(A) + P(B). 


Applying the Probability Axioms 


Suppose that A and B are events in a sample space S. If A and B are disjoint, could 
P(A) = 0.6 and P(B) = 0.8? 
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Solution No. Probability axiom 3 would imply that P(A U B) = P(A) + P (B) = 0.6 + 
0.8 = 1.4, and since 1.4 > 1, this result would violate probability axiom 1. E 


See ewe The Probability of the Complement of an Event 
Suppose that A is an event in a sample space S. Deduce that P(A‘) = 1 — P(A). 


Solution By Theorem 6.2.2(5), with S representing the universal set U, 
ANA =Ø and AUA =S. 
Thus S is the disjoint union of A and A‘, and so 
P(A UA‘) = P(A) + P(A) = P(S) = 1. 


So P(A)+P(A‘) =1 and subtracting P(A) from both sides gives the result that 
P(A‘) = 1— P(A). E 


Probability of the Complement of an Event 
If A is any event in a sample space S, then 


P(A‘) = 1 — P(A). 9.8.1 


It is important to check that Kolmogorov’s probability axioms are consistent with the 
results obtained using the equally likely probability formula. To see that this is the case, let 
S be a finite sample space with outcomes qj, a2, 43, . . . , An. It is clear that all the singleton 
sets {a1}, {a}, {a3},..., {a,} are mutually disjoint and that their union is S. Since P(S) = 1, 
probability axiom 3 can be applied multiple times (see exercise 13 at the end of this section) 
to obtain 


P({a,} U {a} U {a3} U +++ U {ap} = S Pda) =i 


If, in addition, all the outcomes are equally likely, there is a positive real number c so that 


Pair) = Pag) = Pas) = e =P Gad) = c. 
Hence 
1= Se=ere+ “+ +c=n0, 
= ————SS 
sal n terms 
and thus 
1 
C= 
n 


It follows that if A is any event with outcomes lip Ais Qip + + -> Gi,» then 


m m 1 m N A) 


P(A) = Ș Pha; }) = = 
(A) > ({a;,}) 2: n NG) 


which is the result given by the equally likely probability formula. 
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The Probability of a General Union of Two Events 


Follow the steps outlined in parts (a) and (b) below to prove the following formula: 


Probability of a General Union of Two Events 


If S is any sample space and A and B are any events in S, then 
P(A U B) = P(A) + P(B)— P(A NA B). 9.8.2 


In both steps, suppose that A and B are any events in a sample space S. 


a. Show that A U B is a disjoint union of the following sets: A — (A N B), B— (A N B), 
and A N B. 


b. For any events U and V in a sample space S, if U C V then P(V— U) = P(V) — P(U). 
Use this result (which you are asked to prove in exercise 12 at the end of this section) 
and the result of part (a) to finish the proof of the formula. 


Solution 


a. Refer to Figure 9.8.1 as you read the following explanation. Elements in the set 
A-— (A N B) are in the region shaded blue, elements in B — (A N B) are in the region 
shaded gray, and elements in A N B are in the white region. 


A-(ANB) 


FIGURE 9.8.1 


Part 1: Show that A U BC (A — (A N B)) U (B — (A N B)) U (ANB): Given any 
element x in A U B, x satisfies exactly one of the following three conditions: 
d)x€AandxEB 
(2)x E Aandx B 
(3)x E Band x ZA. 
1. Inthefirstcase,x E A N B,andsox E (A— (A NB) U (B-(ANB))UANB) 
by definition of union. 
2. In the second case, x € A N B (because x É B), and so x E A — (A N B). There- 
fore x E (A — (A N B)) U (B — (A N B)) U (A N B) by definition of union. 
3. In the third case, x A N B (because x € A), and hence x € B -— (A N B). So, 
again, x E (A — (A N B)) U (B — (A N B)) U (A N B) by definition of union. 


Hence, in all three cases, x E (A — (A N B)) U (B — (A N B)) U (A N B), which com- 
pletes the proof of part 1. 

Moreover, since the three conditions are mutually exclusive, the three sets A — (A N B), 
B- (A N B), and A N B are mutually disjoint. 

Part 2: Show that (A — (A A B))U (B — (A NB))U(A NB)CA UB: Suppose 
x is any element in (A — (A N B)) U (B — (A N B)) U (A N B). By definition of union, 
xEA—(ANB)oxEB-—-(ANB)oxEANB. 
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1. In the first case, x E A—(A N B), and so x E A and x É A N B by definition of 
set difference. In particular, x E A and thus x E A U B. 

2. In the second case, x E B—(A N B), and so x E B and x É A N B by definition 
of set difference. In particular, x E B and thus x E A U B. 

3. In the third case, x E A N B, and so, in particular, x E A. Thus x E A U B. 


Hence, in all three cases, x E A U B, which completes the proof of part 2. 


b. P(A UB) = P(A- (AN B) U (B- (AN BNU (AN B)) by part (a) 
= P(A- (A N B)) + P(B- (A N B) +P(A N B) 
by exercise 13 at the end of the section and the fact that 
A-—(A N B), B-—(A N B), and A N B are mutually disjoint 


= P(A) — P(A N B) + P(B) — P(A N B) + P(A N B) 
by exercise 12 at the end of the section because 
ANBCAandANBCB 


= P(A)+ P(B)— P(A N B) by algebra. | 


Computing the Probability of a General Union of Two Events 


Suppose a card is chosen at random from an ordinary 52-card deck (see Section 9.1). What 
is the probability that the card is a face card (jack, queen, or king) or is from one of the red 
suits (hearts or diamonds)? 


Solution LetA be the event that the chosen card is a face card, and let B be the event that 
the chosen card is from one of the red suits. The event that the card is a face card or is from 
one of the red suits is A U B. Now MA) = 4:3 = 12 (because each of the four suits has 
three face cards), and so P(A) = 12/52. Also N(B) = 26 (because half the cards are red), 
and so P(B) = 26/52. Finally, N(A N B) = 6 (because there are three face cards in hearts 
and another three in diamonds), and so P(A M B) = 6/52. It follows from the formula for 
the probability of a union of any two events that 
12 26 6 32 


P(A U B) = P(A) + P(B)— P(A N B) = = + = 61.5%. 
(A U B) = P(A) + P(B) — PC Ha 59 a 5p a 


Thus the probability that the chosen card is a face card or is from one of the red suits is 
approximately 61.5%. E 


Expected Value 


People who regularly buy lottery tickets often justify the practice by saying that, even 
though they know that on average they will lose money, they are hoping for one significant 
gain, after which they believe they will quit playing. Unfortunately, when people who have 
lost money on a string of losing lottery tickets win some or all of it back, they generally 
decide to keep trying their luck instead of quitting. 

The technical way to say that on average a person will lose money on the lottery is to 
say that the expected value of playing the lottery is negative. 


Definition 


Suppose the possible outcomes of an experiment, or random process, are real num- 
bers dj, a2, 43, . . . , An, Which occur with probabilities pı, p2, P3, - - - , Pn. The ex- 
pected value of the process is 


n 
S ap = api tampi taps + ` +a pr 
k=1 
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Expected Value of a Lottery 


Suppose that 500,000 people pay $5 each to play a lottery game with the following prizes: 
one grand prize of $1,000,000, 10 second prizes of $1,000 each, 1,000 third prizes of $500 
each, and 10,000 fourth prizes of $10 each. What is the expected value of a ticket? 


Solution Each of the 500,000 lottery tickets has the same chance as any other of con- 
taining a winning lottery number, and so p = 500.000 for each k = 1, 2, 3,..., 500,000. 
Let a; be the net gain for an individual ticket a;, where a, = 999,995 (the net gain for 
the grand-prize ticket, which is one million dollars minus the $5 cost of the winning 


ticket), dy = a; = *** = a; = 995 (the net gain for each of the 10 second-prize tickets), 
an = 413 = *** = 4,91; = 495 (the net gain for each of the 1,000 third-prize tickets), and 
41012 = 4013 = *** = 11,011 = 5 (the net gain for each of the 10,000 fourth-prize tickets). 
Since the remaining 488,989 tickets just lose $5, ay) 912 = 411,013 = *** = 400,000 = 75. 


The expected value of a ticket is therefore 


500,000 500,000 1 
bJ a = b? ar: = because each p, = 1/500,000 
a | : TT i 
500,000 


1 
= 500,000 > ak by theorem 5.1.1(2) 


k=1 


= — +10- +1 -495 + 10, -5 + (—5)-488, 
300,00 0-995 + 1,000 -495 + 10,000 -5 + (—5)-488,989) 


= + +4 + — 2,444,94 
500,000 2995 + 9,950 + 495,000 + 50,000 — 2,444,945) 


= —1.78. 


In other words, a person who continues to play this lottery for a very long time will prob- 
ably win some money occasionally but on average will lose $1.78 per ticket. E 


eM eek Gambler’s Ruin 


A gambler repeatedly bets $1 that a coin will come up heads when tossed. Each time the 
coin comes up heads, the gambler wins $1; each time it comes up tails, he loses $1. The 
gambler will quit playing either when he is ruined (loses all his money) or when he has $M 
(where M is a positive number he has decided in advance). Let P,, be the probability that 
the gambler is ruined if he begins playing with $n. Then, if the coin is fair (has an equal 
chance of coming up heads or tails), 


Py. = LP, + P for each integer k with 2 < k < M. 
(This follows from the fact that if the gambler has $(k — 1), then he has an equal chance of 
winning $1 or losing $1. If he wins $1 then his chance of being ruined is P}, whereas if he 
loses $1 then his chance of being ruined is P,_>.) Also Py = 1 (because if he has $0, he is 
certain of being ruined) and Py = 0 (because once he has $M, he quits and so stands no 
chance of being ruined). Find an explicit formula for P„. How should the gambler choose 
M to minimize his chance of being ruined? 


Solution Multiplying both sides of P,_, = 5 Py T EP, by 2 and subtracting P,_» from 
both sides gives 


Py = 2Py—1 — Per. 
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This is a second-order homogeneous recurrence relation with constant coefficients. Be- 
cause P, — 2P;,_, + Py_2 = 0, its characteristic equation is 


ł—2t+1=0, 
which has the single root r = 1. Thus, by the single-root theorem from Section 5.8, 
Pa = Cr+ Dnr” = C+ Dn 

(since r = 1), where C and D are determined by two values of the sequence. Since Py = 1 
and Py = 0, 

1 = P= C+D-0=C 

0=Py=C+D-M=1+D-M. 
It follows that C = 1 and D = =}, and so 


1 M-n i : 
n=l y” = 7 for each integer n with 0 S n S M. 


P 


For instance, a gambler who starts with $20 and decides to quit either if his total grows to 
$100 or if he goes broke has the following chance of going broke: 

100-20 80 

27 100 100 


= 80%. 


Observe that the larger M is relative to n, the closer P, is to 1. In other words, the larger the 
amount of money the gambler sets himself as a target, the more likely he is to go broke. 


Conversely, the more modest he is in his goal, the more likely he is to reach it. E 
TEST YOURSELF 
1. IfA is an event in a sample space S, P(A) can 4. IfA and B are any events in a sample space S, 
take values between and . Moreover, P(A U B) = ___. 
PS) and EO) = ` 5. If the possible outcomes of a random process or 
2. If A and B are disjoint events in a sample space S, experiment are real numbers aj, a2, . . . , Ap, Which 
P(A UB) =___. occur with probabilities pı, po, ..., Pn, then the 


3. If A is an event in a sample space S, P(A) = —__ ea picid varie ict the process 


EXERCISESET9.8 


1. In any sample space S, what is P(O)? 5. Suppose A and B are events in a sample space S 
and suppose that P(A) = 0.6, P(B°) = 0.4, and 


2. Suppose A, B, and C are mutually exclusive events 


in a sample space S, A U BUC = S, and A and B 
have probabilities 0.3 and 0.5, respectively. 
a. Whatis P(A U B)? b. What is P(C)? 


. Suppose A and B are mutually exclusive events 


in a sample space S, C is another event in S, 

A U B U C = S, and A and B have probabilities 
0.4 and 0.2, respectively. 

a. What is P(A U B)? 


P(A N B) = 0.2. What is P(A U B)? 


. Suppose U and V are events in a sample space S 


and suppose that P(U‘) = 0.3, P(V) = 0.6, and 
P(U‘ U V°) = 0.4. What is P(U U V)? 


. Suppose a sample space S consists of three out- 


comes: 0, 1, and 2. Let A = {0}, B = {1}, and 
C = {2}, and suppose P(A) = 0.4 and P(B) = 0.3. 
Find each of the following: 


b. Is it possible that P(C) = 0.2? Explain. a. P(A UB) 
4. Suppose A and B are events in a sample space S$ Be A) 
; ask i c. PAUCO) 
with probabilities 0.8 and 0.7, respectively. Sup- d. P(A’) 


pose also that P(A N B) = 0.6. What is P(A U B)? 
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10. 


H 11. 


H 13. 


14. 


15. 


16. 


ANSWERS FOR TEST YOURSELF 


e. P(ACN B®) 
f. P(A° U B°) 


Redo exercise 7 assuming that P(A) = 0.5 and 
P(B) = 0.4. 


. Let A and B be events in a sample space S, 


and let C = S—(A U B). Suppose P(A) = 0.4, 
P(B) = 0.5, and P(A N B) = 0.2. Find each of the 
following: 

. P(A U B) 

P(C) 

P(A) 

. P(A A B9) 

. P(AS U B°) 

P(BSNC) 


>A OANA oD 


Redo exercise 9 assuming that P(A) = 0.7, 
P(B) = 0.3, and P(A M B) = 0.1. 


Prove that if S is any sample space and U and V 
are events in S with U C V, then P(U) S P(V). 


. Prove that if S is any sample space 


and U and V are any events in S, then 
P(V— U) = P(V)—-P(UN V). 


Use the axioms for probability and mathematical 
induction to prove that for each integer n = 2, if 
A1, Az, A3,...,A, are any mutually disjoint events 
in a sample space S, then 


k=1 


A lottery game offers $2 million to the grand 
prize winner, $20 to each of 10,000 second prize 
winners, and $4 to each of 50,000 third prize 
winners. The cost of the lottery is $2 per ticket. 
Suppose that 1.5 million tickets are sold. What is 
the expected gain or loss of a ticket? 


A company offers a raffle whose grand prize is a 
$40,000 new car. Additional prizes are a $1,000 
television and a $500 computer. Tickets cost $20 
each. Ticket income over the cost of the prizes will 
be donated to charity. If 3,000 tickets are sold, 
what is the expected gain or loss of each ticket? 


An urn contains four balls numbered 2, 2, 5, and 
6. If a person selects a set of two balls at random, 


17. 


18. 


19. 


H 20. 


21. 
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what is the expected value of the sum of the num- 
bers on the balls? 


An urn contains five balls numbered 1, 2, 2, 8, 
and 8. If a person selects a set of two balls at ran- 
dom, what is the expected value of the sum of the 
numbers on the balls? 


An urn contains five balls numbered 1, 2, 2, 8, 
and 8. If a person selects a set of three balls at 
random, what is the expected value of the sum of 
the numbers on the balls? 


When a pair of balanced dice are rolled and the 
sum of the numbers showing face up is com- 
puted, the result can be any number from 2 to 12, 
inclusive. What is the expected value of the sum? 


Suppose a person offers to play a game with 

you. In this game, when you draw a card from a 
standard 52-card deck, if the card is a face card 
you win $3, and if the card is anything else you 
lose $1. If you agree to play the game, what is your 
expected gain or loss? 


A person pays $1 to play the following game: The 
person tosses a fair coin four times. If no heads 
occur, the person pays an additional $2, if one 
head occurs, the person pays an additional $1, if 
two heads occur, the person just loses the initial 
dollar, if three heads occur, the person wins 

$3, and if four heads occur, the person wins $4. 
What is the person’s expected gain or loss? 


. A fair coin is tossed until either a head comes up 


or four tails are obtained. What is the expected 
number of tosses? 


. A gambler repeatedly bets that a die will come up 


6 when rolled. Each time the die comes up 6, the 
gambler wins $1; each time it does not, the gambler 
loses $1. He will quit playing either when he is ru- 
ined or when he wins $300. If P,, is the probability 
that the gambler is ruined when he begins play with 
$n, then Py; = EP, + 6 Px—2 for every integer k 
with 2 = k = 300. Also Py = 1 and P399 = 0. Find 
an explicit formula for P,, and use it to calculate 
Po. (Exercise 33 in Section 9.9 asks you to derive 
the recurrence relation for this sequence.) 


1.0;1;1;0 2.P(A)+P(B) 3.1—P(A) 4. P(A)+P(B)—P(ANB) 5.api+ap + tap, 
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EX] Conditional Probability, Bayes’ Formula, 


and Independent Events 


It is remarkable that a science which began with the consideration of games of 
chance should have become the most important object of human knowledge.... 

The most important questions of life are, for the most part, really only problems of 
probability. —Pierre-Simon Laplace, 1749-1827 


In this section we introduce the notion of conditional probability and discuss Bayes’ 
theorem and the kind of interesting results to which it leads. We then define the concept of 
independent events and give some applications. 


Conditional Probability 


Imagine a couple with two children, each of whom is equally likely to be a boy or a girl. 
Now suppose you are given the information that one is a boy. What is the probability that 
the other child is a boy? 

Figure 9.9.1 shows four equally likely combinations of gender for the children. You can 
imagine that the first letter refers to the older child and the second letter to the younger. 
Thus the combination BG indicates that the older child is a boy and the younger is a girl. 


BG GB GG 


FIGURE 9.9.1 


There are three combinations where one of the children is a boy, and in one of these 
three combinations the other child is also a boy. Given that you know one child is a boy, 
only these three combinations could be the case. So you can think of the set of those out- 
comes as a new sample space with three elements, all of which are equally likely. Within 
the new sample space, there is one combination where the other child is a boy. Thus it 
would be reasonable to say that the likelihood that the other child is a boy, given that at 
least one is a boy, is 1/3 = 333%. Given that the original sample space contained four 
outcomes note that the following computation gives the same result: 


P(at least one child is a boy and the other child is also a boy) _ 


z 


P(at least one child is a boy) 


Aue 


A generalization of this observation forms the basis for the following definition. 


Definition 


Let A and B be events in a sample space S. If P(A) # 0, then the conditional prob- 
ability of B given A, denoted P(B |A), is 
P(A A B) 


P(B|A) = 7a 9.9.1 
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Computing a Conditional Probability 


A pair of fair dice, one blue and the other gray, are rolled. What is the probability that the 
sum of the numbers showing face up is 8, given that both of the numbers are even? 


Solution The sample space is the set of all 36 outcomes obtained from rolling the two 
dice and noting the numbers showing face up on each. As in Section 9.1, denote by ab 
the outcome that the number showing face up on the blue die is a and the one on the 
gray die is b. Let A be the event that both numbers are even and B the event that the sum 
of the numbers is 8. Then A = {22, 24, 26, 42, 44, 46, 62, 64, 66}, B = {26, 35, 44, 53, 
62}, and A N B = {26, 44, 62}. Because the dice are fair (so all outcomes are equally 
likely), P(A) = 9/36, P(B) = 5/36, and P(A N B) = 3/36. By definition of conditional 
probability, 


P(ANB) $ 
P(A) E 9 37 


P(B|A) = 


Note that when both sides of the formula for conditional probability (formula 9.9.1) are 
multiplied by P(A), a formula for P(A N B) is obtained: 


P(A M B) = PIATRA): 9.9.2 


And dividing both sides of formula (9.9.2) by P(B|A) gives a formula for P(A): 


P(A N B) 


PA) = BD BIA | 9.9.3 


SEU Representing Conditional Probabilities with a Tree Diagram 


An urn contains 5 blue and 7 gray balls. Let us say that 2 are chosen at random, one after 
the other, without replacement. 


a. Find the following probabilities and illustrate them with a tree diagram: the probabil- 
ity that both balls are blue, the probability that the first ball is blue and the second is 
not blue, the probability that the first ball is not blue and the second ball is blue, and 
the probability that neither ball is blue. 


b. What is the probability that the second ball is blue? 
c. What is the probability that at least one of the balls is blue? 


d. If the experiment of choosing two balls from the urn is repeated many times over, 
what is the expected value of the number of blue balls? 


Solution Let S denote the sample space of all possible choices of two balls from the urn, 
let B, be the event that the first ball is blue, and let B, be the event that the second ball is 
blue. Then By is the event that the first ball is not blue and B; is the event that the second 
ball is not blue. 


a. Because there are 12 balls of which 5 are blue and 7 are gray, the probability that the 
first ball is blue is 


5 
P@:) = 75 
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and the probability that the first ball is not blue is 


P(By) — cs 
12 
If the first ball is blue, then the urn would contain 4 blue balls and 7 gray balls, and so 
P(B|B)) = = and P(B;|B,) = g 
11 ir 


where P(B,|B;) is the probability that the second ball is blue given that the first ball is 
blue and P(B5|B,) is the probability that the second ball is not blue given that the first 
ball is blue. It follows from formula (9.9.2) that the probability that both balls are blue is 


P(B, M By) = P(Bp|B,)-P(B,) =~ = 
11 12 132 
And the probability that the first ball is blue and the second ball is not blue is 
P(B, NBS) = P(BS|B,)-P(B) = = = 
11 12 132 


Similarly, if the first ball is not blue, then the urn would contain 5 blue balls and 
6 gray balls, and so 


6 

11 

where P(B,|B f) is the probability that the second ball is blue given that the first ball 
is not blue and P(By | BY) is the probability that the second ball is not blue given that 


the first ball is not blue. It follows from formula (9.9.2) that the probability that the 
first ball is not blue but the second ball is blue is 


E 5 Cc C 
P(B,|Bf) = 7 and P(Bs|By) = 


P(BE NB) = P(B)|BY-P(B) = 2 = 2. 
11 12 132 
And the probability that neither the first ball nor the second ball is blue is 
PEEN BS) = PBS| BY) PBS) = 2 = 4 
11 12 132 


The tree diagram in Figure 9.9.2 is a convenient way to help calculate these results. 


BiN By —> PB NB) =f 52 
B,N Bf —> P(B, NBS si EEE 


BEN B, —> P(B,NBY)= >. — 


BY N BS——> PB, nBA=É Tan 


FIGURE 9.9.2 


b. The event that the second ball is blue can occur in one of two mutually exclusive ways: 
Either the first ball is blue and the second is also blue, or the first ball is gray (not 
blue) and the second is blue. 
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In other words, B, is the disjoint union of By N B, and By N By. Hence 
P(B2) = P((B2 N By) U (B2 N BY)) 
= P(B, N By) + P(B, N BY) by probability axiom 3 


= OE by part (a) 
132 132 

55 5 

132 17 


Thus the probability that the second ball is blue is 5/12, the same as the probability 
that the first ball is blue. 


c. By formula 9.8.2, for the union of any two events, 


P(B, U By) = P(B)) + P(B2) — P(B; N Bo) 


= are by parts (a) and (b) 
12 12 132 

90 15 

= B2 27 


Thus the probability is 15/22, or approximately 68.2%, that at least one of the balls is 
blue. 


d. The event that neither ball is blue is the complement of the event that at least one of 
the balls is blue, and so 


P(O blue balls) = 1 — P(at least one ball is blue) by formula 9.8.1 


=J= a by part (c) 
22 

-T 

= za 


The event that one ball is blue can occur in one of two mutually exclusive ways: 
Either the second ball is blue and the first is not, or the first ball is blue and the 


second is not. Part (a) showed that the probability of the first way is is. and the 
probability of the second way is also B. Thus, by probability axiom 3, 


35 35 70 
P(1 blue ball) = + = ; 
132 132. 132 
Finally, by part (a), 
20 
P(2 blue balls) = ——. 
132 
Therefore, 


the expected value 
of the number = 0:P(0 blue balls) + 1-P(1 blue ball) + 2- P(2 blue balls) 
of blue balls 
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Bayes’ Theorem 


Suppose that one urn contains 3 blue and 4 gray balls and a second urn contains 5 blue and 
3 gray balls. A ball is selected by choosing one of the urns at random and then picking a 
ball at random from that urn. If the chosen ball is blue, what is the probability that it came 
from the first urn? 

This problem can be solved by carefully interpreting all the information that is known 
and putting it together in just the right way. Let A be the event that the chosen ball is blue, 
B, the event that the ball came from the first urn, and B, the event that the ball came from 
the second urn. Because 3 of the 7 balls in urn one are blue, and 5 of the 8 balls in urn two 
are blue, 


3 5 
P(A|B,) = a and P(A|B,) = rs 
And because the urns are equally likely to be chosen, 
1 
P(B)) = PB) = 5 


Moreover, by formula (9.9.2), 


31_ 3 

P(A N By) = P(A|By)-P(B)) = a ag 
P(ANB,) = P(A|B,)-P(B)) = == = = 
ae g2 i6 


Now A is the disjoint union of (A N B,) and (A N B,), and so by probability axiom 3, 


3435 59 
P(A) = P(A A B,) U (A N B2)) = P(A A Bi) + P(A N B) = — + — =. 
(A) = PK DUQ 2)) = PC dt PC 2) 14/16 112 
Finally, by definition of conditional probability, 
3 
P(B,NA 14-336 
P(B,|A) = A) E apy 
P(A) 5 826 


Thus, if the chosen ball is blue, the probability is approximately 40.7% that it came from 
the first urn. 

The steps used to derive the answer in the previous example can be generalized to prove 
Bayes’ theorem. (See exercises 9 and 10 at the end of this section.) Thomas Bayes was 
an English Presbyterian minister who devoted much of his energies to mathematics. The 
theorem that bears his name was published posthumously in 1763. The portrait at the left is 
the only one attributed to him, but its authenticity has recently come into question. 


E] 
a 
= 
= 
S 
2 
a 
E 
na 


Thomas Bayes Theorem 9.9.1 Bayes’ Theorem 


(1702-1761) Suppose a sample space S is a union of mutually disjoint events B,, B2, B3,..., Bn 
suppose A is an event in S, and suppose both A and each B, have nonzero probabilities 


for every integer k with 1 = k <n. Then 


P(A|B,)P(By) 
P(A|B,)P(B,) + P(A |B3)P(B3) + +++ + P(A|B,)P(B,) 


P(B,|A) = 
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Applying Bayes’ Theorem 


Most medical tests occasionally produce incorrect results, called false positives and false neg- 
atives. When a test is designed to determine whether a patient has a certain disease, a false 
positive result indicates that a patient has the disease when the patient does not have it. A false 
negative result indicates that a patient does not have the disease when the patient does have it. 

When large-scale health screenings are performed for diseases with relatively low inci- 
dence, those who develop the screening procedures have to balance several considerations: 
the per-person cost of the screening, follow-up costs for further testing of false positives, 
and the possibility that people who have the disease will develop unwarranted confidence 
in the state of their health. 

Consider a medical test that screens for a disease found in 5 people in 1,000. Suppose 
that the false positive rate is 3% and the false negative rate is 1%. Then 99% of the time a 
person who has the condition tests positive for it, and 97% of the time a person who does 
not have the condition tests negative for it. (See exercise 4 at the end of this section.) 


a. What is the probability that a randomly chosen person who tests positive for the dis- 
ease actually has the disease? 


b. What is the probability that a randomly chosen person who tests negative for the dis- 
ease does not in fact have the disease? 


Solution Consider a person chosen at random from among those screened. Let A be the 
event that the person tests positive for the disease, B, the event that the person actually has 
the disease, and B, the event that the person does not have the disease. Then 


P(A|B;) = 0.99, P(A‘|B,) = 0.01, P(A‘|B>) = 0.97, and P(A|B3) = 0.03. 
Also, because 5 people in 1,000 have the disease, 
P(B,) = 0.005 and P(B,) = 0.995. 


a. By Bayes’ theorem, 


P(A|B,)P(B)) 
P(A|B,)P(B,) + P(A |B2)P(B2) 
(0.99)(0.005) 

7 (0.99)(0.005) + (0.03)(0.995) 

= 0.1422 = 14.2%. 


P(B,|A) = 


Thus the probability that a person with a positive test result actually has the disease is 
approximately 14.2%. 


b. By Bayes’ theorem, 
P(A‘| By)P(B2) 

P(A‘| B,)P(B)) + P(A‘| By) P(Bp) 
(0.97)(0.995) 

~ (0.01)(0.005) + (0.97)(0.995) 

= 0.999948 = 99.995%. 


P(B,|A) = 


Thus the probability that a person with a negative test result does not have the disease is 
approximately 99.995%. 

You might be surprised by these numbers, but they are fairly typical of the situation 
where the screening test is significantly less expensive than a more accurate test for the 
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same disease yet produces positive results for nearly all people with the disease. Using the 
screening test limits the expense of unnecessarily using the more costly test to a relatively 
small percentage of the population being screened, while only rarely indicating that a per- 
son who has the disease is free of it. E 


Independent Events 


Suppose a fair coin is tossed twice. It seems intuitively clear that the outcome of the first 
toss does not depend in any way on the outcome of the second toss, and conversely, the out- 
come of the second toss does not depend on the outcome of the first toss. In other words, if, 
for instance, A is the event that a head is obtained on the first toss and B is the event that a 
head is obtained on the second toss, then if the coin is tossed randomly both times, events 
A and B should be independent in the sense that P(A |B) = P(A) and P(B|A) = P(B). This 
intuitive idea of independence is supported by the following analysis. If the coin is fair, 
then the four outcomes HH, HT, TH, and TT are equally likely, and 


A = {HH, HT}, B= {TH, HH}, AQN B= {HH}. 


Hence 
P(A) = P(B) = z = 3 
4 2 
But also 
1 1 
P(A|B) = s i <= L a P(B|A) = 2 + = A 
(B) 5 2 P(A) 5 2 


and thus P(A|B) = P(A) and P(B|A) = P(B). 
To obtain more convenient form for the definition of independence, observe that 


if P(B) + 0 and P(A|B) = P(A), then P(A N B) = P(A|B)-P(B) = P(A): P(B). 
By the same argument, 
if P(A) # 0 and P(B|A) = P(B) then P(A N B) = P(A): P(B). 
Conversely (see exercise 18 at the end of this section), 
if P(A N B) = P(A): P(B) and P(A) + 0 then P(B|A) = P(B) 
and 


if P(A N B) = P(A): P(B) and P(B) # 0 then P(A|B) = P(A). 


Thus, we can eliminate the requirement that the probabilities be nonzero if we use the fol- 
lowing product formula to define independent events. 


Note It would be natural 
to think that mutually 
disjoint events would be 
independent, but in fact Definition 
almost the opposite is 
true: Mutually disjoint 
events with nonzero prob- 
abilities are dependent. ICA 1) 1B) = JAAN 1202). 


If A and B are events in a sample space S, then A and B are independent if, and 
only if, 


See eee Disjoint Events and Independence 


Let A and B be events in a sample space S, and suppose A N B = ©, P(A) + 0, and 
P(B) + 0. Show that P(A N B) # P(A): P(B). 
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Solution Because A N B = ©, P(A N B) = Oby probability axiom 2. But P(A): P(B) # 0 
because neither P(A) nor P(B) equals zero. Thus P(A N B) # P(A): P(B). B 


The following example and its immediate consequence show how the independence of 
two events extends to their complements. 
See eee The Probability of AM B“ WhenA and B Are Independent Events 


Suppose A and B are independent events in a sample space S. Show that A and B® are also 
independent. 


Solution The solutions for exercises 8 and 28 in Section 6.2 show that for all sets A and B, 
(D(ANB)\UANBY=A 
and 
QANBNANB)=©. 
It follows that probability axiom 3 may be applied to equation (1) to obtain 
P((A A B) U (A N B®)) = P(A N B) + P(A A B°) = P(A). 
Solving for P(A N B°) gives that 
P(A N B®) = P(A) -P(A A B) 
= P(A)— P(A)-P(B) because A and B are independent 


= P(A)(1 — P(B)) by factoring out P(A) 
= P(A): P(BY by formula 9.8.1. 
Thus A and B“ are independent events. B 


It follows immediately from Example 9.9.5 that if A and B are independent, then A‘ and 
B are also independent and so are A‘ and B“. (See exercise 22 at the end of this section.) 
These results are applied in Example 9.9.6. 


Computing Probabilities of Intersections of Two Independent Events 


A coin is loaded so that the probability of heads is 0.6. Suppose the coin is tossed twice. 
Although the probability of heads is greater than the probability of tails, there is no reason 
to believe that whether the coin lands heads or tails on one toss will affect whether it lands 
heads or tails on the other toss. Thus it is reasonable to assume that the results of the tosses 
are independent. 


a. What is the probability of obtaining two heads? 

b. What is the probability of obtaining one head? 

c. What is the probability of obtaining no heads? 

d. What is the probability of obtaining at least one head? 


Solution The sample space S consists of the four outcomes {HH, HT, TH, TT}, which are 
not equally likely. Let E be the event that a head is obtained on the first toss, and let F be 
the event that a head is obtained on the second toss. Then P(E) = P(F) = 0.6, and it is to 
be assumed that E and F are independent. 


a. The event of obtaining two heads is E N F. And because E and F are independent, 


P(two heads) = P(E N F) = P(E): P(F) = (0.6)(0.6) = 0.36 = 36%. 
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b. One head can be obtained in two mutually exclusive ways: head on the first toss and 
tail on the second, or tail on the first toss and head on the second. Thus, the event of 
obtaining exactly one head is (E N F®) U (E° N F). Also (EN F$) A (E° N F) = Ø, 
and, moreover, by the formula for the probability of the complement of an event, 
P(E‘) = P(F®) = 1 — 0.6 = 0.4. Hence 

P(one head) = P((E F‘) U (E° N F)) 
= P(E) P(F\ + P(E‘): P(F) by Example 9.9.5 and exercise 22 
= (0.6)(0.4) + (0.4)(0.6) 
= 0.48 = 48%. 
c. The event of obtaining no heads is E° N F°. Thus, by exercise 22, 


P(no heads) = P(E® N F°) = P(E‘): P(F*) = (0.4)(0.4) = 0.16 = 16%. 


d. There are two ways to solve this problem. One is to observe that because the event of 
obtaining one head and the event of obtaining two heads are mutually disjoint, 
P(at least one head) = P(one head) + P(two heads) 
= 0.48 + 0.36 by parts (a) and (b) 
= 0.84 = 84%. 
The second way is to use the fact that the event of obtaining at least one head is the 
complement of the event of obtaining no heads. So 
P(at least one head) = 1 — P(no heads) 
1— 0.16 by part (c) 
= 0.84 = 84%. E 


EU ea Expected Value of Tossing a Loaded Coin Twice 


Suppose that a coin is loaded so that the probability of heads is 0.6, and suppose the coin 
is tossed twice. If this experiment is repeated many times, what is the expected value of 
the number of heads? 


Solution Think of the outcomes of the coin tosses as just 0, 1, or 2 heads. Example 9.9.6 
showed that the probabilities of these outcomes are 0.16, 0.48, and 0.36, respectively. Thus, 
by definition of expected value, the 


expected number of heads = 0- (0.16) + 1- (0.48) + 2- (0.36) = 1.2. E 


What if a loaded coin is tossed more than twice? Suppose it is tossed ten times, or a 
hundred times. What are the probabilities of various numbers of heads? 

The previous examples illustrated the fact that the probability of an intersection of two 
events is the product of their probabilities whenever the two events are independent. Is 
there a way to generalize the concept of independence so that the probability of the inter- 
section of more than two events can be found by multiplying their probabilities? 

To answer this question, we first define three events A, B, and C to be pairwise inde- 
pendent if, and only if, 


P(A N B) = P(A): P(B), P(A A C) = P(A): P(C), and P(B N C) = P(B): P(C). 


However, as the next example shows, events can be pairwise independent without having 
the probability of their intersection be the product of their probabilities. Moreover, events 
can satisfy the condition P(A N B N C) = P(A): P(B): P(C) without being pairwise inde- 
pendent (see exercise 26 at the end of this section). 
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CU ee Exploring Independence for Three Events 


Suppose that a fair coin is tossed twice. Let A be the event that a head is obtained on the 
first toss, B the event that a head is obtained on the second toss, and C the event that either 
two heads or two tails are obtained. Show that A, B, and C are pairwise independent but do 
not satisfy the condition P(A N B N C) = P(A): P(B): P(C). 


Solution Because there are four equally likely outeomes—HH, HT, TH, and TT—it is 
clearthat P(A) = P(B) = P(C) 5. Youcan also see that A N B= {HH},AN C= {HA}, 
BO C= {HH}, and A N B N C = {HH}. Hence P(A AB) = PAN C) = P(BN OC) = 
L, and so P(A N B) = P(A): P(B), P(A N ©) = P(A): P(C), and P(B N ©) = P(B): P(C). 
Thus A, B, and C are pairwise independent. However, 

3 
P(A A BN C) = P({HA}) = + (3) = P(A): P(B): P(C) a 


Because of situations like the ones in Example 9.9.8 and exercise 26, four conditions 
must be included in the definition of independence for three events. 


Definition 


Let A, B, and C be events in a sample space S. A, B, and C are pairwise independent 
if, and only if, they satisfy conditions 1-3 below. They are mutually independent if, 
and only if, they satisfy all four conditions below. 


1. P(A N B) = P(A)-P(B) 
2. PAN © =P(A):P(O) 
3. P(B N ©) = P(B):P(C) 
4. PAN BNC) = P(A)-P(B):P(O) 


The definition of mutual independence for any collection of n events with n = 2 general- 
izes the two definitions given previously. 


Definition 


Events A,, A>, A3, . . . , A„ in a sample space S are mutually independent if, and only 
if, the probability of the intersection of any subset of the events is the product of the 
probabilities of the events in the subset. 


EU eee Tossing a Loaded Coin Ten Times 


A coin is loaded so that the probability of heads is 0.6 (and thus the probability of tails is 
0.4). Suppose the coin is tossed ten times. As in Example 9.9.6, it is reasonable to assume 
that the results of the tosses are mutually independent. 


a. What is the probability of obtaining eight heads? 
b. What is the probability of obtaining at least eight heads? 


Solution 


a. For each i = 1, 2,..., 10, let H; be the event that a head is obtained on the ith toss, 
and let T; be the event that a tail is obtained on the ith toss. Suppose that the eight 
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heads occur on the first eight tosses and that the remaining two tosses are tails. This is 
the event H, N Hy 1 H; N H, N As N Hs NO H3 N Ag A To N Tio. For simplicity, we 
denote it as HHHHHHHHTT. By definition of mutually independent events, 


P(HHHHHHHHTT) = (0.6)°(0.4)°. 


Because of the commutative law for multiplication, if the eight heads occur on any 
other of the ten tosses, the same number is obtained. For instance, if we denote the 
event Hı N M A Ts N H4 N H; N Hs N H; AO Bg A ToN Hoby HHTHHHHH 
T H, then 


P(HHTHHHHHTH) = (0.6)(0.4)(0.6)°(0.4)(0.6) = (0.6)°(0.4)”. 


Now there are as many different ways to obtain eight heads in ten tosses as there are 
subsets of eight elements (the toss numbers on which heads are obtained) that can be 
chosen from a set of ten elements. This number is ea ) It follows that, because the dif- 
ferent ways of obtaining eight heads are all mutually exclusive, 


($) 8 4? 
P(eight heads) = 8 (0.6 (0.4. 


b. By reasoning similar to that in part (a), 


the number of different 10 
P(nine heads) = | ways nine heads can be |- (0.6)°(0.4)! = ( 5 Jogos, 
obtained in ten tosses 


and 


the number of different 10 
P(ten heads) = | ways ten heads can be _|-(0.6)!'°(0.4)° = ( ojoo”. 
obtained in ten tosses 


Because obtaining eight, obtaining nine, and obtaining ten heads are mutually disjoint 
events, 


P(at least eight heads) = P(eight heads) + P (nine heads) + P(ten heads) 


_ (10 soaa f 19 9 10 10 
Note Binomial proba- = ( 8 Jos (0.4) + ( 9 Jos (0.4) + (io) (0.6) 
bilities occur in situations 
with multiple, mutually = 0.167 = 16.7%. E 


independent repetitions of 
a random process, all of 
which have the same two 


possible outcomes with n\ wx 1—p) 
Pe 07D 


Note the occurrence of the binomial coefficients (") in solutions to problems like the one 
in Example 9.9.9. For that reason, probabilities of the form 


the same probabilities on 
each repetition. 
where 0 = p = 1, are called binomial probabilities. 


TEST YOURSELF 


1. If A and B are any events in a sample space S and 2. Bayes’ theorem says that if a sample space S is a 
P(A) # 0, then the conditional probability of B union of mutually disjoint events B4, Bo,..., B,, 
given A, denoted P(B|A), equals : each with a nonzero probability, if A is an event 
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3. 


EXERCISE SET9.9 


1. 


9.9 CONDITIONAL PROBABILITY, BAYES’ FORMULA, AND INDEPENDENT EVENTS 


in S with P(A) # 0, and if k is an integer with 
1 <k <n, then 


Events A and B in a sample space S are indepen- 
dent if, and only if, ——. 


673 


4. Events A, B, and C ina sample space S are mutually 


independent if, and only if, : , j 
and 


Suppose P(A |B) = 1/2 and P(A N B) = 1/6. 
What is PB)? 


. Suppose P(X|Y) = 1/3 and P(Y) = 1/4. What is 


P(X ¥)? 


. The instructor of a discrete mathematics class 


gave two tests. Twenty-five percent of the students 
received an A on the first test and 15% of the stu- 
dents received A’s on both tests. What percent of 
the students who received A’s on the first test also 
received A’s on the second test? 


a. Prove that if A and B are any events in 
a sample space S, with P(B) # 0, then 
P(A‘|B) = 1—P(A|B). 

b. Explain how the result in part (a) justifies the 
following statements: (1) If the probability of a 
false positive on a test for a condition is 4%, then 
there is a 96% probability that a person who 
does not have the condition will have a negative 
test result. (2) If the probability of a false nega- 
tive on a test for a condition is 1%, then there is a 
99% probability that a person who does have the 
condition will test positive for it. 


. Suppose that A and B are events in a sample space 


S and that P(A), P(B), and P(A |B) are known. 
Derive a formula for P(A | B°). 


. An urn contains 25 red balls and 15 blue balls. 


Two are chosen at random, one after the other, 

without replacement. 

a. Use a tree diagram to help calculate the follow- 
ing probabilities: the probability that both balls 
are red, the probability that the first ball is red 
and the second is not, the probability that the 
first ball is not red and the second is red, the 
probability that neither ball is red. 

b. What is the probability that the second ball 
is red? 

c. What is the probability that at least one of the 
balls is red? 


. Redo exercise 6 assuming that the urn contains 


30 red balls and 40 blue balls. 


. A pool of 10 semifinalists for a job consists of 


7 men and 3 women. Because all are considered 

equally qualified, the names of two of the semifi- 

nalists are drawn, one after the other, at random, 

to become finalists for the job. 

a. What is the probability that both finalists are 
women? 

b. What is the probability that both finalists are 
men? 


Hc. What is the probability that one finalist is a 
woman and the other is a man? 
H 9. Prove Bayes’ theorem for n = 2. That is, prove 


10. 
Tl. 


12. 


H 73. 


that if a sample space S is a union of mutually 
disjoint events B, and B}, if A is an event in S with 
P(A) + 0, and if k = 1 or k = 2, then 

P(A| BPB) 
P(A|B,)P(B,) + P(A|By)P(B>) 


P(B,|A) = 


Prove the full version of Bayes’ theorem. 


One urn contains 12 blue balls and 7 white balls, 

and a second urn contains 8 blue balls and 19 white 

balls. An urn is selected at random, and a ball is 

chosen from the urn. 

a. What is the probability that the chosen ball is 
blue? 

b. If the chosen ball is blue, what is the probabil- 
ity that it came from the first urn? 


Redo exercise 11 assuming that the first urn con- 
tains 4 blue balls and 16 white balls and the sec- 
ond urn contains 10 blue balls and 9 white balls. 


One urn contains 10 red balls and 25 green balls, 
and a second urn contains 22 red balls and 15 green 
balls. A ball is chosen as follows: First an urn is se- 
lected by tossing a loaded coin with probability 0.4 
of landing heads up and probability 0.6 of landing 
tails up. If the coin lands heads up, the first urn is 
chosen; otherwise, the second urn is chosen. Then a 
ball is picked at random from the chosen urn. 
a. What is the probability that the chosen ball is 
green? 
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14. 


15. 


17. 


18. 


19. 


CHAPTER9 COUNTING AND PROBABILITY 


b. If the chosen ball is green, what is the prob- 
ability that it was picked from the first urn? 


A drug-screening test is used in a large popula- 
tion of people of whom 4% actually use drugs. 
Suppose that the false positive rate is 3% and the 
false negative rate is 2%. Thus a person who uses 
drugs tests positive for them 98% of the time, and 
a person who does not use drugs tests negative for 
them 97% of the time. 

a. What is the probability that a randomly chosen 
person who tests positive for drugs actually 
uses drugs? 

b. What is the probability that a randomly chosen 
person who tests negative for drugs does not 
use drugs? 


Two different factories both produce a certain 
automobile part. The probability that a component 
from the first factory is defective is 2%, and the 
probability that a component from the second 
factory is defective is 5%. In a supply of 180 of the 
parts, 100 were obtained from the first factory and 
80 from the second factory. 
a. What is the probability that a part chosen at 
random from the 180 is from the first factory? 
b. What is the probability that a part chosen 
at random from the 180 is from the second 
factory? 
c. What is the probability that a part chosen at 
random from the 180 is defective? 
d. If the chosen part is defective, what is the 
probability that it came from the first factory? 


. Three different suppliers—x, Y, and Z—provide 


produce for a grocery store. Twelve percent of 
produce from X is superior grade, 8% of produce 
from Y is superior grade, and 15% of produce 
from Z is superior grade. The store obtains 20% of 
its produce from X, 45% from Y, and 35% from Z. 
a. If apiece of produce is purchased, what is the 
probability that it is superior grade? 
b. Ifa piece of produce in the store is superior 
grade, what is the probability that it is from X? 


Prove that if A and B are events in a sample space 
S with the property that P(A|B) = P(A) and 
P(A) + 0, then P(B|A) = P(B). 


Prove that if P(A O B) = P(A): P(B), P(A) # 0, and 
P(B) + 0, then P(A|B) = P(A) and P(B|A) = P(B). 


A pair of fair dice, one blue and the other gray, 
are rolled. Let A be the event that the number face 
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20. 


21. 


22. 


23. 


24. 


25. 


up on the blue die is 2, and let B be the event that 
the number face up on the gray die is 4 or 5. Show 
that P(A|B) = P(A) and P(B|A) = P(B). 


Suppose a fair coin is tossed three times. Let A 
be the event that a head appears on the first toss, 
and let B be the event that an even number of 
heads is obtained. Show that P(A|B) = P(A) and 
P(B\A) = P(B). 


If A and B are events in a sample space S and 
A N B = ©, what must be true in order for A and 
B to be independent? Explain. 


Prove that if A and B are independent events in a 
sample space S, then A“ and B are also indepen- 
dent, and so are A‘ and B®. 


A student taking a multiple-choice exam does 
not know the answers to two questions. All have 
five choices for the answer. For one of the two 
questions, the student can eliminate two answer 
choices as incorrect but has no idea about the 
other answer choices. For the other question, the 
student has no clue about the correct answer at all. 
Assume that whether the student chooses the cor- 
rect answer on one of the questions does not affect 
whether the student chooses the correct answer on 
the other question. 
a. What is the probability that the student will 
answer both questions correctly? 
b. What is the probability that the student will 
answer exactly one of the questions correctly? 
c. What is the probability that the student will 
answer neither question correctly? 


A software company uses two quality assurance 

(QA) checkers X and Y to check an application for 

bugs. X misses 12% of the bugs and Y misses 15%. 

Assume that the QA checkers work independently. 

a. What is the probability that a randomly chosen 
bug will be missed by both QA checkers? 

b. If the program contains 1,000 bugs, what num- 
ber can be expected to be missed? 


A coin is loaded so that the probability of heads is 

0.7 and the probability of tails is 0.3. Suppose that 

the coin is tossed twice and that the results of the 

tosses are independent. 

a. What is the probability of obtaining exactly 
two heads? 

b. What is the probability of obtaining exactly 
one head? 

c. What is the probability of obtaining no heads? 


* 26. 


H 27. 


28. 


29. 


30. 


9.9 CONDITIONAL PROBABILITY, BAYES’ FORMULA, AND INDEPENDENT EVENTS 


d. What is the probability of obtaining at least 
one head? 


Describe a sample space and events A, B, and C, 
where P(A N BM C) = P(A): P(B): P(C) but A, 
B, and C are not pairwise independent. 


The example used to introduce conditional prob- 
ability described a family with two children each 
of whom was equally likely to be a boy or a girl. 
The example showed that if it is known that one 
child is a boy, the probability that the other child 
is a boy is 1/3. Now imagine the same kind of 
family—two children each of whom is equally 
likely to be a boy or a girl. Suppose you meet one 
of the children and that child is a boy. What is the 
probability that the other child is a boy? Explain. 
(Be careful. The answer may surprise you.) 


A coin is loaded so that the probability of heads is 

0.7 and the probability of tails is 0.3. Suppose that 

the coin is tossed ten times and that the results of 

the tosses are mutually independent. 

a. What is the probability of obtaining exactly 
seven heads? 

b. What is the probability of obtaining exactly 
ten heads? 

c. What is the probability of obtaining no heads? 

d. What is the probability of obtaining at least 
one head? 


Suppose that ten items are chosen at random from 

a large batch delivered to a company. The manu- 

facturer claims that just 3% of the items in the 

batch are defective. Assume that the batch is large 

enough so that even though the selection is made 

without replacement, the number 0.03 can be used 

to approximate the probability that any one of the 

ten items is defective. In addition, assume that be- 

cause the items are chosen at random, the outcomes 

of the choices are mutually independent. Finally, 

assume that the manufacturer’s claim is correct. 

a. What is the probability that none of the ten is 
defective? 

b. What is the probability that at least one of the 
ten is defective? 

c. What is the probability that exactly four of the 
ten are defective? 

d. What is the probability that at most two of the 
ten are defective? 


Suppose the probability of a false positive result 
on a mammogram is 4% and that radiologists’ 


31. 


Ha. 
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interpretations of mammograms are mutually in- 
dependent in the sense that whether or not a radi- 
ologist finds a positive result on one mammogram 
does not influence whether or not a radiologist 
finds a positive result on another mammogram. 
Assume that a woman has a mammogram every 
year for ten years. 
a. What is the probability that she will have no 
false positive results during that time? 
b. What is the probability that she will have at 
least one false positive result during that time? 
c. What is the probability that she will have ex- 
actly two false positive results during that time? 
d. Suppose that the probability of a false negative 
result on a mammogram is 2%, and assume 
that the probability that a randomly chosen 
woman has breast cancer is 0.0002. 

(i) If a woman has a positive test result one 
year, what is the probability that she actu- 
ally has breast cancer? 

(ii) If a woman has a negative test result one 
year, what is the probability that she actu- 
ally has breast cancer? 


Empirical data indicate that approximately 103 out 

of every 200 children born are male. Hence the 

probability of a newborn being male is about 

51.5%. Suppose that a family has six children, and 

suppose that the genders of all the children are 

mutually independent. 

What is the probability that none of the chil- 

dren is male? 

b. What is the probability that at least one of the 
children is male? 

c. What is the probability that exactly five of the 
children are male? 


32. A person takes a multiple-choice exam in which 


each question has four possible answers. Suppose 

that the person has no idea about the answers to 

three of the questions and simply chooses ran- 

domly for each one. 

a. What is the probability that the person will 
answer all three questions correctly? 

b. What is the probability that the person will 
answer exactly two questions correctly? 

c. What is the probability that the person will 
answer exactly one question correctly? 

d. What is the probability that the person will 
answer no questions correctly? 

e. Suppose that the person gets one point of cred- 
it for each correct answer and that 1/3 point 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 


Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


676 CHAPTER9 COUNTING AND PROBABILITY 


is deducted for each incorrect answer. What is axioms and the definition of conditional probability 
the expected value of the person’s score for the to derive the equation 
three questions? 1 5 
: : Pii = -Pet Pko 
33. In exercise 23 of Section 9.8, let C; be the event Erl 6 G 6 kaa 


that the gambler has k dollars, wins the next roll 
of the die, and is eventually ruined, let D, be the 
event that the gambler has k dollars, loses the next 
roll of the die, and is eventually ruined, and let 

P, be the probability that the gambler is eventu- 
ally ruined if he has k dollars. Use the probability 


H 34. Use conditional probability to analyze exercise 
20 in Section 9.1. Let X be the event that the prize 
is not behind door A, and let Y be the event that 
you switch and choose the door with the prize. 
Should you switch? Explain why or why not. 


ANSWERS FOR TEST YOURSELF 
PANB) ,, Pe P(A|B,)P(By) 
P(A) P(A|B,)P(B,) + P(A |B2)P(B2) +++ + P(A|B,)P(B,) 


3. P(A N B) = P(A): P(B) 


4. P(A N B) = P(A): P(B); P(A N C) = P(A): P(C), P(B AN C) = P(B): P(C); P(A N B N ©) = P(A): P(B): P(C) 
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THEORY OF GRAPHS 
AND TREES 


Throughout the book graphs and trees have been used as convenient visualizations. For in- 
stance, a graph can model a wide variety of structures such as the arrangement of electric 
power lines or fiber optic cables, a transportation system, a knowledge base, or a collection 
of computers ranging from a small local area network to the entire world wide web. A graph 
model can be used to solve a logical problem, color a map, or schedule meetings. A possibil- 
ity tree shows all potential results of a multistep operation with a finite number of outcomes 
for each step, the directed graph of a relation on a set shows which element of the set are 
related to which, a Hasse diagram illustrates the relations among elements in a partially or- 
dered set, and a PERT diagram shows which tasks must precede which in executing a project. 

In Chapter 1 we introduced the basic terminology of graphs, and in Section 4.9 we used 
properties of even and odd integers and direct and indirect proof to prove the handshake 
theorem and derive some of its consequences. We first proved the formula for the number 
of edges in a complete graph on n vertices using the handshake theorem, and then reproved 
it using mathematical induction in Section 5.3, recursion in Section 5.6, and combinatorial 
reasoning in Section 9.5. 

In this chapter we go more deeply into the mathematics of graphs and trees by exploring 
the concepts of connectedness, Euler and Hamiltonian circuits, representation of graphs 
by matrices, isomorphisms of graphs, the relations between the number of vertices and the 
number of edges in a tree, properties of rooted trees, spanning trees, and finding short- 
est paths in graphs. Applications include uses of graphs and trees in the study of decision 
problems, chemistry, data storage, computer language syntax, and transportation networks. 


EAI Trails, Paths, and Circuits 


One can begin to reason only when a clear picture has been formed in the 
imagination. —W. W. Sawyer, Mathematician’s Delight, 1943 


The subject of graph theory began in the year 1736 when the great mathematician Leon- 
hard Euler published a paper giving the solution to the following puzzle: 


The town of Königsberg in Prussia (now Kaliningrad in Russia) was built at a point where 
two branches of the Pregel River came together. It consisted of an island and some land 
along the river banks. These were connected by seven bridges as shown in Figure 10.1.1. 
The question is this: Is it possible for a person to take a walk around town, starting 
and ending at the same location and crossing each of the seven bridges exactly once?* 


*In his original paper, Euler did not require the walk to start and end at the same point. The analysis of the 
problem is simplified, however, by adding this condition. Later in the section we discuss walks that start and 
end at different points. 
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Pregel River VA 


AKG Images 


FIGURE 10.1.1 The Seven Bridges of Königsberg 


To solve this puzzle, Euler translated it into a graph theory problem. He noticed that all 
points of a given land mass can be identified with each other since a person can travel from 
any one point to any other point of the same land mass without crossing a bridge. Thus for 
the purpose of solving the puzzle, the map of Kénigsberg can be identified with the graph 
shown in Figure 10.1.2, in which the vertices A, B, C, and D represent land masses and the 
seven edges represent the seven bridges. 


Bettmann/Getty Images 


Leonhard Euler 
(1707-1783) D 


FIGURE 10.1.2 Graph Version of Königsberg Map 


In terms of this graph, the question becomes the following: 


Is it possible to find a route through the graph that starts and ends at 
some vertex, one of A, B, C, or D, and traverses each edge exactly once? 


Equivalently: 


Is it possible to trace this entire graph, starting and ending at the same 
point, without either ever lifting your pencil from the paper or crossing 
an edge more than once? 


Take a few minutes to think about the question yourself. Can you find a route that meets 
the requirements? Try it! 

Looking for a route is frustrating because you continually find yourself at a vertex 
that does not have an unused edge on which to leave, while elsewhere there are unused 
edges that must still be traversed. If you start at vertex A, for example, each time you 
pass through vertex B, C, or D, you use up two edges because you arrive on one edge 
and depart on a different one. So, if it is possible to find a route that uses all the edges 
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of the graph and starts and ends at A, then the total number of arrivals and departures 
from each vertex B, C, and D must be a multiple of 2. Or, in other words, the degrees 
of the vertices B, C, and D must be even. But they are not: deg(B) = 5, deg(C) = 3, and 
deg(D) = 3. Hence there is no route that solves the puzzle by starting and ending at A. 
Similar reasoning can be used to show that there are no routes that solve the puzzle by 
starting and ending at B, C, or D. Therefore, it is impossible to travel all around the city 
crossing each bridge exactly once. 


Definitions 


Travel in a graph is accomplished by moving from one vertex to another along a sequence 
of adjacent edges. In the graph below, for instance, you can go from u; to u4 by taking fı to 
uy and then f} to u4. This is represented by writing 


Uy fit fyi, 


Or you could take the roundabout route 


ufi U2 f3uz faun fz U3 fs U4 foty f7U2f3Usfs U4. 


Certain types of sequences of adjacent vertices and edges are of special importance 
in graph theory: those that do not have a repeated edge, those that do not have a repeated 
vertex, and those that start and end at the same vertex. 


Definition 


Let G be a graph, and let v and w be vertices in G. 


A walk from v to w is a finite alternating sequence of adjacent vertices and edges of 
G. Thus a walk has the form 


Vo €1 V1 62° Vn—1n Vw 


where the v’s represent vertices, the e’s represent edges, vp = v, v, = w, and for each 
i= 1,2, ... n,v;-; and v; are the endpoints of e;. The trivial walk from v to v con- 
sists of the single vertex v. 


A trail from v to w is a walk from v to w that does not contain a repeated edge. 
A path from v to w is a trail that does not contain a repeated vertex. 
A closed walk is a walk that starts and ends at the same vertex. 


A circuit is a closed walk that contains at least one edge and does not contain a 
repeated edge. 


A simple circuit is a circuit that does not have any other repeated vertex except the 
first and last. 
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For ease of reference, these definitions are summarized in the following table: 


Repeated | Repeated | Starts and Ends at | Must Contain at 
Edge? Vertex? the Same Point? Least One Edge? 

Walk allowed allowed allowed no 
Trail no allowed allowed no 
Path no no no no 
Closed walk allowed allowed yes no 
Circuit no allowed yes yes 
Simple no first and yes yes 
circuit last only 


Often a walk can be specified unambiguously by giving either a sequence of edges or a 
sequence of vertices. The next two examples show how this is done. 


Notation for Walks 


Example 10.1.1 


a. In the graph below, the notation e)e,e4e3 refers unambiguously to the following walk: 
V1€1V2€2V3€4V3€3V2. On the other hand, the notation e; is ambiguous if used by itself to 
refer to a walk. It could mean either v,e,v2 or v2e)4. 


ez 


e4 
Vi Uy v 


a 
b. In the graph of part (a), the notation vzv3 is ambiguous if used to refer to a walk. 
It could mean v2e7V3 or v2e3V3. On the other hand, in the graph below, the notation 
V1V2V2V3 refers unambiguously to the walk v;ev2ezv2€3v3. 


65 
e A . . 
Yy ea wn 83 y a 


Note that if a graph G does not have any parallel edges, then any walk in G is uniquely 
determined by its sequence of vertices. 


Example 10.1.2 


Walks, Trails, Paths, and Circuits 


In the graph below, determine which of the following walks are trails, paths, circuits, or 
simple circuits. 


a. V1E1V2€3V3€4V3€5V4 b. C1 €3€5€5€6 C. VoV3V4V5V3V6V2 


d. Vov3V4V5VoV2 e. VyeyV2e1V1 f. vy 
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Solution 


a. This walk has a repeated vertex but does not have a repeated edge, so it is a trail from 
vı to v4 but not a path. 


b. This is just a walk from y; to vs. It is not a trail because it has a repeated edge. 


c. This walk starts and ends at v>, contains at least one edge, and does not have a re- 
peated edge, so it is a circuit. Since the vertex v3 is repeated in the middle, it is not a 
simple circuit. 


d. This walk starts and ends at v2, contains at least one edge, does not have a repeated 
edge, and does not have a repeated vertex. Thus it is a simple circuit. 


e. This is just a closed walk starting and ending at v4. It is not a circuit because edge e} is 
repeated. 


f. The first vertex of this walk is the same as its last vertex, but it does not contain an edge, 
and so it is not a circuit. It is a closed walk from v; to vj. (It is also a trail from v; to v;.) E 


Because most of the major developments in graph theory have happened relatively recently 
and in a variety of different contexts, the terms used in the subject have not been standard- 
ized. For example, what this book calls a graph is sometimes called a multigraph, what 
this book calls a simple graph is sometimes called a graph, what this book calls a vertex 
is sometimes called a node, and what this book calls an edge is sometimes called an arc. 
Similarly, instead of the word trail, the word path is sometimes used; instead of the word 
path, the words simple path are sometimes used; and instead of the words simple circuit, 
the word cycle is sometimes used. The terminology in this book is among the most com- 
mon, but if you consult other sources, be sure to check their definitions. 


Subgraphs 


Definition 


A graph H is said to be a subgraph of a graph G if, and only if, every vertex in H is 
also a vertex in G, every edge in H is also an edge in G, and every edge in H has the 
same endpoints as it has in G. 


Subgraphs 


List all subgraphs of the graph G with vertex set {v,, v2} and edge set {e), e2, e3}, where 
the endpoints of e; are vı and v, the endpoints of e, are vı and v}, and e; is a loop at v}. 


Solution G can be drawn as shown below. 


vi e3 


v3 


There are 11 subgraphs of G, which can be grouped according to those that do not have any 
edges, those that have one edge, those that have two edges, and those that have three edges. 
The 11 subgraphs are shown in Figure 10.1.3 on the next page. 
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vi vi vi vi vi ie 
e e 
ey 
ez 
e e 
v v v v 
1 2 3 4 5 6 
Vi (on Vi Uy ez v ez vi ez 
ei ei ei 
g) e2 e2 
v U2 v U2 vy 
7 8 9 10 11 
FIGURE 10.1.3 E 
Connectedness 


It is easy to understand the concept of connectedness on an intuitive level. Roughly speak- 
ing, a graph is connected if it is possible to travel from any vertex to any other vertex along 
a sequence of adjacent edges of the graph. The formal definition of connectedness is stated 
in terms of walks. 


Definition 


Let G be a graph. Two vertices v and w of G are connected if, and only if, there is 
a walk from v to w. The graph G is connected if, and only if, given any two vertices 
v and w in G, there is a walk from v to w. Symbolically: 


G is connected «æ V vertices v and w in G, Ja walk from v to w. 


If you take the negation of this definition, you will see that a graph G is not connected 
if, and only if, there exist two vertices of G that are not connected by any walk. 


Connected and Disconnected Graphs 


Which of the following graphs are connected? 
v3 
U4 vy Us U6 v v 
V3 U4 
v Us e 7 
vi V6 vi v3 “r V6 
Ug v 
(a) (b) (c) 


Solution The graph represented in (a) is connected, whereas those of (b) and (c) are 
not. To understand why (c) is not connected, recall that in a drawing of a graph, two 
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edges may cross at a point that is not a vertex. Thus the graph in (c) can be redrawn as 
follows: 


V6 | 


Some useful facts relating circuits and connectedness are collected in the following 
lemma. Proofs of (a) and (b) are left for the exercises. The proof of (c) is in Section 10.4. 


Lemma 10.1.1 
Let G be a graph. 


a. If Gis connected, then any two distinct vertices of G can be connected by a path. 


b. If vertices v and w are part of a circuit in G and one edge is removed from the 
circuit, then there still exists a trail from v to w in G. 


c. If Gis connected and G contains a circuit, then an edge of the circuit can be re- 
moved without disconnecting G. 


Look back at Example 10.1.4. The graphs in (b) and (c) are both made up of three pieces, 
each of which is itself a connected graph. A connected component of a graph is a con- 
nected subgraph of largest possible size. 


Definition 
A graph H is a connected component of a graph G if, and only if, 
1. H is subgraph of G; 
2. H is connected; and 


3. no connected subgraph of G has H as a subgraph and contains vertices or edges 
that are not in H. 


The fact is that any graph is a kind of union of its connected components. 


Sena CNey Connected Components 


Find all connected components of the following graph G. 


Vy Us v6 
e3 
ez ê e4 
U4 5 
vi ey V3 Ug v 


Solution G has three connected components: H,, H}, and H, with vertex sets V}, V>, and 
V; and edge sets E,, E>, and E3, where 


Vi = (V1, Vo, v3}, E, = {e;, e2}, 
Vo = {v4}, E, =Ø, 
V3 = {V5, Ve, Vn Vgh E3 = {€3, e4, €s}. E 
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Euler Circuits 


Now we return to consider general problems similar to the puzzle of the Königsberg bridges. 
The following definition is made in honor of Euler. 


Definition 


Let G be a graph. An Euler circuit for G is a circuit that contains every vertex and 
every edge of G. That is, an Euler circuit for G is a sequence of adjacent vertices and 
edges in G that has at least one edge, starts and ends at the same vertex, uses every 
vertex of G at least once, and uses every edge of G exactly once. 


The analysis used earlier to solve the puzzle of the Königsberg bridges generalizes to 
prove the following theorem: 


Theorem 10.1.2 


If a graph has an Euler circuit, then every vertex of the graph has positive even degree. 


Proof: 

Suppose G is a graph that has an Euler circuit. [We must show that given any vertex v of G, 
the degree of v is even.] Let v be any particular but arbitrarily chosen vertex of G. Since the 
Euler circuit contains every edge of G, it contains all edges incident on v. Now imagine 
taking a journey that begins in the middle of one of the edges adjacent to the start of the 
Euler circuit and continues around the Euler circuit to end in the middle of the starting 
edge. (See Figure 10.1.4. There is such a starting edge because the Euler circuit has at least 
one edge.) Each time v is entered by traveling along one edge, it is immediately exited by 
traveling along another edge (since the journey ends in the middle of an edge). 


Start here First entry/exit 


pair of edges In this example, the Euler circuit 
vy iS VoU; UV_V3U4U2U5UQ, and v is v3. 
v3 Each time v, is entered by one 


y edge, it is exited by another edge. 
0 


Second entry/exit 
pair of edges 
FIGURE 10.1.4 Example for the Proof of Theorem 10.1.2 


Because the Euler circuit uses every edge of G exactly once, every edge inci- 
dent on v is traversed exactly once in this process. Hence the edges incident on 
v occur in entry/exit pairs, and consequently the degree of v must be a positive 
multiple of 2. But that means that v has positive even degree /as was to be shown]. 


Recall that the contrapositive of a statement is logically equivalent to the statement. The 
contrapositive of Theorem 10.1.2 is as follows: 


Contrapositive Version of Theorem 10.1.2 


If some vertex of a graph has odd degree, then the graph does not have an Euler circuit. 
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This version of Theorem 10.1.2 is useful for showing that a given graph does not have 
an Euler circuit. 


Showing That a Graph Does Not Have an Euler Circuit 


Show that the graph below does not have an Euler circuit. 


e7 


Solution Vertices v, and v3 both have degree 3, which is odd. Hence by (the contraposi- 
tive form of) Theorem 10.1.2, this graph does not have an Euler circuit. a 


Now consider the converse of Theorem 10.1.2: If every vertex of a graph has even de- 
gree, then the graph has an Euler circuit. Is this true? The answer is no. There is a graph G 
such that every vertex of G has even degree but G does not have an Euler circuit. In fact, 
there are many such graphs. The illustration below shows one example. 


vy v3 
ei 863 Every vertex has even degree, 
but the graph does not have 
ez C4 an Euler circuit. 
v U4 


Note that the graph in the preceding drawing is not connected. It turns out that 
although the converse of Theorem 10.1.2 is false, a modified converse is true: If ev- 
ery vertex of a graph has positive even degree and if the graph is connected, then 
the graph has an Euler circuit. The proof of this fact is constructive: It contains an 
algorithm to find an Euler circuit for any connected graph in which every vertex has 
even degree. 


Theorem 10.1.3 


If a graph G is connected and the degree of every vertex of G is a positive even inte- 
ger, then G has an Euler circuit. 


Proof: 

Suppose that G is any connected graph and suppose that every vertex of G is a positive 
even integer. [We must find an Euler circuit for G.] Construct a circuit C by the following 
algorithm: 


Step 1: Pick any vertex v of G at which to start. 
[This step can be accomplished because the vertex set of G is nonempty by 
assumption.] 


Step 2: Pick any sequence of adjacent vertices and edges, starting and ending at v 
and never repeating an edge. Call the resulting circuit C. 


(continued on page 686) 
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[This step can be performed for the following reasons: Since the degree of each 
vertex of G is a positive even integer, as each vertex of G is entered by traveling 
on one edge, either the vertex is v itself and there is no other unused edge adja- 
cent to v, or the vertex can be exited by traveling on another previously unused 
edge. Since the number of edges of the graph is finite (by definition of graph), 
the sequence of distinct edges cannot go on forever. The sequence eventually 
returns to v because the degree of v is a positive even integer, and so each time 
an edge leads out from v to another vertex, there must be a different edge that 
connects back in to v.] 


Step 3: Check whether C contains every edge and vertex of G. If so, C is an Euler 
circuit, and we are finished. If not, perform the following steps. 


Step 3a: Remove all edges of C from G and also any vertices that become 
isolated when the edges of C are removed. Call the resulting sub- 
graph G”. 
[Note that G' may not be connected (as illustrated in Figure 10.1.5), 
but every vertex of G' has positive, even degree (since removing the 
edges of C removes an even number of edges from each vertex, the dif- 
ference of two even integers is even, and isolated vertices with degree 0 
were removed).] 


C 


/ 


u w 


z 


g 
FIGURE 10.1.5 


Step 3b: Pick any vertex w common to both C and G”. 
[There must be at least one such vertex since G is connected. (See exer- 
cise 50.) (In Figure 10.1.5 there are two such vertices: u and w.)] 


Step 3c: Pick any sequence of adjacent vertices and edges of G’, starting 
and ending at w and never repeating an edge. Call the resulting 
circuit C’. 


[This can be done since each vertex of G' has positive, even degree and 
G' is finite. See the justification for step 2.] 


Step 3d: Patch C and C’ together to create a new circuit C” as follows: Start 
at v and follow C all the way to w. Then follow C’ all the way 
back to w. After that, continue along the untraveled portion of C to 
return to v. 

[The effect of executing steps 3c and 3d for the graph of Figure 10.1.5 is 
shown in Figure 10.1.6.] 
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l LX 


FIGURE 10.1.6 


Step 3e: Let C = C” and go back to step 3. 


Since the graph G is finite, execution of the steps outlined in this algorithm must 
eventually terminate. At that point an Euler circuit for G will have been constructed. 
(Note that because of the element of choice in steps 1, 2, 3b, and 3c, a variety of dif- 
ferent Euler circuits can be produced by using this algorithm.) 


Finding an Euler Circuit 


Use Theorem 10.1.3 to check that the graph below has an Euler circuit. Then use the algo- 
rithm from the proof of the theorem to find an Euler circuit for the graph. 


Solution Observe that 


deg(a) = deg(b) = deg(c) = deg(f) = deg(g) = deg(i) = deg(j) = 2 
and that deg(d) = deg(e) = deg(h) = 4. 


Hence all vertices have even degree. Also, the graph is connected. Thus, by Theorem 10.1.3, 
the graph has an Euler circuit. 

To construct an Euler circuit using the algorithm of Theorem 10.1.3, let v = a and let 
C be 


C: abcda. 


C is represented by the labeled edges shown below. 
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Observe that C is not an Euler circuit for the graph but that C intersects the rest of the graph 
at d. Let C’ be 


C’: deghjid. 
Patch C’ into C to obtain 
C": abcdeghjida. 
Set C = C". Then C is represented by the labeled edges shown below. 


Observe that C is not an Euler circuit for the graph but that it intersects the rest of the graph 
at e and h. Let C’ be 


C”: efhe. 
Patch C’ into C to obtain 
C": abcdefheghjida. 
Set C = C". Then C is represented by the labeled edges shown below. 


Since C includes every edge of the graph exactly once, C is an Euler circuit for the graph. E 


In exercise 51 at the end of this section you are asked to show that any graph with 
an Euler circuit is connected. This result can be combined with Theorems 10.1.2 and 
10.1.3 to give a complete characterization of graphs that have Euler circuits, as stated in 
Theorem 10.1.4. 


Theorem 10.1.4 


A graph G has an Euler circuit if, and only if, G is connected and every vertex of G 
has positive even degree. 


A corollary to Theorem 10.1.4 gives a criterion for determining when it is possible to 
find a walk from one vertex of a graph to another, passing through every vertex of the 
graph at least once and every edge of the graph exactly once. 


Definition 


Let G be a graph, and let v and w be two distinct vertices of G. An Euler trail from 
v to w is a sequence of adjacent edges and vertices that starts at v, ends at w, passes 
through every vertex of G at least once, and traverses every edge of G exactly once. 
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Corollary 10.1.5 


Let G be a graph, and let v and w be two distinct vertices of G. There is an Euler trail 
from v to w if, and only if, G is connected, v and w have odd degree, and all other 
vertices of G have positive even degree. 


The proof of this corollary is left as an exercise. 


eT CMCAe:s Finding an Euler Trail 


The floor plan shown below is for a house that is open for public viewing. Is it possible to 
find a trail that starts in room A, ends in room B, and passes through every interior door- 
way of the house exactly once? If so, find such a trail. 


Solution Let the floor plan of the house be represented by the graph below, where the 
edges indicate the openings between the rooms. 


Each vertex of this graph has even degree except for A and B, each of which has degree 1. 
Hence by Corollary 10.1.5, there is an Euler trail from A to B. One such trail is 


AGHFEIHEKJDCEB. E] 


Hamiltonian Circuits 


Theorem 10.1.4 completely answers the following question: Given a graph G, is it possible 
to find a circuit for G in which all the edges of G appear exactly once? A related question 
is this: Given a graph G, is it possible to find a circuit for G in which all the vertices of G 
(except the first and the last) appear exactly once? 

In 1859 the Irish mathematician Sir William Rowan Hamilton introduced a puzzle in 
the shape of a dodecahedron (DOH-dek-a-HEE-dron). (Figure 10.1.7 contains a drawing of 
a dodecahedron, which is a solid figure with 12 identical pentagonal faces.) 


Hulton Archive/Getty Images 


i 


ea, 


Sir William Hamilton 
(1805-1865) FIGURE 10.1.7 Dodecahedron 
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Each vertex was labeled with the name of a city—London, Paris, Hong Kong, New York, 
and so on. The problem Hamilton posed was to start at one city and tour the world by 
visiting each other city exactly once and returning to the starting city. One way to solve 
the puzzle is to imagine the surface of the dodecahedron stretched out and laid flat in the 
plane, as follows: 


One solution is the circuit 
ABCDEFGHIJKLMNOPORSTA, 


whose edges are indicated with black lines. Note that although every city is visited, many 
edges are omitted from the circuit. (More difficult versions of the puzzle required that 
certain cities be visited in a certain order.) 

The following definition is made in honor of Hamilton. 


Definition 


Given a graph G, a Hamiltonian circuit for G is a simple circuit that includes every 
vertex of G. That is, a Hamiltonian circuit for G is a sequence of adjacent vertices 
and distinct edges in which every vertex of G appears exactly once, except for the 
first and the last, which are the same. 


Note that although an Euler circuit for a graph G must include every vertex of G, it may 
visit some vertices more than once and hence may not be a Hamiltonian circuit. On the 
other hand, a Hamiltonian circuit for G does not need to include all the edges of G and 
hence may not be an Euler circuit. 

Despite the analogous-sounding definitions of Euler and Hamiltonian circuits, the 
mathematics of the two are very different. Theorem 10.1.4 gives a simple criterion for de- 
termining whether a given graph has an Euler circuit. Unfortunately, there is no analogous 
criterion for determining whether a given graph has a Hamiltonian circuit, nor is there even 
an efficient algorithm for finding such a circuit. There is, however, a simple technique that 
can be used in many cases to show that a graph does not have a Hamiltonian circuit. This 
follows from the following considerations: 

Suppose a graph G with at least two vertices has a Hamiltonian circuit C, given con- 
cretely as 


C: voei": Vn—1€nYn (=Vo)- 
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Since C is a simple circuit, all the e; are distinct and all the v; are distinct except that 
Vo = V,. Let H be the subgraph of G that is formed using the vertices and edges of C. An 


example of such an His shown below. 


U7 U4 


vo 


63 
vi ez Vy 


C: Up € Uj Vz €3U3 C44 E5U5 CGUG C7U7Lg Vo 


The edges of H are shown in black. 


Note that H has the same number of edges as it has vertices since all its edges are distinct 
and so are its vertices. Also, by definition of Hamiltonian circuit, every vertex of G is a 
vertex of H, and H is connected since any two of its vertices lie on a circuit. In addition, ev- 
ery vertex of H has degree 2. The reason for this is that there are exactly two edges incident 
on any vertex. These are e; and e;,, for any vertex v; except vo = v,, and they are e; and e, 
for vo(=v,). These observations have established the truth of the following proposition in 
all cases where G has at least two vertices. 


Proposition 10.1.6 


If a graph G has a Hamiltonian circuit, then G has a subgraph H with the following 
properties: 

1. H contains every vertex of G. 

2. His connected. 

3. H has the same number of edges as vertices. 

4. Every vertex of H has degree 2. 


Note that if G contains only one vertex and G has a Hamiltonian circuit, then the circuit 
has the form v e v, where v is the vertex of G and e is an edge incident on v. In this case, the 
subgraph H consisting of v and e satisfies conditions (1)—(4) of Proposition 10.1.6. 

Recall that the contrapositive of a statement is logically equivalent to the statement. The 
contrapositive of Proposition 10.1.6 says that if a graph G does not have a subgraph H with 
properties (1)—(4), then G does not have a Hamiltonian circuit. 


Showing That a Graph Does Not Have a Hamiltonian Circuit 


Prove that the graph G shown below does not have a Hamiltonian circuit. 


e d 


Solution If G has a Hamiltonian circuit, then by Proposition 10.1.6, G has a subgraph H 
that (1) contains every vertex of G, (2) is connected, (3) has the same number of edges as 
vertices, and (4) is such that every vertex has degree 2. Suppose such a subgraph H exists. 
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In other words, suppose there is a connected subgraph H of G such that H has five vertices 
(a, b, c, d, e) and five edges and such that every vertex of H has degree 2. Since the degree 
of b in G is 4 and every vertex of H has degree 2, two edges incident on b must be removed 
from G to create H. Edge {a, b} cannot be removed because if it were, vertex a would have 
degree less than 2 in H. Similar reasoning shows that edges {e, b}, {b, a}, and {b, d} can- 
not be removed either. It follows that the degree of b in H must be 4, which contradicts the 
condition that every vertex in H has degree 2 in H. Hence no such subgraph H exists, and 
so G does not have a Hamiltonian circuit. E 


The next example illustrates a type of problem known as a traveling salesman problem. 
It is a variation of the problem of finding a Hamiltonian circuit for a graph. 


ZEW Cee A Traveling Salesman Problem 


Imagine that the drawing below is a map showing four cities and the distances in kilometers be- 
tween them. Suppose that a salesman must travel to each city exactly once, starting and ending 
in city A. Which route from city to city will minimize the total distance that must be traveled? 


B 
30 ë 
7 25 
A 40 D 


Solution This problem can be solved by writing all possible Hamiltonian circuits start- 
ing and ending at A and calculating the total distance traveled for each. 


Route Total Distance (In Kilometers) 
ABCDA 30 + 30+ 25 +40 = 125 
ABDCA 30+ 35+ 25 +50 = 140 
ACBDA 50+ 30+ 35 +40 = 155 


ACDBA 140 [A B D CA backward] 
ADBCA 155 [A C B D A backward] 
ADCBA 125 [A B C DA backward] 


Thus either route A B CDA or A D C B A gives a minimum total distance of 125 
kilometers. E 


The general traveling salesman problem involves finding a Hamiltonian circuit to mini- 
mize the total distance traveled for an arbitrary graph with n vertices in which each edge is 
marked with a distance. One way to solve the general problem is to use the method of Exam- 
ple 10.1.10: Write down all Hamiltonian circuits starting and ending at a particular vertex, 
compute the total distance for each, and pick one for which this total is minimal. However, 
even for medium-sized values of n this method is impractical. For a complete graph with 
30 vertices, it would be necessary to check (29!)/2 = 4.42 x 10°° Hamiltonian circuits that 
start and end at a particular vertex. Even if each circuit could be found and its total distance 
computed in just one nanosecond, it would require approximately 1.4 Xx io” years to finish 
the computation. At present, there is no known algorithm for solving the general traveling 
salesman problem that is more efficient. However, there are efficient algorithms that find 
“pretty good” solutions—that is, circuits that, while not necessarily having the least possible 
total distances, have smaller total distances than most other Hamiltonian circuits. 
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Answers to Test Yourself questions are located at the end 3. Removing an edge from a circuit in a graph does 
of each section. not 
1. Let G be a graph and let v and w be vertices in G. 4. An Euler circuit in a graph is 
(a) A walk from v to w is 5. A graph has a Euler circuit if, and only if, l 
b) A trail from v to w is ; . : : 
(o) i 6. Given vertices v and w in a graph, there is an 
© A path from v to w is __. Euler trail from v to w if, and only if, 
(d) A closed walk is ____. : ; EET ; 
Tanu 7. A Hamiltonian circuit in a graph is 
(e) A circuit is —— 
(f) A simple circuit is 8. Ifa graph G has a Hamiltonian circuit, then G has 
.. : a subgraph H with the following properties: , 
(g) A trivial walk is : And 
(h) Vertices v and w are connected if, and only ; ; er 
if 9. A traveling salesman problem involves finding 
a that minimizes the total distance traveled for 
2. A graph is connected if, any only if, . a graph in which each edge is marked with a distance. 
EXERCISE SET 10.1° 
1. In the graph below, determine whether the follow- a. Can this walk be written unambiguously as 
ing walks are trails, paths, closed walks, circuits, V1V2v,? Why? 
simple circuits, or just walks. b. Can this walk be written unambiguously as 
a. V9e{V1E10V5C9V2E2V1 b. V4e7V2€9V5€10V1€3V2€9V5 eez? Why? 
Cc d. : i 
v2 V5SV2V3V4V4V5 4. Consider the following graph. 
€. V9V3V4V5V2V4V3V2 f. es€g€1023 
e2 
7 6% 5 
v N My v4 
e4 
a. How many paths are there from v; to v4? 
2. In the graph below, determine whether the follow- b. How many trails are there from v; to val 
ing walks are trails, paths, closed walks, circuits, c. How many walks are there from v; to v4? 
simple circuits, or just walks. 5. Consider the following graph. 
a. V1e2V2€3V3€4V4E5V2€2V1€1V0 b. V2V3V4V5V2 
C. V4V2V3V4V5V2V4 d. V2V1V5V2V3V4V2 ĉi 
e. VoV5V2V3V4V2V] f. V5V4V2V1 Pe D es 
<i. 
e4 
a. How many paths are there from a to c? 
b. How many trails are there from a to c? 
c. How many walks are there from a to c? 
6. An edge whose removal disconnects the graph 


and consider the walk vje,v2e2v). 


of which it is a part is called a bridge. Find all 
bridges for each of the graphs at the top of the 
next page. 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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a v v b. vo uy v does not have an Euler circuit. Give another ex- 
ample of a graph satisfying these conditions. 


¥3 v3 11. Is it possible for a citizen of Königsberg to make 


a tour of the city and cross each bridge exactly 

v5 "4 v6 U5 twice? (See Figure 10.1.1.) Explain. 

Determine which of the graphs in 12-17 have Euler circuits. 
If the graph does not have an Euler circuit, explain why 
not. If it does have an Euler circuit, describe one. 


Vio 
12. Vy 13. vo 
U9 
ey eq \ \ vı 
7. Given any positive integer n, (a) find a connected s 
graph with n edges such that removal of just one K v3 is 
edge disconnects the graph; (b) find a connected A es k %3 
graph with n edges that cannot be disconnected by $ 
the removal of any single edge. vy UG 
8. Find the number of connected components for 14. a 15. t T 
each of the following graphs. 
i AE W $ 
b E i b s w 
° < 
a ê f h aN P r Z y Fd 
f 
d 
b. ‘3 e d 
i ” 16. vi 17. 4 B 
z x vo v 
C D 
y Us V3 
c. d. J 
v4 E F 
E v- UV- . . . 
: 2 18. Is it possible to take a walk around the city whose 
f DA map is shown below, starting and ending at the 
same point and crossing each bridge exactly once? 
If so, how can this be done? 
9. Each of (a)—() describes a graph. In each case an- = 
swer yes, no, or not necessarily to this question: Does E 
the graph have an Euler circuit? Justify your answers. : 
a. Gis a connected graph with five vertices of £ 
degrees 2, 2, 3, 3, and 4. E 
b. G is a connected graph with five vertices of H} 
degrees 2, 2, 4, 4, and 6. : 
c. Gis a graph with five vertices of degrees 2, 2, : 
4, 4, and 6. : 
10. The solution for Example 10.1.6 shows a graph 5 


for which every vertex has even degree but which 
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For each of the graphs in 19-21, determine whether 
there is an Euler trial from u to w. If there is, find such 


a trail. 
19. Up 20. p c d 
v7 vı 
| | e 
a uf 
u 
h w 
V6 v 
Us V3 g 
U4 
w 
21. v Uo 
Vi 
u 
V6 e 
V3 Vy 
w U4 
Us 


22. The following is a floor plan of a house. Is it 
possible to enter the house in room A, travel 
through every interior doorway of the house 
exactly once, and exit out of room E? If so, how 
can this be done? 


5 A B c 
| | | | 
| | T | 
- G D 
H 
F E T 


23. Find all subgraphs of each of the following graphs. 


a. v b. e v 


Vi v Vv 
vi 0 1 3 
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Definition: If G is a simple graph, the complement of G, 
denoted G’, is obtained as follows: The vertex set of G’ 
is identical to the vertex set of G. However, two distinct 
vertices v and w of G” are connected by an edge if, and 
only if, v and w are not connected by an edge in G. For 
example, if G is the graph 


v U3 


then G” is 


UD 
U4 
U2 

e 
U4 


24. Find the complement of each of the following 


graphs. 


a. Vy b. v e———e v, 


Ui U3 


U4 


v3 


U4 


25. a. Find the complement of the graph K4, the com- 


plete graph on four vertices. 
b. Find the complement of the graph K3 2, the 
complete bipartite graph on (3, 2) vertices. 


26. Suppose that in a group of five people A, B, C, D, 


and E the following pairs of people are acquainted 

with each other. 

A and C, A and D, B and C, C and D, C and E. 

a. Draw a graph to represent this situation. 

b. Draw a graph that illustrates who among these 
five people are not acquainted. That is, draw 
an edge between two people if, and only if, 
they are not acquainted. 


H 27. Let G be a simple graph with n vertices. What is 


the relation between the number of edges of G and 
the number of edges of the complement G’? 


28. Show that at a party with at least two people, there 


are at least two mutual acquaintances or at least 
two mutual strangers. 
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Find Hamiltonian circuits for each of the graphs in 29 and30. H 38. Give two examples of graphs that have Euler cir- 
30 cuits but not Hamiltonian circuits. 


a 
H 39. Give two examples of graphs that have Hamilto- 
nian circuits but not Euler circuits. 
b 
H 40. Give two examples of graphs that have circuits 
E that are both Euler circuits and Hamiltonian 
circuits. 
3 k c 
f H 41. Give two examples of graphs that have Euler 
circuits and Hamiltonian circuits that are not the 
l same. 
U4 J 4 d 


42. A traveler in Europe wants to visit each of the 
cities shown on the map exactly once, starting 
and ending in Brussels. The distance (in kilome- 


29. 


F S 

e e 

S Ss 
< 


Show that none of the graphs in 31-33 has a Hamiltonian 


circuit. 

ters) between each pair of cities is given in the 
H31. b d 32. a c 3 table. Find a Hamiltonian circuit that minimizes 
; oe the total distance traveled. (Use the map to 
č narrow the possible circuits down to just a few. 
a £ = Then use the table to find the total distance for 
cat each of those.) 
g 
8 f j h 
33. 


B 
N 
G D 


In 34-37, find Hamiltonian circuits for those graphs that 
have them. Explain why the other graphs do not. 


è Berlin 


° © Düsseldorf 
Brussels 


e Luxembourg 


H 34. a b 35. 


e Munich 


Berlin Brussels Düsseldorf Luxembourg Munich 


Brussels 783 
Diisseldorf 564 223 


b 
C 
e 
z 
vı b 
Luxembourg 764 219 224 
Pa Munich 585 771 613 517 
Up v s p Paris 1,057 308 497 375 832 
d 


43. a. Prove that if a walk in a graph contains a re- 
peated edge, then the walk contains a repeated 
vertex. 
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44. 


45. 


46. 


47. 


H 48. 


49. 


50. 


ANSWERS FOR TEST YOURSELF 


b. Explain how it follows from part (a) that any 
walk with no repeated vertex has no repeated 
edge. 


Prove Lemma 10.1.1(a): If G is a connected graph, 
then any two distinct vertices of G can be con- 
nected by a path. (You may use the result stated in 
exercise 43.) 


Prove Lemma 10.1.1(b): If vertices v and w are 
part of a circuit in a graph G and one edge is 
removed from the circuit, then there still exists a 
trail from v to w in G. 


Draw a picture to illustrate Lemma 10.1.1(c): If 

a graph G is connected and G contains a circuit, 
then an edge of the circuit can be removed without 
disconnecting G. 


Prove that if there is a trail in a graph G from a ver- 
tex v to a vertex w, then there is a trail from w to v. 


If a graph contains a circuit that starts and ends at 
a vertex v, does the graph contain a simple circuit 
that starts and ends at v? Why? 


Prove that if there is a circuit in a graph that starts 
and ends at a vertex v and if w is another vertex in 
the circuit, then there is a circuit in the graph that 
starts and ends at w. 


Let G be a connected graph, and let C be any cir- 
cuit in G that does not contain every vertex of C. 
Let G” be the subgraph obtained by removing all 
the edges of C from G and also any vertices that 
become isolated when the edges of C are removed. 
Prove that there exists a vertex v such that v is in 
both C and G”. 


51. 


52. 
53. 


* 54. 


* 55. 


*H 56. 


57. 
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Prove that any graph with an Euler circuit is con- 
nected. 


Prove Corollary 10.1.5. 


For what values of n does the complete graph 
K,, with n vertices have (a) an Euler circuit? (b) a 
Hamiltonian circuit? Justify your answers. 


For what values of m and n does the complete 
bipartite graph on (m, n) vertices have (a) an Euler 
circuit? (b) a Hamiltonian circuit? Justify your 
answers. 


What is the maximum number of edges a simple 
disconnected graph with n vertices can have? 
Prove your answer. 


a. Prove that if G is any bipartite graph, then ev- 
ery circuit in G has an even number of edges. 

b. Prove that if G is any graph with at least two 
vertices and if G does not have a circuit with 
an odd number of edges, then G is bipartite. 


An alternative proof for Theorem 10.1.3 has 
the following outline. Suppose G is a con- 
nected graph in which every vertex has even 
degree. Suppose the path C: vye,v2@2V3.. . CnVn+1 
has maximum length in G. That is, C has at 
least as many vertices and edges as any other 
path in G. First derive a contradiction from 

the assumption that v; # v,. Next let H be the 
subgraph of G that contains all the vertices and 
edges in C. Then derive a contraction from the 
assumption that H # G. Show that H contains 
every vertex of G, and show that H contains 
every edge of G. 


1. (a) a finite alternating sequence of adjacent vertices and 


there is a walk from one to the other 


3. disconnect the 


edges of G (b) a walk that does not contain a repeated graph 4. a circuit that contains every vertex and every edge 


edge (c) atrail that does not contain a repeated vertex of the graph 5. the graph is connected, and every vertex 


(d) a walk that starts and ends at the same vertex (e)a has positive, even degree 6. the graph is connected, v and 
closed walk that contains at least one edge and does not w have odd degree, and all other vertices have positive even 
(f) a circuit that does not have 
(g)a 
(h) there is 


2. given any two vertices in the graph, 


contain a repeated edge degree 7. a simple circuit that includes every vertex of the 


any repeated vertex other than the first and the last graph 8. H contains every vertex of G; H is connected; 
walk consisting of a single vertex and no edge H has the same number of edges as vertices; every vertex of 


a walk from v to w H has degree 2 9. Hamiltonian circuit 
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EYA Matrix Representations of Graphs 


Order and simplification are the first steps toward the mastery of a subject. 
—Thomas Mann, The Magic Mountain, 1924 


How can graphs be represented inside a computer? It happens that all the information 
needed to specify a graph can be conveyed by a structure called a matrix, and matrices 
(matrices is the plural of matrix) are easy to represent inside computers. This section con- 
tains some basic definitions about matrices and matrix operations, a description of the 
relation between graphs and matrices, and some applications. 


Matrices 


Matrices are two-dimensional analogues of sequences. They are also called two-dimensional 
arrays. 


Definition 


An m X n (read “m by n”) matrix A over a set S is a rectangular array of elements 
of S arranged into m rows and n columns: 


e C212 tte? 1 y a Qin 
aa an 7" Qj ` An 
A= 0 i 3 ” l 
aj ai aay dij can Ain <— ith row of A 
amı Am2 ch amj aoe Amn 


jth column of A 


We write A = (a;;). 


The ith row of A is 
[an anp 7 ain] 
and the jth column of A is 
aij 
aj 
Amj 


The entry a;jin the ith row and jth column of A is called the ijth entry of A. An m X n 
matrix is said to have size m X n. If A and B are matrices, then A = B if, and only if, 
A and B have the same size and the corresponding entries of A and B are all equal; that is, 


aj =b; foreveryi=1,2,...,mandj=1,2,...,n. 
A matrix for which the numbers of rows and columns are equal is called a square 
matrix. If A is a square matrix of size n X n, then the main diagonal of A consists of all 


the entries a11, 422, . - - , Ann 
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ayı 4g AG Ain 
an 4920 ` Au ` Ahn 
ai an Co Qu din 
Qn Qn2 at ni ae Ann 
% 


main diagonal of A 


See wae Matrix Terminology 


The following is a3 X 3 matrix over the set of integers. 


1 0 -3 
A=| 4 -1 5 
-2 2 0 


a. What is a3, the entry in row 2, column 3? 
b. What is the second column of A? 


c. What are the entries in the main diagonal of A? 


Solution 
0 

a. a3 =5 b. | -1 c. 1, —1, and 0 | 
2 


Matrices and Directed Graphs 


Consider the directed graph shown in Figure 10.2.1. This graph can be represented by the 
matrix A = (q;;) for which a;j = the number of arrows from v; to v;, for every i = 1, 2, 3 
and j = 1, 2, 3. Thus a; = 1 because there is one arrow from v; to vj; a}. = 0 because 
there is no arrow from vy; to v2, a23 = 2 because there are two arrows from v, to v3, and so 
forth. A is called the adjacency matrix of the directed graph. For convenient reference, the 
rows and columns of A are often labeled with the vertices of the graph G. 


ei e3 
vY uù n% 
&2 
z a „fı 0 0 
e4 A= v 1 1 2 
es es | kk 0 0 
U3 
Directed Graph G Adjacency Matrix of G 
(a) (b) 


FIGURE 10.2.1 A Directed Graph and Its Adjacency Matrix 


Definition 


Let G be a directed graph with ordered vertices v, v2, . . . , V,. The adjacency matrix 
of G is the n X n matrix A = (a;;) over the set of nonnegative integers such that 


@,, — the number of arrows from v; to vj forall, 7 — 1, ®oo06 nie 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


700 CHAPTER10 THEORY OF GRAPHS AND TREES 


Note that nonzero entries along the main diagonal of an adjacency matrix indicate the 
presence of loops, and off-diagonal entries larger than 1 correspond to parallel edges. 
Moreover, if the vertices of a directed graph are reordered, then the entries in the rows and 
columns of the corresponding adjacency matrix are moved around. 


The Adjacency Matrix of a Graph 


The two directed graphs shown below differ only in the ordering of their vertices. Find 
their adjacency matrices. 


e5 e5 
U] œ, V3 ©. 
ey e3 v ees vi 
e e 
1 A 1 A 
V3 v 


(a) (b) 


Solution Since both graphs have three vertices, both adjacency matrices are 3 X 3 ma- 
trices. For (a), all entries in the first row are 0 since there are no arrows from v; to any other 
vertex. For (b), the first two entries in the first row are | and the third entry is 0 since from 
vı there are single arrows to v; and to vz and no arrows to v3. Continuing the analysis in this 
way, you obtain the following two adjacency matrices: 


v vn V3 v vn V3 
77/0 0 0 vuyl 1 0 
A=»]|O 1 1 A="]1 0 2 
»|2 1 0 310 0 0 

(a) (b) E 


If you are given a square matrix with nonnegative integer entries, you can construct a 
directed graph with that matrix as its adjacency matrix. However, the matrix does not tell 
you how to label the edges, so the directed graph is not uniquely determined. 


Obtaining a Directed Graph from a Matrix 


Let 
0 1 1 0 
1 1 2 
A= j 
0 0 1 1 
2 1 0 0 


Draw a directed graph that has A as its adjacency matrix. 


Solution Let G be the graph corresponding to A, and let v4, vz, v3, and v, be the vertices of 
G. Label A across the top and down the left side with these vertex names, as shown below. 


Vy va Ys va 
“ToO. 1 1 0 
»į|1 1 0 2 

A= 
“0 0 1 1 


“12 1 0 0 


Then, for instance, the 2 in the fourth row and the first column means that there are two 
arrows from v4 to vı. The O in the first row and the fourth column means that there is no 
arrow from v; to vy. A corresponding directed graph is shown on the next page (without 
edge labels because the matrix does not determine those). 
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Ui 


v 


U3 


U4 


Matrices and Undirected Graphs 


Once you know how to associate a matrix with a directed graph, the definition of the ma- 
trix corresponding to an undirected graph should seem natural to you. As before, you must 
order the vertices of the graph, but in this case you simply set the ijth entry of the adja- 
cency matrix equal to the number of edges connecting the ith and jth vertices of the graph. 


Definition 


Let G be an undirected graph with ordered vertices v4, v2,..., Vn. The adjacency ma- 
trix of G is the n X n matrix A = (a;;) over the set of nonnegative integers such that 


a;; = the number of edges connecting v; and v; 


foneverny mandi = Me ooo 5 1 


Finding the Adjacency Matrix of a Graph 


Find the adjacency matrix for the graph G shown below. 


Solution 
Vy Vo n Wg 
Yi 0 1 0 1 
vt) 12 1 
A= 
210 2 0 0 
v“%~L1 1 0 1 E 


The entries of A satisfy the condition, a;; = a;;, for every i, j = 1,2,...,. This implies 
that the appearance of A remains the same if the entries of A are flipped across its main 
diagonal. A matrix, like A, with this property is said to be symmetric. 


Definition 


An n X n square matrix A = (a;;) is called symmetric if, and only if, for every 
Tandi Iso oagllh 
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Serums Symmetric Matrices 


Which of the following matrices are symmetric? 
0 1 2 


1 2 
a. f d b. | 1 1 0 c. A : o 
2. O -3 


Solution Only (b) is symmetric. In (a) the entry in the first row and the second column 
differs from the entry in the second row and the first column; the matrix in (c) is not even 
square. a 


It is easy to see that the matrix of any undirected graph is symmetric since it is always 
the case that the number of edges joining v; and v; equals the number of edges joining v; 
and v; for every i and j = 1,2,...,7. 


Matrices and Connected Components 
Consider a graph G, as shown below, that consists of several connected components. 


ei 


v4 % 
Vy 
v ez es e7 eg 
ez eq %5 a 
V3 &6 
The adjacency matrix of G is 
1 0 1:0 0:0 0 
0 02.0 0:0 0 
th ces Die cas Og Onc 
A=!]0 0 0:0 1.0 0 
0 0 0 1 1,0 0 
0 0 0:0 0:0 2 
0 0 0:0 0.2 0 


As you can see, A consists of square matrix blocks (of different sizes) down its diagonal 
and blocks of 0’s everywhere else. The reason is that vertices in each connected component 
share no edges with vertices in other connected components. For instance, since vı, v2, and 
v3 Share no edges with v4, vs, V6, Or v7, all entries in the top three rows to the right of the 
third column are 0 and all entries in the left three columns below the third row are also 0. 
Sometimes matrices whose entries are all 0’s are themselves denoted 0. If this convention 


is followed here, A is written as 

1 0 1 

0 0 2 

1 2 0 

A= 0 1 

CO ilO 

C2 POV lS 5 
2 0 
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The previous reasoning can be generalized to prove the following theorem: 


Theorem 10.2.1 


Let G be a graph with connected components G;, G2, . . . , Gg. If there are n; vertices 
in each connected component G; and these vertices are numbered consecutively, 
then the adjacency matrix of G has the form 


w0 0O = 00 
On 0O a 00 
© O 4 = 00 
© 0 © = O A 


where each A;is the n; X n; adjacency matrix of G; for every i = 1, 2,...,k, and the 
O’s represent matrices whose entries are all 0. 


Matrix Multiplication 


Matrix multiplication is an enormously useful operation that arises in many contexts, in- 
cluding the investigation of walks in graphs. Although matrix multiplication can be de- 
fined in quite abstract settings, the definition for matrices whose entries are real numbers 
will be sufficient for our applications. The product of two matrices is built up of scalar or 
dot products of their individual rows and columns. 


Definition 


Suppose that all entries in matrices A and B are real numbers. If the number of ele- 
ment, n, in the ith row of A equals the number of elements in the jth column of B, 
then the scalar product or dot product of the ith row of A and jth column of B is 
the real number obtained as follows: 


bi; 


J 


bo 


[ay ain °° Gin | = Glyn a °°? Pahl) 


nj* 
b 


nj 


Seay Multiplying a Row anda Column 
=] 


2 
[3 0 -1 2] = 3-(—1)+0-2+(-1)-3+2°0 


=—-3+0-3+0=-—-6 a 


More generally, if A and B are matrices whose entries are real numbers and if A and B 
have compatible sizes in the sense that the number of columns of A equals the number of 
rows of B, then the product AB is defined. It is the matrix whose ijth entry is the scalar 
product of the ith row of A times the jth column of B, for all possible values of i and j. 
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Definition 
Let A = (a;j) be an m X k matrix and B = (b,;) ak X n matrix with real entries. The 
(matrix) product of A times B, denoted AB, is that matrix (c;;) defined as follows: 
41 42 ` AIk bı Diz Din 
42, 490 e by, by Don 
ai 42 dix | 
Am Am2 Amk bu bp Dkn 
Gp = ey = Gp 
Ga e @ e O 
Cil Ci2 aia Cij ses Cin 
Cm1 Cm2 oe) Cmj i Cmn 
where 
k 
Cy = Ady anba t+ °° + iby = X rbr 
r=1 
foneachn = Mh Bo eoc mandi = WP sco git 


Computing a Matrix Product 


4 3 
Let A = È h l and B = | 2 | Compute AB. 
=h =] 


Solution A has size 2 X 3 and B has size 3 X 2, so the number of columns of A equals 
the number of rows of B and the matrix product of A and B can be computed. Then 


4 3 
| 2 (0) i 2 ee e ie 
—1 1 0 =) = C21 C22 


where 


c1 = 2°4+0-2+3-(—2) =2 

Cy. = 2°3+0°:24+3-(-1) = 3 
2 0 3 

cn = (-1)-4+1:2+0-(-2) = -2 | — 


2 0 3 
c» = (-1):3+1-2+0-(-1) = -1 | 
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AB = | : | 

ot en | Po 

Matrix multiplication is both similar to and different from multiplication of real num- 

bers. One difference is that although the product of any two numbers can be formed, only 

matrices with compatible sizes can be multiplied. For example, if A is a 3 X 2 matrix and 

Bisa2 X 4 matrix, then AB can be computed because the number of columns of A equals 

the number of rows of B. But BA does not exist because B has 4 columns, A has 3 rows, 
and 4 # 3. 


Another difference is that multiplication of real numbers is commutative (for all real 
numbers a and b, ab = ba), whereas matrix multiplication is not. For instance, 


Jo to =f a) fo alo fo a 


On the other hand, both real number and matrix multiplications are associative: 
(ab)c = a(bc), for all elements a, b, and c for which the products are defined. This is 
proved in Example 10.2.8 for products of 2 X 2 matrices. Additional exploration of matrix 
multiplication is offered in the exercises at the end of this section. 


Hence 


Associativity of Matrix Multiplication for 2 x 2 Matrices 


Prove that if A, B, and C are 2 X 2 matrices over the set of real numbers, then 
(AB)C = A(BC),. 


Solution Suppose A = (a;;), B = (b;;), and C = (c;;) are particular but arbitrarily cho- 
sen 2 X 2 matrices with real entries. Since the numbers of rows and columns are all the 
same, AB, BC, (AB)C, and A(BC) are defined. Let AB = (d;;) and BC = (e;;). Then for 
each integer i = 1, 2 and j = 1, 2, 


2 
the ijth entry of (AB)C = }'dj,c,; hy deletion orihe 
r=1 product of AB and C 
= dic; + dnp; by definition of È 
2 2 ete 
by definition of the 
= & arbre = (> Gir bale product of A and B 
r=1 r=1 


= (aibi + iby )C i; + (ajD 12 + jnby)C9; by definition of > 
= aby 101) + Aigb21C1j + iD 12Coj + Ainb22C;- 


Similarly, the ijth entry of A(BC) is 


2 
(A(BC));; = > Gir yj 


r=1 


= aye ij + Qj2€2; 
2 2 
= al > bes) + a > bse) 
r=1 r=1 
= aj(by C1; + Dy2Ca{) + albai; + b22Coj) 
= 4D, 101; + D1 2Caj + Ajgb21C1j + anba; 


= ayb1 Cy + Gigby Cy; + GD 2Caj + abaca; 
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Comparing the results of the two computations shows that for each i and j, 
the ijth entry of (AB)C = the ijth entry of A(BC). 


Since all corresponding entries are equal, (AB)C = A(BC), as was to be shown. E 


As far as multiplicative identities are concerned, there are both similarities and differ- 
ences between real numbers and matrices. You know that the number 1 acts as a multi- 
plicative identity for products of real numbers. It turns out that there are certain matrices, 
called identity matrices, that act as multiplicative identities for certain matrix products. 
For instance, mentally perform the following matrix multiplications to check that for any 
real numbers a, b, c, d, e, f, g, h, and i, 


PEIE: 


and 


These computations show that F : acts as an identity on the left side for multiplication 
1 
0 
0 


0 0 
with 2 X 3 matrices and that 1 0] acts as an identity on the left side for multiplica- 
0 1l 


tion with 3 X 3 matrices. Note that lo | cannot act as an identity on the right side for 


multiplication with 2 X 3 matrices because the sizes are not compatible. 


Definition 


For each positive integer n, the n X n identity matrix, denoted I, = (6;;) or just I 
(if the size of the matrix is obvious from context), is the n X n matrix in which all 
the entries in the main diagonal are l’s and all other entries are 0’s. In other words, 


tee 
=k 7 n for every ty = U2 T: 


akg-images/akg-images 


Leopold Kronecker oe . . 
(1823-1891) The German mathematician Leopold Kronecker introduced the symbol ô;; to make matrix 


computations more convenient. In his honor, this symbol is called the Kronecker delta. 


An Identity Matrix Acts as an Identity 


Prove that if A is any m X n matrix and I is the n X n identity matrix, then AI = A. 


Proof: 

Let A be any n X n matrix and let a;; be the ijth entry of A for each integer i = 1,2,...,m 
and j = 1,2,...,. Consider the product AI, where I is the n X n identity matrix. Observe 
that 
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ay] a\2 a Gin 1 0 ee 0 a1 a12 oe Ain 
aj a» ° an 0 loc: 0 [|an an >o an 
Ami Am2 ae Amn 0 0 = i Am) Am2 aa Amn 
because 
n 
the ijth entry of AI = $ a;,ô,; by definition of 1 
r=l 
= A181; + apðyj + °° by definition of È 
tajt G26, 
= 49); since 6;; = 0 whenever k # j 
= aij since 6;; = 1 


the ijth entry of A. 
Thus AI = A, as was to be shown. a 


In exercise 14 at the end of this section you are asked to show that if I is the m X m identity 
matrix, then IA = A. 


There are also similarities and differences between real numbers and matrices with 
respect to the computation of powers. Any number can be raised to a nonnegative integer 
power, but a matrix can be multiplied by itself only if it has the same number of rows as 
columns. As for real numbers, however, the definition of matrix powers is recursive. Just as 
any number to the zeroth power is defined to be 1, so any n X n matrix to the zeroth power 
is defined to be the n X n identity matrix. The nth power of ann X n matrix A is defined 
to be the product of A with its (n — 1)st power. 


Definition 


For any n X n matrix A, the powers of A are defined as follows: 


A° =I where Lis then Xn identity matrix 


A" = AA” ' for every integer n = 1. 


Sen away Powers of a Matrix 


1 2 
Let A = f of Compute A°, Al, A?, and A°. 


0 
Solution A’ = the 2 X 2 identity matrix = [o 1 


Pape geal! 2 22) 2 
B -= 2 ol2 ofl |2 4 
sali 215 2]-[9 a 
7 ~|2 ol2 4] f10 4 
Counting Walks of Length N 


A walk in a graph consists of an alternating sequence of vertices and edges. If repeated 
edges are counted each time they occur, then the number of edges in the sequence is called 
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the length of the walk. For instance, the walk v.e33e4V22V2e3V3 has length 4 (counting e3 
twice). Consider the following graph G: 


ez 


vı 


U3 


How many distinct walks of length 2 connect v and v2? You can list the possibilities sys- 
tematically as follows: From v4, the first edge of the walk must go to some vertex of G: v}, 
v2, Or v3. There is one walk of length 2 from v; to v, that starts by going from v, to vı: 


v€ V1€1V2. 
There is one walk of length 2 from v, to v, that starts by going from v, to vz: 
V2€2V2€2V2. 
And there are four walks of length 2 from v, to vz that start by going from v3 to v3: 
V2€3V3€4V2 
V2€4V3E3V2 
V2€3V3E3V2 
V2€4V3€4V2. 
Thus the answer is six. 
The general question of finding the number of walks that have a given length and con- 


nect two particular vertices of a graph can easily be answered using matrix multiplication. 
Consider the adjacency matrix A of the graph G shown above: 


v v V3 
vu JO 1 0 
A=}1 1 2 
vy [0 2 0 
Compute A’ as follows: 
O 1 OjO 1 0 1 1 2 
1 1 2||1 1 2|=|1 6 2 
0 2 Oj[0 2 0 2 2 4 


Note that the entry in the second row and the second column is 6, which equals the number 
of walks of length 2 from v, to v2. This is no accident! To compute a2, you multiply the 
second row of A times the second column of A to obtain a sum of three terms: 


1 
[1 1 2]|1|=1-1+1:1+2-2. 
2 
Observe that 
; number of number of number of pairs 
the first term 
: = | edges from |:| edges from | = | of edges from 
of this sum 
vz to vy vı to vy v to vı and v; to vz 
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Now consider the ith term of this sum, for each i = 1, 2, and 3. It equals the number of 
edges from v, to v; times the number of edges from v; to v2. By the multiplication rule this 
equals the number of pairs of edges from v to v; and from v; back to v2. And this equals 
the number of walks of length 2 that start and end at v, and pass through v;. Since this 
analysis holds for each term of the sum for i = 1, 2, and 3, the sum as a whole equals the 
total number of walks of length 2 that start and end at v2: 
1-14+1-14+2:-2=14+1+4=6. 

More generally, if A is the adjacency matrix of a graph G, the ijth entry of LN equals the 
number of walks of length 2 connecting the ith vertex to the jth vertex of G. Even more 
generally, if n is any positive integer, the ijth entry of A” equals the number of walks of 
length n connecting the ith and the jth vertices of G. 


Theorem 10.2.2 
If G is a graph with vertices v4, v2, . . . , Vm and A is the adjacency matrix of G, then 
for each positive integer n and for all integers i, j = 1,2,...,m, 


the ijth entry of A” = the number of walks of length n from v; to v;. 


Proof (by mathematical induction): 
Suppose G is a graph with vertices v1, v2, . . . , Vm and A is the adjacency matrix of 
G. Let P(n) be the sentence 


For all integers i, j = 1,2,....,m, <— P(n) 
the ijth entry of A” = the number of walks of length n from v; to v;. 


We will show that P(n) is true for every integer n = 1. 
Show that P(1) is true: 
The ijth entry of A’ = the ijth entry of A because A! = A 


= the number of edges by definition of adjacency matrix 
connecting v; to vj 


= the number of walks of because a walk of length 1 
length 1 from v; to Vj contains a single edge. 


Show that for every integer k with k = 1, if P (k) is true then P(k +1) is true: 
Let k be any integer with k = 1, and suppose that 


For all integers 7, j = 1, 2,...,m, <— P(k) 
the ijth entry of A‘ = the number of walks of length k from v; to v; inductive 
hypothesis 


We must show that 
For all integers 7, j = 1,2,...,m, <— P(k+1) 
the ijth entry of A‘*! = the number of walks of length k + 1 from v; to v;. 
Let A = (a;;) and A‘ = (b;). Since A‘*! = AAS, the ijth entry of A‘*! is ob- 
tained by multiplying the ith row of A by the jth column of A‘: 
the ijth entry of A+! = apb + apb + >*t + dimbmj 10.2.1 
for every i, j = 1,2,...,m. Now consider the individual terms of this sum: a; is the 


number of edges from v; to vı; and, by the inductive hypothesis, bı; is the number of 
walks of length k from v; to v;. Now any edge from v; to vı can be joined with any 


(continued on page 710) 
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walk of length k from v; to v; to create a walk of length k + 1 from v; to v; with vı as 
its second vertex. Thus, by the multiplication rule, 


the number of walks of length k + 1 from 
ajbi; = $ 
i v; to v; that have v, as their second vertex 


More generally, for each integer r = 1, 2,...,m, 


p the number of walks of length k + 1 from 
i FE y = ; 
"l | y; to v; that have v, as their second vertex 


Because every walk of length k+ 1 from v; to v; must have one of the vertices 
V1, V2, . ++, Vm as its second vertex, the total number of walks of length k + 1 from v; 


to v; equals the sum in (10.2.1), which equals the ijth entry of A‘ Hence 
the ijth entry of A‘! = the number of walks of length k + 1 from v; to v; 


[as was to be shown]. 
[Since both the basis step and the inductive step have been proved, the sentence P(n) is 
true for every integer n = 1.] 


TEST YOURSELF 


1. In the adjacency matrix for a directed graph, the 4. The ijth entry in the product of two matrices A 
entry in the ith row and jth column is and B is obtained by multiplying row of A 
by row of B. 


2. In the adjacency matrix for an undirected graph, 
the entry in the ith row and jth column is ——. 5. Inann X n identity matrix, the entries on the 


b's ee main diagonal are all and the off-diagonal 
3. Ann X n square matrix is called symmetric if, n aag z 
: ; : ; entries are all 
and only if, for all integers i and j from 1 to n, the 
entry in row and column equals the 6. If Gis a graph with vertices vj, v2, . . . , Vm and A 
entry in row and column ____. is the adjacency matrix of G, then for each positive 


integer n and for all integers i and j with i, j = 1, 
2,...,m, the ijth entry of A” = __. 


EXERCISE SET 10.2 


1. Find real numbers a, b, and c such that the follow- 


a. e Gal 
ing are true. vi Vy Uy 
ez 2 
P a+b a-c _ 1 0 é, A es |e, 
c b-a =]. 3 e 
v ês 
b b alk 3| “4 E "3 
c~a 2b-a 1 -2 3. Find directed graphs that have the following adja- 


cency matrices: 


2. Find the adjacency matrices for the following ti 


directed graphs. 


O O O Fr 
O = OỌ ŅN 
orno 
oN OF 
=. = ke © 
ooo Oo 


0 1 
2 1 
1 1 
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4. Find adjacency matrices for the following (undi- 
rected) graphs. 


a v V b 
e3 
(2 
e e4 5 
U4 ts 


c. K4, the complete graph on four vertices 
d. K,3, the complete bipartite graph on (2, 3) 
vertices 


5. Find graphs that have the following adjacency 
matrices. 


1 0 1 0 2 0 
a |O 1 2 b. |2 1 0 
1 2 0 0 0 1 


6. The following are adjacency matrices for graphs. 
In each case determine whether the graph is con- 
nected by analyzing the matrix without drawing 


the graph. 
2 
a. 2000 
a |1 1 0 blo oii 
1 
ua 0 0 1 1 


7. Suppose that for every positive integer i, all 
the entries in the ith row and ith column of the 
adjacency matrix of a graph are 0. What can you 
conclude about the graph? 
8. Find each of the following products. 
1 1 
a. [2 -of b. [4 -1 7]|2 
0 
9. Find each of the following products. 


ef allo 2 a 


c. Be 3] 
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1 1 1 =2 
10. LetA = | |s = | J and 


For each of the following, determine whether 
the indicated product exists, and compute it if it 
does. 

a. AB b. BA c. A? d. BC e. CB 


f. B? g. B? h. Œ i. AC j. CA 


11. Give an example different from that in the 
text to show that matrix multiplication is not 
commutative. That is, find 2 X 2 matrices A 
and B such that AB and BA both exist but 
AB + BA. 


12. Let O denote the matrix i of Find 2 x 2 


matrices A and B such that A # O and B + O but 
AB = O. 


13. Let O denote the matrix f 4 Find 2 x 2 


matrices A and B such that A # B, B # O, and 
AB + O, but BA = O. 


In 14-18, assume the entries of all matrices are real 
numbers. 


H 14. Prove that if I is the m X m identity matrix and A 
is any m X n matrix, then IA = A. 


15. Prove that if A is an m X m symmetric matrix, 
then A? is symmetric. 


16. Prove that matrix multiplication is associative: If 
A, B, and C are any m X k, k X r, and r X n ma- 
trices, respectively, then (AB)C = A(BC). (Hint: 
Summation notation is helpful.) 


17. Use mathematical induction and the result of 
exercise 16 to prove that if A is any m X m matrix, 
then A”A = AA” for each integer n = 1. 


18. Use mathematical induction to prove that if A is 
an m X m symmetric matrix, then for any integer 
n= 1, A” is also symmetric. 
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20. 


21. 
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1 1 2 
LetA=]1 0 1|.Find A? and A°. 
2 1 0 


. Let G be the graph with vertices v4, v2, and v3 


and with A as its adjacency matrix. Use the 
answers to part (a) to find the number of walks 
of length 2 from v; to v3 and the number of 
walks of length 3 from v, to v3. Do not draw G 
to solve this problem. 


. Examine the calculations you performed in an- 


swering part (a) to find five walks of length 2 
from v; to v3. Then draw G and find the walks 
by visual inspection. 


The following is an adjacency matrix for a graph: 


vi v2 V3 V4 


oo 1 1 0 
eli 0 2 4 
ait 2-0.1 
„Loi 1 4 


Answer the following questions by examining the 
matrix and its powers only, not by drawing the graph: 


b. 


How many walks of length 2 are there from v) 
to v3? 
How many walks of length 2 are there from v, 
to v4? 
How many walks of length 3 are there from v; 
to v4? 
How many walks of length 3 are there from v, 
to v3? 


Let A be the adjacency matrix for K}, the complete 
graph on three vertices. Use mathematical induc- 
tion to prove that for each positive integer n, all the 
entries along the main diagonal of A” are equal to 
each other and all the entries that do not lie along 
the main diagonal are equal to each other. 


22. a. Draw a graph that has 


ANSWERS FOR TEST YOURSELF 


Ne COC So 
i) 
eno o CO 
CON KS & 
oo fF m. N 


as its adjacency matrix. Is this graph bipartite? 


Definition: Given an m X n matrix A whose ijth entry is 


denoted a 
entry is a 


ip the transpose of A is the matrix A‘ whose ijth 


i; ;, for each i = 1,2,...,mandj=1,2,...,n. 


H b. 


23. a. 


H b. 


Note that the first row of A becomes the first 
column of A’, the second row of A becomes the 
second column of A’, and so forth. For instance, 


0 1l 
0 2 1 
ifA = ,thenA'=]2 2 

1 2 3 1 3 


Show that a graph with n vertices is bipartite 
if, and only if, for some labeling of its vertices, 
its adjacency matrix has the form 


O A 
ea 
where A isak X (n — k) matrix for some integer 
k such that O < k < n, the top left O represents 
a k X k matrix all of whose entries are 0, A‘ 
is the transpose of A, and the bottom right O 


represents an (n— k) X (n—k) matrix all of 
whose entries are 0. 


Let G be a graph with n vertices, and let v and 
w be distinct vertices of G. Prove that if there 
is a walk from v to w, then there is a walk from 
v to w that has length less than or equal to 
n=l: 

If A = (a;;) and B = (b;;) are any m X n 
matrices, the matrix A + B is the m X n 
matrix whose ijth entry is a;; + b;; for each 
i=1,2,...,mandj=1,2,...,n.LetG 
be a graph with n vertices where n > 1, and 
let A be the adjacency matrix of G. Prove that 
G is connected if, and only if, every entry of 
A+A°+-+- +A”! is positive. 


1. the number of arrows from v; (the ith vertex) to v; (the 


jth vertex) 2. the number of edges connecting v; (the ith 
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vertex) and vj (the jth vertex) 3.ij;j;1 4.67 5.1;0 


6. the number of walks of length n from v; to v; 
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WEJ lsomorphisms of Graphs 


Thinking is a momentary dismissal of irrelevancies. —R. Buckminster Fuller, 1969 


The two drawings shown in Figure 10.3.1 both represent the same graph: Their vertex and 
edge sets are identical, and their edge-endpoint functions are the same. Call this graph G. 


Vv vi 
es ey ei 
e3 
Us v v3 U4 
g 
e4 ez es i 
ez 
uy 83 n Us v 
FIGURE 10.3.1 


Now consider the graph G’ represented in Figure 10.3.2. 


Vy 


ez ei 


FIGURE 10.3.2 


Observe that G” is a different graph from G (for instance, in G the endpoints of e, are vı and 
v2, whereas in G’ the endpoints of e; are vı and v3). Yet G” is certainly very similar to G. In 
fact, if the vertices and edges of G’ are relabeled by the functions shown in Figure 10.3.3, 
then G’ becomes the same as G. 


Vertices Vertices Edges Edges 
ofG of G' ofG of G' 


FIGURE 10.3.3 


Note that these relabeling functions are one-to-one and onto. 

Two graphs that are the same except for the labeling of their vertices and edges are 
called isomorphic. The word isomorphism comes from the Greek, meaning “same form.” 
Isomorphic graphs are those that have essentially the same form. 


Definition 


Let G and G’ be graphs with vertex sets V(G) and V(G’) and edge sets E(G) and 
E(G’'), respectively. G is isomorphic to G’ if, and only if, there exist one-to-one cor- 
respondences g: V(G) — V(G') and h: E(G) — E(G’) that preserve the edge-endpoint 
functions of G and G” in the sense that for each v E V(G) and e E E(G), 


vis anendpoint ofe «æ g(v) is an endpoint of h(e). 10B 
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In words, G is isomorphic to G’ if, and only if, the vertices and edges of G and G’ can 
be matched up by one-to-one, onto functions in such a way that the edges between cor- 
responding vertices correspond to each other. 

It is common in mathematics to identify isomorphic objects with each other. For in- 
stance, if we are given a graph G with five vertices, where each pair of vertices is con- 
nected by an edge, then we often identify G with Ks, saying that G is K; rather than that G 
is isomorphic to Ks. 


Showing That Two Graphs Are Isomorphic 


Show that the following two graphs are isomorphic. 


wi w3 
f fa 
fi Se fs 
fi fe 
ws w4 
G G' 


Solution To solve this problem, you must find functions g: VG) — V(G’) and 
h: E(G) — E(G’) such that for each v E V(G) and e E E(G), v is an endpoint of e if, and 
only if, g(v) is an endpoint of A(e). Setting up such functions is partly a matter of trial and er- 
ror and partly a matter of deduction. For instance, since e, and e; are parallel [have the same 
endpoints], h(e) and h(e3) must be parallel also. So h(e2) = fı and h(e3) = fo or hlez) = fr 
and h(e3) = fı. Also, the endpoints of e and e3 must correspond to the endpoints of f; and fz, 
and so g(v3) = w, and g(v4) = ws or g(v3) = ws and g(v4) = Wy. 

Similarly, since v; is the endpoint of four distinct edges (e4, e7, €s, and e4), g(v,) must 
also be the endpoint of four distinct edges [because every edge incident on g(v,) is the image 
under h of an edge incident on v; and h is one-to-one and onto]. But the only vertex in G’ that 
has four edges coming out of it is wz, and so g(v\) = w2. Now if g(v3) = wy, then since vı 
and v3 are endpoints of e; in G, g(v,) = wz and g(v3) = w; must be endpoints of A(e,) in G”. 
This implies that h(e,) = fs. 

By continuing in this way, possibly making some arbitrary choices as you go, you even- 
tually can find functions g and A to define the isomorphism between G and G”. One pair of 
functions (there are several) is the following: 


V(G) VG") E (G) 


E(G') 


It is not hard to show that graph isomorphism is an equivalence relation on a set of 
graphs; in other words, it is reflexive, symmetric, and transitive. 
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Note As a consequence 
of the symmetry property, 
you can simply say “G 
and G’ are isomorphic” 
instead of “G is isomor- 
phic to G’” or “G” is 
isomorphic to G.” 


Example 10.3.2 


10.3 ISOMORPHISMS OF GRAPHS 715 


Theorem 10.3.1 Graph Isomorphism Is an Equivalence Relation 


Let S be a set of graphs and let R be the relation of graph isomorphism on S. Then R 
is an equivalence relation on S. 


Proof: 
R is reflexive: Given any graph G in S, define a graph isomorphism from G to G by 
using the identity functions on the set of vertices and on the set of edges of G. 


R is symmetric: Given any graphs G and G” in S such that G is isomorphic to G’, we 
must show that G” is isomorphic to G. 


This is true because if g and h are vertex and edge correspondences from G to 
G” that preserve the edge-endpoint functions, then g! and h | are vertex and edge 
correspondences from G” to G that preserve the edge-endpoint functions. 


R is transitive: Given any graphs G, G’, and G” in S such that G is isomorphic to G” 
and G” is isomorphic to G”, we must show that G is isomorphic to G”. 


This follows from the fact that if g} and h; are vertex and edge correspondences 
from G to G’ that preserve the edge-endpoint functions of G and G’ and if g, and hy 
are vertex and edge correspondences from G’ to G” that preserve the edge-endpoint 
functions of G' and G”, then g,° gı and h° h; are vertex and edge correspondences 
from G to G” that preserve the edge-endpoint functions of G and G”. 


Finding Representatives of Isomorphism Classes 


Find all nonisomorphic graphs that have two vertices and two edges. In other words, find 
a collection of representative graphs with two vertices and two edges such that every graph 
with two vertices and two edges is isomorphic to one in the collection. 


Solution There are four nonisomorphic graphs that have two vertices and two edges. 
These can be drawn without vertex and edge labels because any two labelings give iso- 
morphic graphs. 


om GC. oy. GE 


(a) (b) (c) (d) 


To see that these four drawings show all the nonisomorphic graphs with two vertices 
and two edges, first check whether one of the edges joins the two vertices or not. If it does, 
there are two possibilities: The second edge can also join the two vertices (as in (a)) or 
it can be a loop incident on one of them (as in (b)—it makes no difference which vertex 
is chosen to have the loop because interchanging the two vertex labels gives isomorphic 
graphs). If neither edge joins the two vertices, then both edges are loops. In this case, there 
are only two possibilities: Either both loops are incident on the same vertex (as in (c)) or 
the two loops are incident on separate vertices (as in (d)). There are no other possibilities 
for placing the edges, so the listing is complete. E 


Now consider the question, “Is there a general method to determine whether graphs G 
and G’ are isomorphic?” In other words, is there some algorithm that will accept graphs 
G and G” as input and produce a statement as to whether they are isomorphic? In fact, 
there is such an algorithm. It consists of generating all one-to-one, onto functions from the 
set of vertices of G to the set of vertices of G’ and from the set of edges of G to the set of 
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edges of G’ and checking each pair to determine whether it preserves the edge-endpoint 
functions of G and G’. The problem with this algorithm is that it takes an unreasonably 
long time to perform, even on a high-speed computer. If G and G’ each have n vertices 
and m edges, the number of one-to-one correspondences from vertices to vertices is n! and 
the number of one-to-one correspondences from edges to edges is m!, so the total num- 
ber of pairs of functions to check is n!-m!. For instance, if m = n = 20, there would be 
20! -20! = 5.9 x 10° pairs to check. Assuming that each check takes just | nanosecond, 
the total time would be approximately 1.9 X 10 years! 

Unfortunately, there is no more efficient general method known for checking whether 
two graphs are isomorphic. However, there are some simple tests that can be used to show 
that certain pairs of graphs are not isomorphic. For instance, if two graphs are isomorphic, 
then they have the same number of vertices (because there is a one-to-one correspondence 
from the vertex set of one graph to the vertex set of the other). It follows that if you are given 
two graphs, one with 16 vertices and the other with 17, you can immediately conclude that 
the two are not isomorphic. More generally, a property that is preserved by graph isomor- 
phism is called an isomorphic invariant. For instance, “having 16 vertices” is an isomor- 
phic invariant: If one graph has 16 vertices, then so does any graph that is isomorphic to it. 


Definition 


A property P is called an invariant for graph isomorphism if, and only if, given any 
graphs G and G’, if G has property P and G” is isomorphic to G, then G’ has property P. 


Theorem 10.3.2 


Each of the following properties is an invariant for graph isomorphism, where n, m, 
and k are all nonnegative integers: 


1. has n vertices 6. has a simple circuit of length k 
2. has m edges 7. has m simple circuits of length k 
3. has a vertex of degree k 8. is connected 

4. has m vertices of degree k 9. has an Euler circuit 

5. has a circuit of length k 10. has a Hamiltonian circuit. 


Showing That Two Graphs Are Not Isomorphic 


Show that the following pairs of graphs are not isomorphic by finding an isomorphic in- 


variant that they do not share. 
l KD 
, 
G G 
' 
H H 
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Solution 


a. G has nine edges; G’ has only eight. 
b. H has a vertex of degree 4; H’ does not. E 


We prove part (3) of Theorem 10.3.2 and leave the proofs of the other parts as exercises. 


Proof of Theorem 10.3.2, Part (3) 


Prove that if G is a graph that has a vertex of degree k and G’ is isomorphic to G, then G’ 
has a vertex of degree k. 


Proof: 


Suppose G and G’ are isomorphic graphs and G has a vertex v of degree k, where k is a 
nonnegative integer. [We must show that G' has a vertex of degree k.] Since G and G” are iso- 
morphic, there are one-to-one, onto functions g and h from the vertices of G to the vertices 
of G' and from the edges of G to the edges of G’ that preserve the edge-endpoint functions 
in the sense that for all edges e and all vertices u of G, u is an endpoint of e if, and only if, 
g(u) is an endpoint of h(e). An example for a particular vertex v is shown below. 


ez h(ez) 
e h(e;) 
3 
v 
e4 
65 
Degree v =3+2.2=7 Degree g(v) =3+2:2=7 
Let e4, éo,..., €m be the m distinct edges that are incident on a vertex v in G, where m is a 
nonnegative integer. Then h(e,), h(e2),..., H(@,,) are m distinct edges that are incident on 
g(v) in G’. [The reason why h(e,), h(ey),..., hen) are distinct is that h is one-to-one and e}, 
ez, . . - , €m are distinct. And the reason why h(e,), h(é2), . . . , hem) are incident on g(v) is that g 
and h preserve the edge-endpoint functions of G and G' and e}, e2, .. . , €m are incident on v.] 


Also, there are no edges incident on g(v) other than the ones that are images under h of 
edges incident on v [because g is onto and g and h preserve the edge-endpoint functions of 
G and G']. Thus the number of edges incident on v equals the number of edges incident 
on g(v). 

Finally, an edge e is a loop at v if, and only if, h(e) is a loop at g(v), so the number of 
loops incident on v equals the number of loops incident on g(v). [For since g and h preserve 
the edge-endpoint functions of G and G’, a vertex w is an endpoint of e in G if, and only if, g(w) 
is an endpoint of h(e) in C”. It follows that v is the only endpoint of e in G if, and only if, g(v) is 
the only endpoint of h(e) in G'.] 

Now the degree of v, which is k, equals the number of edges incident on v that are not 
loops plus twice the number of edges incident on v that are loops (since each loop contrib- 
utes 2 to the degree of v). But we have already shown that the number of edges incident on 
v equals the number of edges incident on g(v), with the number of loops incident on v equal 
to the number of loops incident on g(v). Hence g(v) also has degree k. a 


Graph Isomorphism for Simple Graphs 


When graphs G and G’ are both simple, the definition of G being isomorphic to G’ can 
be written without referring to the correspondence between the edges of G and the edges 
of G’. 
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Definition 


If Gand G’ are simple graphs, then G is isomorphic to G’ if, and only if, there exists 
a one-to-one correspondence g from the vertex set V(G) of G to the vertex set V(G’) 
of G’ that preserves the edge-endpoint functions of G and G’ in the sense that for all 
vertices u and v of G, 


{u, v} isanedgeinG © {g(u), g(v)} is an edge in G”. 10332 


Isomorphism of Simple Graphs 


Are the two graphs shown below isomorphic? If so, define an isomorphism. 


b x 
d Z 
G G 


Solution Yes. Define g: V(G) — V(G’) by the arrow diagram shown below. 


V(G) V(G') 


Then g is one-to-one and onto by inspection. The fact that g preserves the edge-endpoint 
functions of G and G” is shown by the following table: 


Edges of G Edges of G’ 
{a, b} {y,w} = {8(a), g(b)} 
{a, c} {y, x} = {8(a), g(c)} 
{a, d} {y, z} = {g(a), g(d)} 
{c, d} {x, z} = {g(), 8(d)} 
| 
TEST YOURSELF — > 
1. If Gand G' are graphs, then G is isomorphic to 2. A property P is an invariant for graph isomor- 
G” if, and only if, there exist a one-to-one corre- phism if, and only if, given any graphs G and G’, 
spondence g from the vertex set of G to the vertex if G has property P and G” is isomorphic to G 
set of G’ and a one-to-one correspondence h from then 


the edge set of G to the edge set of G’ such that 
for every vertex v and every edge e in G, v is an 
endpoint of e if, and only if, 


3. Some invariants for graph isomorphisms are 


2 3 3 z] 2 , 


; ; , and 
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For each pair of graphs G and G’ in 1-5, determine 
whether G and G’ are isomorphic. If they are, give func- 
tions g: V(G) — V(G’) and h: E(G) — E(G’) that define 
the isomorphism. If they are not, give an invariant for 
graph isomorphism that they do not share. 


Loy U4 
fi 
W2 fs 
e e e al w3 
1 3 4 
fa Ja 
e 
U2 U3 w4 
G G' 
2. 
e l2 
V. 
5 v3 z 
3 
e4 
&6 
v 
i 
€7 
e5 wi 
G G' 
3. fi 


V4 
vu “4 Ís fa f 
e e3 
e5 
v 6 U3 w4 
G G' 


w3 


For each pair of simple graphs G and G’ in 6-13, deter- 
mine whether G and G’ are isomorphic. If they are, give a 
function g: V(G) — V (G’) that defines the isomorphism. 
If they are not, give an invariant for graph isomorphism 
that they do not share. 


6. 
v v V3 U4 


7. ü 


a i 
U4 U3 W4 


8. 
a c u 
f d z 
G A 
9. 
a b u x 
e d 
P 
10. 
a 
Ge 
&§ 
G a 
11. 


a c 
f d 
12. x T 
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13. 


14. 


15. 


16. 


CHAPTER10 THEORY OF GRAPHS AND TREES 


Prove that each of the properties in 21-29 is an invariant 
for graph isomorphism. Assume that n, m, and k are all 
nonnegative integers. 


21. Has n vertices 22. Has m edges 


23. Has a circuit of length k 
24. Has a simple circuit of length k 
Draw all nonisomorphic simple graphs with three H 25. Has m vertices of degree k 


vertiges: 26. Has m simple circuits of length k 


Draw all nonisomorphic simple graphs with four H 27. Is connected 28. Has an Euler circuit 
vertices. 

29. Has a Hamiltonian circuit 
Draw all nonisomorphic graphs with three verti- 


30. Show that the following two graphs are not iso- 


ces and no more than two edges. : : : : 
morphic by supposing they are isomorphic and 


17. Draw all nonisomorphic graphs with four vertices deriving a contradiction. 
and no more than two edges. 
i ; : i al e 3 és e6 
H 18. Draw all nonisomorphic graphs with four vertices S o n yA y 
and three edges. a 
G 
19. Draw all nonisomorphic graphs with six vertices, 
all having degree 2. 
. . TEENA" fh h A k 
20. Draw four nonisomorphic graphs with six verti- e ° e ° 
wi wz w3 wa f Ws We 
ces, two of degree 4 and four of degree 3. 5 
G' 


ANSWERS FOR TEST YOURSELF 


1. g(v) is an endpoint of h(e) 2.G' has property P 3. has n vertices; has m edges; has a vertex of degree k; has m vertices of 
degree k; has a circuit of length k; has a simple circuit of length k; has m simple circuits of length k; is connected; has an Euler 


circuit; has a Hamiltonian circuit 


J Trees: Examples and Basic Properties 


We are not very pleased when we are forced to accept a mathematical truth by virtue 
of a complicated chain of formal conclusions and computations, which we traverse 
blindly, link by link, feeling our way by touch. We want first an overview of the aim 
and of the road; we want to understand the idea of the proof, the deeper context. 
—Hermann Weyl, 1885-1955 


If a friend asks what you are studying and you answer “trees,” your friend may think you 
are taking a course in botany. But trees are also a subject for mathematical investigation. In 
mathematics, a tree is a connected graph that does not contain any circuits. Mathematical 
trees are similar in certain ways to their botanical namesakes. 


Definition 


A graph is said to be circuit-free if, and only if, it has no circuits. A graph is called 
a tree if, and only if, it is circuit-free and connected. A trivial tree is a graph that 
consists of a single vertex. A graph is called a forest if, and only if, it is circuit-free 
and not connected. 
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All the graphs shown in Figure 10.4.1 are trees, whereas those in Figure 10.4.2 are not. 


NZ. 


y 


(a) (b) (d) 
FIGURE 10.4.1 Trees. All the graphs in (a)—(d) are connected and circuit-free. 
(a) (b) (c) (d) 
FIGURE 10.4.2 Non-trees. The graphs in (a), (b), and (c) all have circuits, and the graph in (d) is not connected. E 


Examples of Trees 


The following examples illustrate just a few of the many and varied situations in which 


mathematical trees arise. 


SCUGOY: WA A Decision Tree 


During orientation week, a college administers a mathematics placement exam to all enter- 
ing students. The exam consists of two parts, and placement recommendations are made 
as indicated by the tree shown in Figure 10.4.3. Read the tree from left to right to decide 
what course should be recommended for a student who scored 9 on part I and 7 on part II. 


Score on 
part I 


et) Math 120 
Score on 
part II ~ | 
<10 Math 110 
>10 
Lel Math 110 
=8,9,10 Score on 
part II p | 
<6 Math 100 
<8 
Math 100 
FIGURE 10.4.3 
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Example 10.4.3 


IBM Archives 


John Backus 
(1924-1998) 


Peter Naur 
(1928-2016) 


Solution Since the student scored 9 on part I, the score on part II is checked. Since it is 
greater than 6, the student should be advised to take Math 110. E 


A Parse Tree 


In the last 30 years, Noam Chomsky and others have developed new ways to describe the 
syntax (or grammatical structure) of natural languages such as English. As is discussed 
briefly in Chapter 12, this work has proved useful in constructing compilers for high-level 
computer languages. In the study of grammars, trees are often used to show the derivation 
of grammatically correct sentences from certain basic rules. Such trees are called syntac- 
tic derivation trees or parse trees. 

A very small subset of English grammar, for example, specifies that 


a sentence can be produced by writing first a noun phrase and then a verb phrase; 

. a noun phrase can be produced by writing an article and then a noun; 

anoun phrase can also be produced by writing an article, then an adjective, and then a noun; 
a verb phrase can be produced by writing a verb and then a noun phrase; 

. one article is “the”; 

. one adjective is “young”; 

one verb is “caught”; 


. one noun is “man”; 


om N DAW KR wWHND 


one (other) noun is “ball.” 


The rules of a grammar are called productions. It is customary to express them using the 
shorthand notation illustrated below. This notation, introduced by John Backus in 1959 and 
modified by Peter Naur in 1960, was used to describe the computer language Algol and is 
called the Backus-Naur notation. In the notation, the symbol | represents the word or, and 
angle brackets ( ) are used to enclose terms to be defined (such as a sentence or noun phrase). 


1. (sentence) — (noun phrase) (verb phrase) 

2., 3. (noun phrase) — (article) (noun) | (article) (adjective) (noun) 
4. (verb phrase) — (verb) (noun phrase) 

5. (article) — the 

6. (adjective) —> young 

7. (verb) — caught 

8., 9. (noun) — man | ball 


The derivation of the sentence “The young man caught the ball” from the above rules is 
described by the tree shown below. 


(sentence) 
ee- Oa 
(noun phrase) (verb phrase) 
(article) (adjective) (noun) a fo phrase) 
d, ome mm di (article) (noun) 


the ball 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


10.4 TREES: EXAMPLES AND BASIC PROPERTIES 723 


In the study of linguistics, syntax refers to the grammatical structure of sentences, and se- 
mantics refers to the meanings of words and their interrelations. A sentence can be syntactically 
correct but semantically incorrect, as in the nonsensical sentence “The young ball caught the 
man,” which can be derived from the rules given above. Or a sentence can contain syntactic er- 
rors but not semantic ones, as, for instance, when a two-year-old child says, “Me hungry!” m 


Structure of Hydrocarbon Molecules 


The German physicist Gustav Kirchhoff (1824—1887) was the first to analyze the behavior 
of mathematical trees in connection with the investigation of electrical circuits. Soon af- 
ter (and independently), the English mathematician Arthur Cayley used the mathematics 
of trees to enumerate all isomers for certain hydrocarbons. Hydrocarbon molecules are 
composed of carbon and hydrogen; each carbon atom can form up to four chemical bonds 
with other atoms, and each hydrogen atom can form one bond with another atom. Thus 
the structure of hydrocarbon molecules can be represented by graphs such as those shown 
below, in which the vertices represent atoms of hydrogen and carbon, denoted H and C, and 
the edges represent the chemical bonds between them. 


H H H H H H H 
a a ee a 
oe es & — 
f 
Butane Isobutane 


Note that each of these graphs has four carbon atoms and ten hydrogen atoms, but the two 
graphs show different configurations of atoms. When two molecules have the same chemi- 
cal formulas (in this case C,Hj,) but different chemical bonds, they are called isomers. 

Certain saturated hydrocarbon molecules contain the maximum number of hydrogen 
atoms for a given number of carbon atoms. Cayley showed that if such a saturated hydro- 
carbon molecule has k carbon atoms, then it has 2k + 2 hydrogen atoms. The first step in 
doing so is to prove that the graph of such a saturated hydrocarbon molecule is a tree. Prove 
Arthur Cayley this using proof by contradiction. (You are asked to finish the derivation of Cayley’s result 
(1821-1895) in exercise 4 at the end of this section.) 


Bettmann/Getty Images 


Solution Suppose there is a hydrocarbon molecule that contains the maximum number 
of hydrogen atoms for the number of its carbon atoms and whose graph G is not a tree. [We 
must derive a contradiction.] Since G is not a tree, G is not connected or G has a circuit. But 
the graph of any molecule is connected (all the atoms in a molecule must be connected to 
each other), and so G must have a nontrivial circuit. Now the edges of the circuit can link 
only carbon atoms because every vertex of a circuit has degree at least 2 and a hydrogen 
atom vertex has degree 1. Delete one edge of the circuit and add two new edges to join each 
of the newly disconnected carbon atom vertices to a hydrogen atom vertex as shown below. 


= = G e 
| 7| Tai 
Delete ao 
i Cc PPNA H 
Rest of circuit `~- e 
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The resulting molecule has two more hydrogen atoms than the given molecule, but the 
number of carbon atoms is unchanged. This contradicts the supposition that the given mol- 
ecule has the maximum number of hydrogen atoms for the given number of carbon atoms. 
Hence the supposition is false, and so G is a tree. E 


Characterizing Trees 


There is a somewhat surprising relation between the number of vertices and the number of 
edges of a tree. It turns out that if n is a positive integer, then any tree with n vertices (no 
matter what its shape) has n — 1 edges. Perhaps even more surprisingly, a partial converse 
to this fact is also true—namely, any connected graph with n vertices and n — | edges is a 
tree. It follows from these facts that if even one new edge (but no new vertex) is added to a 
tree, the resulting graph must contain a circuit. Also, from the fact that removing an edge 
from a circuit does not disconnect a graph, it can be shown that every connected graph has 
a subgraph that is a tree. It follows that if n is a positive integer, any graph with n vertices 
and fewer than n — 1 edges is not connected. 

A small but very important fact necessary to derive the first main theorem about trees 
is that any nontrivial tree must have at least one vertex of degree 1. 


Lemma 10.4.1 


Any tree that has more than one vertex has at least one vertex of degree 1. 


A constructive way to understand this lemma is to imagine being given a tree T with 
more than one vertex. You pick a vertex v at random and then search outward along a path 
from v looking for a vertex of degree 1. As you reach each new vertex, you check whether 
it has degree 1. If it does, you are finished. If it does not, you exit from the vertex along a 
different edge from the one you entered on. Because T is circuit-free, the vertices included 
in the path never repeat. And since the number of vertices of T is finite, the process of 
building a path must eventually terminate. When that happens, the final vertex v’ of the 
path must have degree 1. This process is illustrated below. 


Start here Search outward from v to 
find vertex v' of degree 1. 


This discussion is made precise in the following proof. 


Proof: 


Let T be a particular but arbitrarily chosen tree that has more than one vertex, and 
consider the following algorithm: 


Step 1: Pick a vertex v of T and let e be an edge incident on v. 
[If there were no edge incident on v, then v would be an isolated vertex. But 
this would contradict the assumption that T is connected (since it is a tree) and 
has at least two vertices.] 
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Step 2: While deg(v) > 1, repeat steps 2a, 2b, and 2c: 
Step 2a: Choose e’ to be an edge incident on v such that e’ # e. [Such an edge 
exists because deg(v) > 1 and so there are at least two edges incident 
on v.] 


Step 2b: Let v’ be the vertex at the other end of e’ from v. [Since T is a tree, e' 
cannot be a loop and therefore e' has two distinct endpoints.] 


Step 2c: Let e = e’ and v = v’. [This is just a renaming process in preparation 
for repeating step 2.] 


The algorithm just described must eventually terminate because the set of verti- 
ces of the tree T is finite and T is circuit-free. When it does, a vertex v of degree 1 
will have been found. 


Using Lemma 10.4.1 it is not difficult to show that, in fact, any tree that has more than 
one vertex has at least two vertices of degree 1. This extension of Lemma 10.4.1 is left to 
exercise 5 at the end of this section. Exercise 29 outlines a nonconstructive proof of this 
fact that uses proof by contradiction. 


Definition 


Let T be a tree. If T has at least two vertices, then a vertex of degree 1 in T is called 
a leaf (or a terminal vertex), and a vertex of degree greater than | in T is called 
an internal vertex (or a branch vertex). The unique vertex in a trivial tree is also 
called a leaf or terminal vertex. 


ZEUGT AJ Leaves and Internal Vertices in Trees 


Find all leaves (or terminal vertices) and all internal (or branch) vertices in the following 
tree: 


V, 
0 v v 
v3 
U6 d 
U4 


v us 


Solution The leaves (or terminal vertices) are vo, V2, V4, Vs, V7, and vg. The internal (or 
branch) vertices are ve, vı, and v3. E 


The following is the first of the two main theorems about trees: 


Theorem 10.4.2 


For any positive integer n, any tree with n vertices has n — | edges. 


The proof is by mathematical induction. In the inductive step, we assume that any tree 
with k vertices has k — 1 edges, and our job is to show that any tree with k + | vertices has 
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k edges. To do this, we start with an arbitrarily chosen tree T with k + 1 vertices. Then we 
try to find a vertex and an edge that we can remove from T to create a sub-tree T’ with k 
vertices. If this step is successful, we can apply the inductive hypothesis to show that T’ has 
k— 1 edges, and when we replace the vertex and edge that we removed, we can conclude 
that T has k edges. 

In order to find the vertex and edge to remove from T, we use Lemma 10.4.1, which 
states that T has a vertex v of degree 1. Since T is connected, v is attached to the rest of T 
by a single edge e as sketched below. 


If e and v are removed from T, what remains is a tree T’ with (k+ 1)— 1 = k vertices. By 
inductive hypothesis, then, T’ has k — 1 edges. Now the original tree T has one more vertex 
and one more edge than T’. Hence T must have (k— 1) + 1 = k edges. A formal version of 
this argument is given below. 


Proof (by mathematical induction): 
Let the property P(n) be the sentence 
Any tree with n vertices has n — 1 edges. <—P(n) 


We use mathematical induction to show that this property is true for every integer 
mN 


Show that P(1) is true: Let T be any tree with one vertex. Then T has zero edges 
(since it contains no loops). Since 0 = | — 1, then P(1) is true. 


Show that for every integer k = 1, if P(k) is true then P(k +1) is true: 


Suppose k is any positive integer for which P(ķ) is true. In other words, suppose that 


Any tree with k vertices has k — 1 edges. <— P(k) 
inductive hypothesis 


We must show that P(k + 1) is true. In other words, we must show that 


Note We can’t assume Any tree with k + 1 vertices has (k+ 1)— 1 = k edges. <—P(k+1) 
anything about T except : wake . . 
datha tt liverioes, Let T be a particular but arbitrarily chosen tree with k + 1 vertices. [We must show 


that T has k edges.] Since k is a positive integer, (k + 1) = 2, and so T has more than 
one vertex. Hence by Lemma 10.4.1, T has a vertex v of degree 1. Also, since T has 
more than one vertex, there is at least one other vertex in T besides v. Thus there is 
an edge e connecting v to the rest of T. Define a subgraph T” of T so that 


VT) = VWT)—{v} and E(T') = E(T) — {e}. 
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Then 


1. The number of vertices of T’ is (k+ 1)— 1 = k. 


2. T' is circuit-free (since T is circuit-free, and removing an edge and a vertex can- 
not create a circuit). 


3. T’ is connected (see exercise 24 at the end of this section). 


Hence, by the definition of tree, T’ is a tree. Since T’ has k vertices, by inductive 
hypothesis 


the number of edges of T’ = (the number of vertices of T’) — 1 
=k-1. 
It follows that 


the number of edges of T = (the number of edges of T’) + 1 
=(k-1)+1 
=k. 


[This is what was to be shown.] 


ZEWG AJ Determining Whether a Graph Is a Tree 
A graph G has ten vertices and twelve edges. Is it a tree? 


Solution No. By Theorem 10.4.2, any tree with ten vertices has nine edges, not twelve. E 


Seu YA Finding Trees Satisfying Given Conditions 
Find all nonisomorphic trees with four vertices. 


Solution By Theorem 10.4.2, any tree with four vertices has three edges. Thus the total 
degree of a tree with four vertices must be 6. Also, every tree with more than one vertex 
has at least two vertices of degree 1 (see the comment following Lemma 10.4.1 and exer- 
cises 5 and 29 at the end of this section). Thus the following combinations of degrees for 
the vertices are the only ones possible: 


1,1,1,3 and 1,1,2,2. 


There are two nonisomorphic trees corresponding to both of these possibilities, as shown 
below. 


and 


To prove the second major theorem about trees, we need another lemma. 


Lemma 10.4.3 


If G is any connected graph, C is any circuit in G, and any one of the edges of C is 
removed from G, then the graph that remains is connected. 
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Essentially, the reason why Lemma 10.4.3 is true is that any two vertices in a circuit are 
connected by two distinct paths. It is possible to draw the graph so that one of these goes 
“clockwise” and the other goes “counterclockwise” around the circuit. For example, in the 
circuit below, the clockwise path from vz to v3 is 


V2€3V3 
and the counterclockwise path from v3 to v3 is 


V2€2V1E1V0€6V5€5V4€4V3, Where V6 = Vo. 


VU, e3 V3 


——— 
Clockwise 


vi 


Counterclockwise 


Vo E6 Us 


Proof: 
Suppose G is a connected graph, C is a circuit in G, and e is an edge of C. Form a 
subgraph G’ of G by removing e from G. Thus 

V(G') = V(G) 

E(G’) = E(G) — {e}. 
We must show that G’ is connected. [To show a graph is connected, we must show 
that if u and w are any vertices of the graph, then there exists a walk in G' from u to w.] 
Suppose u and w are any two vertices of G’. [We must find a walk from u to w.] Since 


the vertex sets of G and G” are the same, because u and w are both vertices of G, and 
since G is connected, there is a walk W in G from u to w. 


Case 1 (e is not an edge of W): The only edge in G that is not in G’ is e, so in this 
case W is also a walk in G”. Hence u is connected to w by a walk in G”. 


Case 2 (e is an edge of W): In this case the walk W from u to w includes a section of 
the circuit C that contains e. Let C be denoted as follows: 


C: Voe1V1€2V2 ** + CnVn (= Vo). 


Now e is one of the edges of C, so, to be specific, let e = e,. Then the walk W con- 
tains either the sequence 


Vk—1€kVk OF VREKVk-1. 


If W contains vg—1€Vp, connect v,_ to vg by taking the “counterclockwise” walk W' 
defined as follows: 


AN — 
W’: Vk—1Ek—1Vk-2 ° t VoCnYn—-1°** ekt Vk Where v, = Vo. 


An example showing how to go from u to w while avoiding e; is given in Figure 10.4.4. 
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To go from u to w 
while avoiding e,, go 
counterclockwise 
around the circuit. 


FIGURE 10.4.4 An Example of a Walk from u to w That Does Not Include Edge ep 


If W contains v,e,v,_1, connect vz to v,_, by taking the “clockwise” walk W” defined 
as follows: 


ny = 
W”: Viens Vet 1 Vpe1V1€2° tt ek—1Vg-1 Where v, = Vo. 


Now patch either W’ or W” into W to form a new walk from u to w. For instance, to 
patch W’ into W, start with the section of W from u to vy,_,, then take W’ from vg—1 
to v,, and finally take the section of W from vx to w. If this new walk still contains 
an occurrence of e, just repeat the process described previously until all occurrences 
are eliminated. /This must happen eventually since the number of occurrences of e in 
C is finite.] The result is a walk from u to w that does not contain e and hence is a 
walk in G”. 

The previous arguments show that both in case 1 and in case 2 there is a walk in 
G' from u to w. Since the choice of u and w was arbitrary, G’ is connected. 


The second major theorem about trees is a modified converse for Theorem 10.4.2. 


Theorem 10.4.4 


For any positive integer n, if G is a connected graph with n vertices and n — 1 edges, 
then G is a tree. 


Proof: 

Let n be a positive integer and suppose G is a particular but arbitrarily chosen graph 
that is connected and has n vertices and n — | edges. [We must show that G is a tree. 
Now a tree is a connected, circuit-free graph. Since we already know G is connected, it 
suffices to show that G is circuit-free.] Suppose G is not circuit-free. That is, suppose 
G has a circuit C. [We must derive a contradiction.] By Lemma 10.4.3, an edge of C 
can be removed from G to obtain a graph G” that is connected. If G’ has a circuit, 
then repeat this process: Remove an edge of the circuit from G’ to form a new con- 
nected graph. Continue repeating the process of removing edges from circuits until 
eventually a graph G” is obtained that is connected and is circuit-free. By definition, 
G" is a tree. Since no vertices were removed from G to form G”, G” has n vertices 
just as G does. Thus, by Theorem 10.4.2, G” has n— 1 edges. But the supposition 
that G has a circuit implies that at least one edge of G is removed to form G”. Hence 
G” has no more than (n — 1) — 1 = n— 2 edges, which contradicts its having n — 1 
edges. So the supposition is false. Hence G is circuit-free, and therefore G is a tree 
[as was to be shown]. 
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Theorem 10.4.4 is not a full converse of Theorem 10.4.2. Although it is true that every 
connected graph with n vertices and n — | edges (where n is a positive integer) is a tree, it 
is not true that every graph with n vertices and n — 1 edges is a tree. 


Seley A Graph with n Vertices and n — 1 Edges That Is Nota Tree 
Give an example of a graph with five vertices and four edges that is not a tree. 


Solution By Theorem 10.4.4, such a graph cannot be connected. One example of such an 
unconnected graph is shown below. 


Vy} U4 


v e v v. 
2 3 U3 5 E] 


Corollary 10.4.5 


If G is any graph with n vertices and m edges, where m and n are positive integers 
and m = n, then G has a circuit. 


Proof (by contradiction): 

Suppose not. That is, suppose there is a graph G with n vertices and m edges, where 
m and n are positive integers and m = n, and suppose G does not have a circuit. 
Let Gi, Go,..., Gg be the connected components of G, and let 1), nz, ..., ng be the 
number of vertices of Gi, Go,..., Gg, respectively. Because G4, G2, ..., Gy are the 
connected components of G, 


k 


Sin, =n. 


i=1 


Since G does not have a circuit, none of G1, G3, . . . , Gg have circuits either. So, since 
each is connected, each is a tree. By Theorem 10.4.4, the number of edges of each G; 
is n;-,. Now because G is composed of its connected components, 


k 
the number of edges of G = > (the number of edges of G) 
i=l 
= (m=i +m hi1) 
Mitt eng) Ue Lees 1) 


kts 
=n-—k, 
<n since k= 1. 
Thus the number of edges of G is less than n, which contradicts the hypothesis that 


the number of edges of G, namely, m, is greater than or equal to n. Hence the sup- 
position is false and G has a circuit. 
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10.4 TREES: EXAMPLES AND BASIC PROPERTIES 731 


1. A circuit-free graph is a graph with 


2. A forest is a graph that is 
graph that is 


, and a tree is a 


3. A trivial tree is a graph that consists of 


4. Any tree with at least two vertices has at least one 
vertex of degree ____ 


EXERCISE SET 10.4 


. Ifa tree T has at least two vertices, then a terminal 


vertex (or leaf) in T is a vertex of degree 
and an internal vertex (or branch vertex) in Tis a 
vertex of degree 


. For any positive integer n, any tree with n vertices 


has 


For any positive integer n, if G is a connected 
graph with n vertices and n — | edges then ___ 


1. Read the tree in Example 10.4.2 from left to right 
to answer the following questions. 
a. A student scored 12 on part I and 4 on part II. 
What course should the student take? 
b. A student scored 8 on part I and 9 on part II. 
What course should the student take? 


2. Draw trees to show the derivations of the 
following sentences from the rules given in 
Example 10.4.3. 

a. The young ball caught the man. 
b. The man caught the young ball. 


H 3. What is the total degree of a tree with n vertices? 
Why? 


4. Let G be the graph of a hydrocarbon molecule 
with the maximum number of hydrogen atoms for 
the number of its carbon atoms. 

a. Draw the graph of G if G has three carbon 
atoms and eight hydrogen atoms. 

b. Draw the graphs of three isomers of C;Hp. 

c. Use Example 10.4.4 and exercise 3 to prove 
that if the vertices of G consist of k carbon at- 
oms and m hydrogen atoms, then G has a total 
degree of 2k + 2m — 2. 

Hd. Prove that if the vertices of G consist of k car- 
bon atoms and m hydrogen atoms, then G has a 
total degree of 4k + m. 

e. Equate the results of (c) and (d) to prove Cayley’s 
result that a saturated hydrocarbon molecule 
with k carbon atoms and a maximum number of 
hydrogen atoms has 2k + 2 hydrogen atoms. 


H5. Extend the argument given in the proof of 
Lemma 10.4.1 to show that a tree with more than 
one vertex has at least two vertices of degree 1. 


If graphs are allowed to have an infinite number of 
vertices and edges, then Lemma 10.4.1 is false. Give 
a counterexample that shows this. In other words, 
give an example of an “infinite tree” (a connected, 
circuit-free graph with an infinite number of verti- 
ces and edges) that has no vertex of degree 1. 


Find all leaves (or terminal vertices) and all inter- 
nal (or branch) vertices for the following trees. 


a. v 
U5 
v 1 U4 
u3 
V6 
v7 
V: 
b. v vy 2 
Us 
v3 v7 
U6 Ug 


In each of 8-21, either draw a graph with the given specifi- 
cations or explain why no such graph exists. 


8. 
9. 
10. 
T". 
12. 
13. 


Tree, nine vertices, nine edges 

Graph, connected, nine vertices, nine edges 
Graph, circuit-free, nine vertices, six edges 
Tree, six vertices, total degree 14 

Tree, five vertices, total degree 8 


Graph, connected, six vertices, five edges, has a 
circuit 
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14. Graph, two vertices, one edge, not a tree 

15. Graph, circuit-free, seven vertices, four edges 
16. Tree, twelve vertices, fifteen edges 

17. Graph, six vertices, five edges, not a tree 

18. Tree, five vertices, total degree 10 


19. Graph, connected, ten vertices, nine edges, has a 
circuit 


20. Simple graph, connected, six vertices, six edges 
21. Tree, ten vertices, total degree 24 


22. A connected graph has twelve vertices and eleven 


edges. Does it have a vertex of degree 1? Why? 


23. A connected graph has nine vertices and twelve 
edges. Does it have a circuit? Why? 


24. Suppose that v is a vertex of degree | in a connect- 
ed graph G and that e is the edge incident on v. Let 
G” be the subgraph of G obtained by removing v 


and e from G. Must G’ be connected? Why? 


25. A graph has eight vertices and six edges. Is it con- 
nected? Why? 


ANSWERS FOR TEST YOURSELF 


H 26. 


27. 


H 28. 


29. 


30. 
31. 


Hb. 


If a graph has n vertices and n — 2 or fewer edges, 
can it be connected? Why? 


A circuit-free graph has ten vertices and nine 
edges. Is it connected? Why? 


Is a circuit-free graph with n vertices and at least 
n— | edges connected? Why? 


Prove that every nontrivial tree has at least two 
vertices of degree 1 by filling in the details and 
completing the following argument: Let T be a 
nontrivial tree and let S be the set of all paths from 
one vertex to another in T. Among all the paths 

in S, choose a path P with a maximum number of 
edges. (Why is it possible to find such a P?) What 
can you say about the initial and final vertices of 
P? Why? 


Find all nonisomorphic trees with five vertices. 


a. Prove that the following is an invariant for 
graph isomorphism: A vertex of degree i is 
adjacent to a vertex of degree j. 

Find all nonisomorphic trees with six vertices. 


1. no circuits 


5. 1; greater than 1 (Or: at least 2) 6. n-— 1 edges 


Rooted Trees 


7. Gis a tree 


2. circuit-free and not connected; connected and circuit-free 3. a single vertex (and no edges) 4. 1 


Let us grant that the pursuit of mathematics is a divine madness of the 


human spirit, a refuge from the goading urgency of contingent happenings. 
—Alfred North Whitehead, 1861—1947 


An outdoor tree is rooted and so is the kind of family tree that shows all the descendants 
of one particular person. The terminology and notation of rooted trees blends the language 
of botanical trees and that of family trees. In mathematics, a rooted tree is a tree in which 
one vertex has been distinguished from the others and is designated the root. Given any 
other vertex v in the tree, there is a unique path from the root to v. (After all, if there were 
two distinct paths, a circuit could be constructed.) The number of edges in such a path is 
called the level of v, and the height of the tree is the length of the longest such path. It is 
traditional in drawing rooted trees to place the root at the top (as is done in family trees) 
and show the branches descending from it. 
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Definition 


A rooted tree is a tree in which there is one vertex that is distinguished from the 
others and is called the root. The level of a vertex is the number of edges along the 
unique path between it and the root. The height of a rooted tree is the maximum 
level of any vertex of the tree. Given the root or any internal vertex v of a rooted tree, 
the children of v are all those vertices that are adjacent to v and are one level farther 
away from the root than v. If w is a child of v, then v is called the parent of w, and 
two distinct vertices that are both children of the same parent are called siblings. 
Given two distinct vertices v and w, if v lies on the unique path between w and the 
root, then v is an ancestor of w and w is a descendant of v. 


These terms are illustrated in Figure 10.5.1. 


Root Level 0 
v is a child of u. Level 1 
u is the parent of v. 
v and w are siblings. — -Level 2 
- Level 3 
if 

1 

( — Level 4 


Vertices in the enclosed region 
are descendants of u, which 
is an ancestor of each. 


FIGURE 10.5.1 A Rooted Tree 


eT Meee Rooted Trees 


Consider the tree with root vg shown below. 
a. What is the level of v5? b. What is the level of vo? 
c. What is the height of this rooted tree? d. What are the children of v3? 


e. What is the parent of v)? f. What are the siblings of vg? 
g. What are the descendants of v3? 
h. How many leaves (terminal vertices) are on the tree? 
vo 
vi v2 Us 
U4 Us U6 
Ur Ug Vg vio 

Solution 
a. 2 b. 0 G3 d. vsandve €. vo f. vyandvọ g. vs, vę vio h. 6 
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Note that in the tree shown below, the root is vo, vı has level 1, v; is the child of vo, and 
both vo and y; are leaves (terminal vertices). 


Vvo 


vı 


Binary Trees 


When every vertex in a rooted tree has at most two children and each child is designated 
either the (unique) left child or the (unique) right child, the result is a binary tree. 


Definition 


A binary tree is a rooted tree in which every parent has at most two children. Each 
child in a binary tree is designated either a left child or a right child (but not both), 
and every parent has at most one left child and one right child. A full binary tree is 
a binary tree in which each parent has exactly two children. 

Given any parent v in a binary tree T, if v has a left child, then the left subtree of 
v is the binary tree whose root is the left child of v, whose vertices consist of the left 
child of v and all its descendants, and whose edges consist of all those edges of T that 
connect the vertices of the left subtree. The right subtree of v is defined analogously. 


These terms are illustrated in Figure 10.5.2. 


Root 


v is the left 


child of u. i 


v 


x is the right 


ra child of w. 


= 
er a N 


“+7 


Seen 7 


Right subtree of w 


Left subtree of w 


FIGURE10.5.2 A Binary Tree 


Seley Representation of Algebraic Expressions 


Binary trees are used in many ways in computer science. One use is to represent algebraic 
expressions with arbitrary nesting of balanced parentheses. For instance, the following 
(labeled) binary tree represents the expression a/b: The operator is at the root and acts on 
the left and right children of the root in left-to-right order. 
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More generally, the binary tree shown below represents the expression a/(c + d). In 
such a representation, the internal vertices are arithmetic operators, the leaves are vari- 


ables, and the operator at each vertex acts on its left and right subtrees in left-to-right 


order. 


Draw a binary tree to represent the expression ((a — b)-c) + (d/e). 


Solution 


An interesting theorem about binary trees says that if you know the number of internal 
vertices of a full binary tree, then you can calculate both the total number of vertices and 
the number of leaves, and conversely. More specifically, a full binary tree with k internal 


vertices has a total of 2k + 1 vertices of which k + 1 are leaves. 


Theorem 10.5.1 


If k is a positive integer and T is a full binary tree with k internal vertices, then (1) T 
has a total of 2k + 1 vertices, and (2) T has k + 1 leaves. 


Proof: 

Suppose k is a positive integer and T is a full binary tree with k internal vertices. 
(1) Observe that the set of all vertices of T can be partitioned into two disjoint sub- 
sets: the set of all vertices that have a parent and the set of all vertices that do not 
have a parent. Now there is just one vertex that does not have a parent, namely the 
root. Also, since every internal vertex of a full binary tree has exactly two children, 
the number of vertices that have a parent is twice the number of parents, or 2k, since 
each parent is an internal vertex. Hence 


the number of the number of 
the total number . . 
, = | vertices that + | vertices that do 
of vertices of T 
have a parent not have a parent 
= 2k + 1. 


(continued on page 736) 
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(2) Because it is also true that the total number of vertices of T equals the number of 
internal vertices plus the number of leaves, 


p total ane i ie number of | 4 ic number 


of vertices of T internal vertices leaves 
the number of 
= k + . 
leaves 


Now equate the two expressions for the total number of vertices of T: 


the number of 
2k+1=k+ : 


leaves 
Solving this equation gives 


the number of 
leaves 


|- (2k+1)-k=k+1. 


Thus the total number of vertices is 2k + 1 and the number of leaves is k + 1 [as was 
to be shown]. 


Seu cay Determining Whether a Certain Full Binary Tree Exists 
Is there a full binary tree that has 10 internal vertices and 13 terminal vertices? 


Solution No. By Theorem 10.5.1, a full binary tree with 10 internal vertices has 
10+ 1 = 11 leaves, not 13. E 


Another interesting theorem about binary trees specifies the maximum number of 
leaves of a binary tree of a given height. Specifically, the maximum number of leaves of a 
binary tree of height h is 2”. Another way to say this is that a binary tree with t leaves has 
height of at least log; t. 


Theorem 10.5.2 

For every integer h = 0, if T is any binary tree with height h and ¢ leaves, then 
Pe 

Equivalently: logo t Sh. 


Proof (by strong mathematical induction): 
Let P(h) be the sentence 


If T is any binary tree of height h, then T has at most 2" leaves. <— P(h) 


Show that P(0) is true: We must show that if T is any binary tree of height 0, then 
T has at most 2° leaves. Suppose T is a tree of height 0. Then T consists of a single 
vertex, the root. By definition this is also a leaf, and so the number of leaves is 
t= 1 = 2’ = 2}, Hence t = 2" [as was to be shown]. 
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Show that for every integer k = 0, if P@) is true for each integer i from 0 through 
k, then it is true for k +1: 
Let k be any integer with k = 0, and suppose that 


For each integer i from 0 through k, if T is any < inductive hypothesis 
binary tree of height i, then T has at most 2' leaves. 


We must show that 


If T is any binary tree of height k + 1, then T <— P(k+1) 
has at most 2‘*! leaves. 


Let T be a binary tree of height k + 1, root v, and ft leaves. Because k = 0, we have 
that k+ 1 = 1 and so v has at least one child. 


Case 1 (v has only one child): In this case, we may assume without loss of generality that 

v’s child is a left child vz, and that vz is the root of the subtree Tz; of v. (This situation is 

illustrated in Figure 10.5.3.) Let t; be the number of leaves in Tz. By inductive hypothesis, 

DA= 2* because the height of 7; is one less than the height of T, which is k + 1. Also since 

the root v has only one child, v is also a leaf, and hence the total number of leaves in T is 

one more than the number of leaves in Tz. Finally, 2k = 2° = 1 because k = 0. 
Therefore, 


t=t +15 2+1 2+2 = 2.2 = oF"), 


Level 0 


Se ee EE Level 1 


-Level 2 


Level 3 


FIGURE10.5.3 A Binary Tree Whose Root Has One Child 
Case 2 (v has two children): In this case, v has both a left child, vz, and a right child, vp, 


and vz and ve are roots of a left subtree T; and a right subtree Tp. Note that T; and Tp 
are binary trees because T is a binary tree. (This situation is illustrated in Figure 10.5.4.) 


— Level 0 


Level 1 


Level 2 


I 
TEEN ee Level 3 


7------- Level 4 


Left subtree T; Right subtree Tp 


FIGURE10.5.4 A Binary Tree Whose Root Has Two Children 
(continued on page 738) 
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Let t; and tg be the numbers of leaves in T; and Tp, respectively, and let hz and hp be 
the heights of T; and Tp, respectively. Because T has height k + 1, then h; = k and 
hp =k, and so, by inductive hypothesis, 


tpx2™ and tex 2". 


Now the leaves of T consist exactly of the leaves of T; together with the leaves of 
Tr. Therefore, 


t= t ttgr = ght + 2r by inductive hypothesis 
since h; = k and hg =k. 


Hence, 
(2737 =n by basic algebra. 
Thus the number of leaves is at most 2+! [as was to be shown]. 


Since both the basis step and the inductive step have been proved, we conclude 
that for every integer h = 0, if T is any binary tree with height / and ż leaves, then 
ts 2" 

The equivalent inequality log, t = h follows from the fact that the logarithmic 
function with base 2 is increasing. In other words, for all positive real numbers x 
and y, 


if x < y then logy x < log y. 
Thus if we apply the logarithmic function with base 2 to both sides of 
jo", 
we obtain 
logy t = log,(2"). 


Now by definition of logarithm, log,(2") = h [because log,(2") is the exponent to 
which 2 must be raised to obtain 2"), Hence 


loggt Sh 


[as was to be shown]. 


Determining Whether a Certain Binary Tree Exists 


Is there a binary tree that has height 5 and 38 leaves? 


Solution No. By Theorem 10.5.2, any binary tree T with height 5 has at most 2° = 32 
leaves, so such a tree cannot have 38 leaves. E 


Corollary 10.5.3 
A full binary tree of height h has 2” leaves. 


To prove the corollary, start with the proof of Theorem 10.5.2, change the words “bi- 
nary tree” to “full binary tree,’ change the words “at most” to “exactly,” and change the 
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= sign to the = sign. Also delete Case 1 because the root of a full binary tree of height at 
least 1 is guaranteed to have two children. 


Binary Search Trees 


A binary search tree is a kind of binary tree in which data records, such as customer in- 
formation, can be stored, searched, and processed very efficiently. To place records into a 
binary search tree, it must be possible to arrange them in a total order. In case they do not 
have a natural total order of their own, an element of a totally ordered set, such as a number 
or a word and called a key, may be added to each record. The keys are inserted into the 
vertices of the tree and provide access to the records to which they are attached. 

Once it is built, a binary search tree has the following property: for every internal 
vertex v, all the keys in the left subtree of v are less than the key in v, and all the keys in 
the right subtree of v are greater than the key in v. For example, check that the following 
is a binary search tree for the set of records with the following keys: 15, 10, 19, 25, 12, 4. 


To build a binary search tree, start by making a root and insert a key into it. To add a 
new key, compare it to the key at the root. If the new key is less than the key at the root, 
give the root a left child and insert the new key into it. If the key is greater than the key at 
the root, give the root a right child and insert the new key into it. After the first couple of 
keys have been added, the root and other vertices may already have left and right children. 
So to add a key at a subsequent stage, work down the tree to find a place to put the new key, 
starting at the root and either moving left or right depending on whether the new key is less 
or greater than the key at the vertex to which it is currently being compared. This outline 
is expressed more precisely in the following algorithm. 


Algorithm 10.5.1 Building a Binary Search Tree 


Input: A totally ordered, nonempty set K of keys 


Algorithm Body: 
Initialize T to have one vertex, the root, and no edges. Choose a key from K to insert 
into the root. 


while (there are still keys to be added) 
Choose a key, newkey, from K to add. Let the root be called v, let key(v) be 
the key at the root, and let success = 0. 
while (success = 0) 
if (newkey < key(v)) 
then if (v has a left child), call the left child vz and let v:= vz 
else do 1. add a vertex v; to T as the left child for v 
2. add an edge to T to join v to vz 
3. insert newkey as the key for vg 
4. let success := 1 end do 


(continued on page 740) 
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if (newkey > key(v)) 
then if v has a right child 

then call the right child vp, and let v := vp 

else do 1. add a vertex vz to T as the right child for v 
2. add an edge to T to join v to vp 
3. insert newkey as the key for vp 
4. let success := 1 end do 

end while 
end while 
Output: A binary search tree T for the set K of keys 


Steps for Building a Binary Search Tree 


Go through the steps to build a binary search tree for the keys 15, 10, 19, 25, 12, 4, and 
insert the keys in the order in which they are listed. For simplicity, use the same names for 
the vertices and their associated keys. 


Solution 

Insert 15: Make 15 the root. 

Insert 10: Compare 10 to 15. 

Since 10 < 15 and 15 does not have a left child, make 10 the left child of 15 and add an 
edge joining 15 and 10. 

Insert 19: Compare 19 to 15. 

Since 19 > 15 and 15 does not have a right child, make 19 the right child of 15 and add an 
edge joining 15 and 19. 

Insert 25: Compare 25 to 15. 

Since 25 > 15 and 15 has a right child, namely 19, compare 25 to 19. 

Since 25 > 19 and 19 does not have a right child, make 25 the right child of 19 and add an 
edge joining 19 and 25. 

Insert 12: Compare 12 to 15. 

Since 12 < 15 and 15 has a left child, namely 10, compare 12 to 10. 

Since 12 > 10 and 10 does not have a right child, make 12 the right child of 10 and add an 
edge joining 10 and 12. 

Insert 4: Compare 4 to 15. 

Since 4 < 15 and 15 has a left child, namely 10, compare 4 to 10. 

Since 4 < 10 and 10 does not have a left child, make 4 the left child of 10 and add an edge 
joining 10 and 4. 


The sequence of steps is shown in the following diagrams. 


9 oo 
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Note that the algorithm does not specify an order in which keys are to be added as 
the tree is being built. In fact, adding keys in a different order usually results in a dif- 
ferent binary search tree for the given set of keys. For example, if the keys shown in 
Example 10.5.5 are added in the order 19, 10, 25, 4, 15, 12, the result is the following 
binary search tree: 


TEST YOURSELF 


1. A rooted tree is a tree in which 
a vertex in a rooted tree is 
rooted tree is 


. The level of 4. If kis a positive integer and T is a full binary tree 
. The height of a with k internal vertices, then T has a total of 
vertices and has leaves. 


2. A binary tree is a rooted tree in which 5. If Tis a binary tree that has t leaves and height 


h, then t and / are related by the inequality 


3. A full binary tree is a rooted tree in which 


EXERCISE SET 10.5 


1. Consider the tree shown below with root a. 2. Consider the tree shown below with root vo. 


a. What is the level of n? a. What is the level of vg? 

b. What is the level of a? b. What is the level of vg? 

c. What is the height of this rooted tree? c. What is the height of this rooted tree? 

d. What are the children of n? d. What are the children of vio? 

e. What is the parent of g? e. What is the parent of v5? 

f. What are the siblings of j? f. What are the siblings of v,? 

g- What are the descendants of f ? g- What are the descendants of v12? 

h. How many leaves (terminal vertices) are on h. How many leaves (terminal vertices) are on the 
the tree? tree? 
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3. Draw binary trees to represent the following 


expressions: 
a. a‘b—(c/(d+e)) b. a/(b—c:d) 


In each of 4-20, either draw a graph with the given speci- 
fications or explain why no such graph exists. 


4. Full binary tree, five internal vertices 


5. Full binary tree, five internal vertices, seven 
leaves 


6. Full binary tree, seven vertices, of which four are 
internal vertices 


7. Full binary tree, twelve vertices 

8. Full binary tree, nine vertices 

9. Binary tree, height 3, seven leaves 
10. Full binary tree, height 3, six leaves 
11. Binary tree, height 3, nine leaves 


12. Full binary tree, eight internal vertices, seven 
leaves 


13. Binary tree, height 4, eight leaves 
14. Full binary tree, seven vertices 
15. Full binary tree, nine vertices, five internal vertices 


16. Full binary tree, four internal vertices 


ANSWERS FOR TEST YOURSELF 


17. Binary tree, height 4, eighteen leaves 
18. Full binary tree, sixteen vertices 
19. Full binary tree, height 3, seven leaves 


20. What can you deduce about the height of a binary 
tree if you know that it has the following properties? 
a. Twenty-five leaves 
b. Forty leaves 
c. Sixty leaves 


In 21-25, use the steps of Algorithm 10.5.1 to build binary 
search trees. Use numerical order in 21 and alphabetical 
order in 22-25. In parts (a) and (b) of 21 and 22, the ele- 
ments in the lists are the same, but the trees are different 
because the lists are ordered differently. 


21. a. 16, 24, 21, 3, 18, 9,7 
b. 16, 7, 3, 21, 18, 24, 9 


22. a. Asia, Africa, Australia, Antarctica, Europe, 
North America, South America 
b. Australia, Antarctica, Africa, North America, 
Asia, South America, Europe 


23. Carpe diem. Seize the day. Make your lives ex- 
traordinary.! 


24. May the force be with you.” 


25. All good things which exist are the fruits of 
originality.° 


1. one vertex is distinguished from the others and is called the 
root; the number of edges along the unique path between it 
and the root; the maximum level of any vertex of the tree 


2. every parent has at most two children 3. every parent 
has exactly two children 4.2k+1;k+1 5.t5 A or, 
equivalently, log) t S h 


Spanning Trees and a Shortest Path Algorithm 


I contend that each science is a real science insofar as it is mathematics. 


—Immanuel Kant, 1724—1804 


An airline company wants to expand service to the midwestern part of the United States 
and has received permission from the U.S. Federal Aviation Authority to fly any of the 


routes shown in Figure 10.6.1. 


‘adapted from Dead Poets Society (film) 


> Star Wars (film) 
3John Stuart Mill, On Liberty 
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Minneapolis 


Milwaukee 
Detroit 


Cincinnati 


St. Louis Louisville 


Nashville 
FIGURE 10.6.1 
The company wishes to legitimately advertise service to all the cities shown but, for rea- 


sons of economy, wants to use the least possible number of individual routes to connect 
them. One possible route system is given in Figure 10.6.2. 


Minneapolis 


Milwaukee 
Detroit 


Cincinnati 


St. Louis Louisville 


Nashville 
FIGURE 10.6.2 


Clearly this system joins all the cities. Is the number of individual routes minimal? The 
answer is yes, and the reason may surprise you. 

The fact is that the graph of any system of routes that satisfies the company’s wishes is 
a tree, because if the graph were to contain a circuit, then one of the routes in the circuit 
could be removed without disconnecting the graph (by Lemma 10.4.3), and that would 
give a smaller total number of routes. Now any tree with eight vertices has seven edges. 
Therefore, any system of routes that connects all eight vertices and yet minimizes the total 
number of routes consists of seven routes. 


Definition 


A spanning tree for a graph G is a subgraph of G that contains every vertex of G 
and is a tree. 


The preceding discussion contains the essence of the proof of the following proposition: 


Proposition 10.6.1 


1. Every connected graph has a spanning tree. 


2. Any two spanning trees for a graph have the same number of edges. 


(continued on page 744) 
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Proof of part (1) of Proposition 10.6.1: 


Suppose G is a connected graph. If G is circuit-free, then G is its own spanning tree 
and we are done. If not, then G has at least one circuit C}. By Lemma 10.4.3, the 
subgraph of G obtained by removing an edge from C} is connected. If this subgraph 
is circuit-free, then it is a spanning tree and we are done. If not, then it has at least 
one circuit C,, and, as above, an edge can be removed from C; to obtain a connected 
subgraph. Continuing in this way, we can remove successive edges from circuits, un- 
til eventually we obtain a connected, circuit-free subgraph T of G. [This must happen 
at some point because the number of edges of G is finite, and at no stage does removal 
of an edge disconnect the subgraph.] Also, T contains every vertex of G because no 
vertices of G were removed in constructing it. Thus Tis a spanning tree for G. 


You are asked to prove part (2) of Proposition 10.6.1 in exercise 17 at the end of this section. 


Example 10.6.1 Spanning Trees 


Find all spanning trees for the graph G pictured below. 


Us v4 v3 


vo v v 


Solution The graph G has one circuit vọvıv4vz, and removing any edge of the circuit 
gives a tree. Thus, as shown below, there are three spanning trees for G. 


Us V4 V3 Us V4 V3 Us V4 V3 
Uo vi v Vo vi v vo v v 
Minimum Spanning Trees 


The graph of the routes allowed by the U.S. Federal Aviation Authority shown in Fig- 
ure 10.6.1 can be annotated by adding the distances (in kilometers) between each pair of 
cities. This is done in Figure 10.6.3. 


Minneapolis 


s74 
Milwaukee 
Detroit 


370 


Cincinnati 


St. Louis Louisville 


243 
Nashville 
FIGURE 10.6.3 
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Now suppose the airline company wants to serve all the cities shown, but with a route 
system that minimizes the total distance of the system as a whole. Note that such a system 
is a tree, because if the system contained a circuit, removal of an edge from the circuit 
would not affect a person’s ability to reach every city in the system from every other (again, 
by Lemma 10.4.3), but it would reduce the total distance of the system. 

More generally, a graph whose edges are labeled with numbers (known as weights) is 
called a weighted graph. A minimum-weight spanning tree, or simply a minimum spanning 
tree, is a Spanning tree for which the sum of the weights of all the edges is as small as possible. 


Definition and Notation 


A weighted graph is a graph for which each edge has an associated positive real 
number weight. The sum of the weights of all the edges is the total weight of the 
graph. A minimum spanning tree for a connected, weighted graph is a spanning 
tree that has the least possible total weight compared to all other spanning trees for 
the graph. 


If G is a weighed graph and e is an edge of G, then w(e) denotes the weight of e and 
w(G) denotes the total weight of G. 


The problem of finding a minimum spanning tree for a graph is certainly solvable. One 
solution is to list all spanning trees for the graph, compute the total weight of each, and 
choose one for which this total is a minimum. (The well-ordering principle for the integers 
guarantees the existence of such a minimum total.) This solution, however, is inefficient 
in its use of computing time because the number of distinct spanning trees is so large. For 
instance, a complete graph with n vertices has n"? spanning trees. Even using the fastest 
computers available today, examining all such trees in a graph with approximately 100 
vertices would require more time than is estimated to remain in the life of the universe. 

In 1956 and 1957 Joseph B. Kruskal and Robert C. Prim each described much more 
efficient algorithms to construct minimum spanning trees. Even for large graphs, both 
algorithms can be implemented so as to take relatively short computing times. 


Kruskal’s Algorithm 


In Kruskal’s algorithm, the edges of a connected, weighted graph are examined one by one 
in order of increasing weight. At each stage the edge being examined is added to what will 
become the minimum spanning tree, provided that this addition does not create a circuit. 
After n — 1 edges have been added (where n is the number of vertices of the graph), these 
edges, together with the vertices of the graph, form a minimum spanning tree for the graph. 


Algorithm 10.6.1 Kruskal 


Courtesy of Joyce Kruskal 


se ài 
Joseph Kruskal Input: G [a connected, weighted graph with n vertices, where n is a positive integer] 


(1928-2010) Algorithm Body: 


[Build a subgraph T of G to consist of all the vertices of G with edges added in order of 
increasing weight. At each stage, let m be the number of edges of T.] 


1. Initialize T to have all the vertices of G and no edges. 


2. Let E be the set of all the edges of G, and let m := 0. 
(continued on page 746) 
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3. while (m < n-— 1) 
3a. Find an edge e in E of least weight. 
3b. Delete e from E. 


3c. if addition of e to the edge set of T does not produce a circuit 
then add e to the edge set of T and set m := m+ 1 


end while 
Output: T [T is a minimum spanning tree for G.] 


The following example shows how Kruskal’s algorithm works for the graph of the air- 
line route system. 


Action of Kruskal’s Algorithm 


Describe the action of Kruskal’s algorithm on the graph shown in Figure 10.6.4, where 


n= 8. 
Minneapolis 
Nashville 
FIGURE 10.6.4 
Solution 
Iteration Number Edge Considered Weight | Action Taken 
1 Chicago—Milwaukee 119 added 
2 Louisville—Cincinnati 134 added 
3 Louisville—Nashville 243 added 
4 Cincinnati—Detroit 370 added 
5 St. Louis—Louisville 390 added 
6 St. Louis—Chicago 422 added 
7 Chicago—Louisville 433 not added 
8 Louisville—Detroit 493 not added 
9 Louisville—Milwaukee 560 not added 
10 Minneapolis—Chicago 571 added 
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The tree produced by Kruskal’s algorithm is shown in Figure 10.6.5. 


Minneapolis 


571 
Milwaukee 
Detroit 


370 


Cincinnati 


St. Louis Louisville 


243 
Nashville 
FIGURE 10.6.5 E 


When Kruskal’s algorithm is used on a graph in which some edges have the same weight 
as others, more than one minimum spanning tree can occur as output. To make the output 
unique, the edges of the graph can be placed in an array and edges having the same weight 
can be added in the order they appear in the array. 

It is not obvious from the description of Kruskal’s algorithm that it does what it is sup- 
posed to do. To be specific, what guarantees that it is possible at each stage to find an edge 
of least weight whose addition does not produce a circuit? And if such edges can be found, 
what guarantees that they will all eventually connect? And if they do connect, what guar- 
antees that the resulting tree has minimum weight? Of course, the mere fact that Kruskal’s 
algorithm is printed in this book may lead you to believe that everything works out. But the 
questions above are real, and they deserve serious answers. 


Theorem 10.6.2 Correctness of Kruskal’s Algorithm 


When a connected, weighted graph is input to Kruskal’s algorithm, the output is a 
minimum spanning tree. 


Proof: Suppose that G is a connected, weighted graph with n vertices and that T is a 
subgraph of G produced when G is input to Kruskal’s algorithm. Clearly T is circuit- 
free [since no edge that completes a circuit is ever added to T]. Also, T is connected. 
For as long as T has more than one connected component, the set of edges of G that 
can be added to T without creating a circuit is nonempty. [The reason is that since G 
is connected, given any vertex v; in one connected component C; of T and any vertex 
v, in another connected component C,, there is a path in G from v; to v3. Since C, and 
C, are distinct, there is an edge e of this path that is not in T. Adding e to T does not 
create a circuit in T, because deletion of an edge from a circuit does not disconnect a 
graph and deletion of e would.] The preceding arguments show that T is circuit-free 
and connected. Since by construction T contains every vertex of G, T is a spanning 
tree for G. 

Next we show that T has minimum weight. Let 7; be any minimum spanning tree 
for G such that the number of edges T, and T have in common is a maximum. Sup- 
pose that T # 7,. Then there is an edge e in T that is not an edge of 7). [Since trees 
T and T, both have the same vertex set, if they differ at all, they must have different, but 


(continued on page 748) 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


748 CHAPTER10 THEORY OF GRAPHS AND TREES 


same-size, edge sets.] Now adding e to T, produces a graph with a unique circuit (see 
exercise 19 at the end of this section). Let e’ be an edge of this circuit such that e’ is 
not in T. [Such an edge must exist because T is a tree and hence circuit-free.] Let T> be 
the graph obtained from 7, by removing e’ and adding e. This situation is illustrated 
below. 

The entire graph is G. Tı 
has black edges. e is in T 


but not 7. e’ is in Tı but 
not T. 


e' (remove from T) to form T3.) 


e (add to T) to form T3.) 


Note that T, has n — 1 edges and n vertices and that T, is connected [since by Lemma 
10.4.3 the subgraph obtained by removing an edge from a circuit in a connected graph 
is connected]. Consequently, T, is a spanning tree for G. In addition, 


w(T>) = w(T\) — wle') + we). 


Now w(e) = w(e’) because at the stage in Kruskal’s algorithm when e was added to 
T, e’ was available to be added [since it was not already in T, and at that stage its ad- 
dition could not produce a circuit since e was not in T], and e' would have been added 
had its weight been less than that of e. Thus 


w(Ty) = w(T,) —[wle') — w(e)] 
=0 


But T; is a minimum spanning tree. So since T, is a spanning tree with weight less 
than or equal to the weight of 7), T> is also a minimum spanning tree for G. 

Finally, note that by construction, T) has one more edge in common with T than 
T, does, which contradicts the choice of T) as a minimum spanning tree for G with a 
maximum number of edges in common with T. Thus the supposition that T # T is 
false, and hence T itself is a minimum spanning tree for G. 


Prim’s Algorithm 


Prim’s algorithm works differently from Kruskal’s. It builds a minimum spanning tree T by 
expanding outward in connected links from some vertex. One edge and one vertex are add- 
ed at each stage. The edge added is the one of least weight that connects the vertices already 
in T with those not in T, and the vertex is the endpoint of this edge that is not already in T. 


Nokia Bell Labs 


Algorithm 10.6.2 

Robert Prim 

(born 1921) Input: G [a connected, weighted graph with n vertices where n is a positive integer] 
Algorithm Body: 


[Build a subgraph T of G by starting with any vertex v of G and attaching edges (with 
their endpoints) one by one to an as-yet-unconnected vertex of G, each time choosing an 
edge of least weight that is adjacent to a vertex of T.] 
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1. Pick a vertex v of G and let T be the graph with one vertex, v, and no edges. 
2. Let V be the set of all vertices of G except v. 


3. for i:= 1 ton— 1 


3a. Find an edge e of G such that (1) e connects T to one of the vertices in V, and 
(2) e has the least weight of all edges connecting T to a vertex in V. Let w be 
the endpoint of e that is in V. 

3b. Add e and w to the edge and vertex sets of T, and delete w from V. 

next i 


Output: T [T is a minimum spanning tree for G.] 


The following example shows how Prim’s algorithm works for the graph of the airline 
route system. 


ZEWG] Action of Prim’s Algorithm 


Describe the action of Prim’s algorithm for the graph in Figure 10.6.6 using the Minneapo- 
lis vertex as a starting point. 


Minneapolis 


Milwaukee 
Detroit 


370 


Cincinnati 


Nashville 
FIGURE 10.6.6 
Solution 
Iteration Number Vertex Added Edge Added Weight 
0 Minneapolis 
1 Chicago Minneapolis—Chicago 571 
2 Milwaukee Chicago—Milwaukee 119 
3 St. Louis Chicago-St. Louis 422 
4 Louisville St. Louis—Louisville 390 
5 Cincinnati Louisville—Cincinnati 134 
6 Nashville Louisville—Nashville 243 
7 Detroit Cincinnati—Detroit 370 


Note that the tree obtained is the same as that obtained by Kruskal’s algorithm, but the 
edges are added in a different order. 
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As with Kruskal’s algorithm, in order to ensure a unique output, the edges of the graph 
could be placed in an array and those with the same weight could be added in the order 
they appear in the array. It is not hard to see that when a connected graph is input to Prim’s 
algorithm, the result is a spanning tree. What is not so clear is that this spanning tree is a 
minimum. The proof of the following theorem establishes that it is. 


Theorem 10.6.3 Correctness of Prim’s Algorithm 


When a connected, weighted graph G is input to Prim’s algorithm, the output is a 
minimum spanning tree for G. 


Proof: 

Let G be a connected, weighted graph, and suppose G is input to Prim’s algorithm. 
At each stage of execution of the algorithm, an edge must be found that connects a 
vertex in a subgraph to a vertex outside the subgraph. As long as there are vertices 
outside the subgraph, the connectedness of G ensures that such an edge can always 
be found. [For if one vertex in the subgraph and one vertex outside it are chosen, then 
by the connectedness of G there is a walk in G linking the two. As one travels along 
this walk, at some point one moves along an edge from a vertex inside the subgraph 
to a vertex outside the subgraph.] 

Now it is clear that the output T of Prim’s algorithm is a tree because the edge and 
vertex added to T at each stage are connected to other edges and vertices of T and 
because at no stage is a circuit created since each edge added connects vertices in 
two disconnected sets. [Consequently, removal of a newly added edge produces a dis- 
connected graph, whereas by Lemma 10.4.3, removal of an edge from a circuit produces 
a connected graph.] Also, T includes every vertex of G because T, being a tree with 
n— l edges, has n vertices [and that is all G has]. Thus T is a spanning tree for G. 

Next we show that T has minimum weight. Suppose there is a minimum spanning 
tree for G, T;, such that the number of edges T, and T have in common is a maximum, 
but T # T,. Then there is an edge e in T that is not an edge of 7). [Since trees T and 
T, both have the same vertex set if they differ at all, they must have different, same-size 
edge sets.] Of all such edges, let e be the last that was added when T was constructed 
using Prim’s algorithm. Let S be the set of vertices of T just before the addition of e. 
Then one endpoint, say v of e, is in S and the other, say w, is not. Since T} is a spanning 
tree, there is a path in T, joining v to w. And since v € S and w É S, as one travels 
along this path, one must encounter an edge e’ that joins a vertex in S to one that is 
not in S and that therefore is not in T because e was the last edge added to T. Now at 
the stage when e was added to T, e’ could have been added and it would have been 
added instead of e had its weight been less than that of e. Since e’ was not added at 
that stage, we conclude that 


w(e’) = w(e). 


Let T, be the graph obtained from T, by removing e’ and adding e. [Thus T, has 
one more edge in common with T than T, does.] Note that T is a tree. The reason 
is that since e’ is part of a path in T, from v to w, and e connects v and w, add- 
ing e to T, creates a circuit. When e’ is removed from this circuit, the resulting 
subgraph remains connected and has the same number of edges as T. In fact, T> is 
a spanning tree for G since no vertices were removed in forming T, from T;. The 
argument showing that w(7>) = w(7)) is left as an exercise. [It is virtually identical 
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to part of the proof of Theorem 10.6.2.] It follows that T» is a minimum spanning tree 
for G. 

By construction, T) has one more edge in common with T than T; does, which 
contradicts the choice of T) as a minimum spanning tree for G, not equal to T, with 
a maximum number of edges in common with T. It follows that T = T}, and hence T 
itself is a minimum spanning tree for G. 


Finding Minimum Spanning Trees 


Find all minimum spanning trees for the following graph. Use Kruskal’s algorithm and 
Prim’s algorithm starting at vertex a. Indicate the order in which edges are added to form 
each tree. 


Solution When Kruskal’s algorithm is applied, edges are added in one of the following 
two orders: 


1. {d, f}, {a, c}, {a, b}, {c, d}, {d, e} 
2. {d, f}, {a, c}, {b, c}, {c, d}, {d, e} 


When Prim’s algorithm is applied starting at a, edges are added in one of the following 
two orders: 


1. {a, c}, {a, b}, {c, d}, {d, f}, {d, e} 
2. {a, c}, {b, c}, {c, d}, {d, f}, {d, e} 


Thus, as shown below, there are two distinct minimum spanning trees for this graph. 


a 


(a) (b) E 


Dijkstra’s Shortest Path Algorithm 


Although the trees produced by Kruskal’s and Prim’s algorithms have the least pos- 
sible total weight compared to all other spanning trees for the given graph, they do not 
always reveal the shortest distance between any two points on the graph. For instance, 
according to the complete route system shown in Figure 10.6.3, one can fly directly 
from Nashville to Minneapolis for a distance of 1,118 kilometers, whereas if you use 
the minimum spanning tree shown in Figure 10.6.5 the only way to fly from Nashville 
to Minneapolis is by going through Louisville, St. Louis, and Chicago, which gives a 
total distance of 243 + 390 + 422 +571 = 1,626 kilometers and the unpleasantness of 
three changes of plane. 
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In 1959 the computing pioneer, Edsger Dijkstra (see Section 5.5), developed an algo- 
rithm to find the shortest path between a starting vertex and an ending vertex in a weighted 
graph in which all the weights are positive. It is somewhat similar to Prim’s algorithm in 
that it works outward from a starting vertex a, adding vertices and edges one by one to 
construct a tree T. However, it differs from Prim’s algorithm in the way it chooses the next 
vertex to add, ensuring that for each added vertex v, the length of the shortest path from a 
to v has been identified. 

At the start of execution of the algorithm, each vertex u of G is given a label L(u), which 
indicates the current best estimate of the length of the shortest path from a to u. L(a) is 
initially set equal to 0 because the shortest path from a to a has length 0, but, because 
there is no previous information about the lengths of the shortest paths from a to any other 
vertices of G, the label L(u) of each vertex u other than a is initially set equal to a number, 
denoted %, that is greater than the sum of the weights of all the edges of G. As execution of 
the algorithm progresses, the values of L(u) are changed, eventually becoming the actual 
lengths of the shortest paths from a to u in G. 

Because T is built up outward from a, at each stage of execution of the algorithm the 
only vertices that are candidates to join T are those that are adjacent to at least one vertex 
of T. Thus at each stage of Dijkstra’s algorithm, the graph G can be thought of as divided 
into three parts: the tree T that is being built up, the set of “fringe” vertices that are adja- 
cent to at least one vertex of the tree, and the rest of the vertices of G. Each fringe vertex 
is a candidate to be the next vertex added to T. The one that is chosen is the one for which 
the length of the shortest path to it from a through T is a minimum among all the vertices 
in the fringe. 

An essential observation underlying Dijkstra’s algorithm is that after each addition 
of a vertex v to T, the only fringe vertices for which a shorter path from a might be 
found are those that are adjacent to v [because the length of the path from a to v was a 
minimum among all the paths from a to vertices in what was then the fringe]. So after 
each addition of a vertex v to T, each fringe vertex u adjacent to v is examined and 
two numbers are compared: the current value of L(u) and the value of L(v) + w(v, u), 
where L(y) is the length of the shortest path to v (in T) and wọ, u) is the weight of 
the edge joining v and u. If L(v) + w(v, u) < L(u), then the value of L(u) is changed to 
L(v) + wQ, u). 

At the beginning of execution of the algorithm, the tree consists only of the vertex 
a, and L(a) = 0. When execution terminates, L(z) is the length of a shortest path from 
a toz. 

As with Kruskal’s and Prim’s algorithms for finding minimum spanning trees, there 
is a simple but dramatically inefficient way to find the shortest path from a to z: compute 
the lengths of all the paths and choose one that is shortest. The problem is that even for 
relatively small graphs, using this method to find a shortest path could require billions of 
years, whereas Dijkstra’s algorithm could do the job in a few seconds. 


Algorithm 10.6.3 Dijkstra 


Input: G [a connected simple graph with a positive weight for every edge], © [a number 
greater than the sum of the weights of all the edges in the graph], w(u, v) [the weight of 
edge {u, v}], a [the starting vertex], z [the ending vertex] 
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Note The unique path 

in the tree T from a to z 
is the shortest path in G 
from a to z. 


Example 10.6.5 
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Algorithm Body: 


1. Initialize T to be the graph with vertex a and no edges. Let V(T) be the set of 
vertices of T, and let E(T) be the set of edges of T. 


2. Let L(a) = 0, and for all vertices in G except a, let L(u) = ~. 
[The number L(x) is called the label of x.] 


3. Initialize v to equal a and F to be {a}. 
[The symbol v is used to denote the vertex most recently added to T.] 


4. while (z € V(T)) 
4a. F := (F— {v}) U {vertices that are adjacent to v and are not in V(T)} 
[The set F is called the fringe. Each time a vertex is added to T, it is removed 
from the fringe and the vertices adjacent to it are added to the fringe if they 
are not already in the fringe or the tree T.] 
4b. For each vertex u that is adjacent to v and is not in V(T), 
if L(v) + wi, u) < L(u) then 


Liv) := L(v) + wQ, u) 
Diu) := v 


[Note that adding v to T does not affect the labels of any vertices in the fringe 
F except those adjacent to v. Also, when L(u) is changed to a smaller value, the 
notation D(u) is introduced to keep track of which vertex in T gave rise to the 
smaller value.] 

4c. Find a vertex x in F with the smallest label 
Add vertex x to V(T), and add edge {D(x), x} to E(T) 
v := x [This statement sets up the notation for the next iteration of the loop.] 


end while 


Output: L(z) [L(z), a nonnegative integer, is the length of the shortest path from a 
to z.] 


The action of Dijkstra’s algorithm is illustrated by the flow of the drawings in Ex- 
ample 10.6.5. 


Action of Dijkstra’s Algorithm 


Show the steps in the execution of Dijkstra’s shortest path algorithm for the graph shown 
below with starting vertex a and ending vertex z. 
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Solution 
Step 1: Going into the while loop: V(T) = {a}, E(T) = ©, and F = {a} 


During iteration: 

F = {b,c}, L(b) = 3, L(c) = 4. 
z Since L(b) < L(c), b is added to 

V(T), D(b) = a, and {a, b} 

is added to E(T). 


Step 2: Going into the while loop: V(T) = {a, b}, E(T) = {{a, b}} 


b 6 d During iteration: 
3 7 F = {c,d, e}, L(c) = 4, L(d) = 9, 
a z L(e) = 8. 
4 12 Since L(c) < L(d) and L(c) < L(e), c is 
a added to V(T), D(c) =a, and {a, c} 
is added to E(T). 


Step 3: Going into the while loop: V(T) = {a, b, c}, E(T) = {{a, b}, {a, c}} 


b 6 d During iteration: 
3 7 F = {d, e}, Ld) = 9, L(e) =5 
a z L(e) becomes 5 because ace, which has 
4 12 length 5, is a shorter path to e than abe, 
c 1 e which has length 8. 


Since L(e) < L(d), e is added to V(T), 
D(e) = c, and {c, e} is added to E(T). 


Step 4: Going into the while loop: V(T) = {a, b, c, e}, 
E(T) = {{a, b}, {a, c}, fe, e}} 


b 6 d During iteration: 
3 7 F = {d, z}, L@ = 7, L(z) = 17 
a z L(d) becomes 7 because aced, which has 
4 12 length 7, is a shorter path to d than abd, 
c 1 e which has length 9. 


Since L(d) < L(z), d is added to V(T), 
D(d) = e, and {e, d} is added to E(T). 


Step 5: Going into the while loop: V(T) = {a, b, c, e, d}, 
E(T) = {{a, b}, {a, c}, {c, e}, fe, d}} 


b 6 d During iteration: 
3 7 F = {z}, Lz) = 14 
a z L(z) becomes 14 because acedz, which 
4 12 has length 14, is a shorter path to d than 
C l e abdz, which has length 17. 


Since z is the only vertex in F, its label 
is a Minimum, and so z is added to 
VT), D(z) = d, and {d, z} is added to 
ET). 


Execution of the algorithm terminates at this point because z E V(T). The shortest path 
from a to z has length L(z) = 14. 
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Keeping track of the steps in a table is a convenient way to show the action of the algo- 


rithm. Table 10.6.1 does this for the graph in Example 10.6.5. 


TABLE 10.6.1 

Step V(T) E(T) F L(a) | Lb) | L© | Ld) | L@ | LR 
0 {a} Ø {a} 0 00 o0 o 00 oo 
1 {a} Ø {b, c} 0 3 4 00 o0 
2 {a, b} {{a, b}} {c,d,e}| 0 3 4 9 8 o0 
3 {a, b, c} {{a, b}, {a, c}} {d,e} 0 3 4 9 5 oo 
4 {a, b, c, e} {{a, b}, {a,c}, {c, e}} {d, z} 0 3 4 7 5 17 
5 {a,b,c,e,d} | {{a, b}, {a,c}, {c, e}, {e, d}} {z} 0 3 4 7 5 14 
6 | {a,b,c, e,d,z} | {{a, b}, {a,c}, {c, e}, {e, d}, fe, z}} 


In step 1, D(b) = a; in step 2, D(c) = a; in step 3, D(e) = c; in step 4, D(d) = e; and in 
step 5, D(z) = e. Working backward gives the vertices in the shortest path. Because D(z) = d, 
D(d) = e, D(e) = c, and D(c) = a, the shortest path from a to z is acedz. 


It is clear that Dijkstra’s algorithm keeps adding vertices to T until it has added z. The proof 
of the following theorem shows that when the algorithm terminates, the label for z, L(z), is 


the length of the shortest path to z from a. 


Theorem 10.6.4 Correctness of Dijkstra’s Algorithm 


When a connected, simple graph with a positive weight for every edge is input to Di- 
jkstra’s algorithm with starting vertex a and ending vertex z, the output is the length 
of a shortest path from a to z. 


Proof: 

Let G be a connected, weighted graph with no loops or parallel edges and with a 
positive weight for every edge. Let T be the graph built up by Dijkstra’s algorithm, 
and for each vertex u in G, let L(u) be the label given by the algorithm to vertex u. 
For each integer n = 0, let the property P(n) be the sentence 


After the nth iteration of the while loop in Dijkstra’s algorithm, 
(1) Tis a tree, and (2) for every vertex v in T, L(v) is the length of a <—P(n) 
shortest path in G from a to v. 


We will show by mathematical induction that P(n) is true for each integer n from 0 
through the termination of the algorithm. 


Show that P(0) is true: When n = 0, the graph T is a tree because it is defined to 
consist only of the vertex a and no edges. In addition, L(a) is the length of the short- 
est path from a to a because the initial value of L(a) is 0. 


Show that for every integer k = 0, if P(k) is true then P(k +1) is also true: Let k 
be any integer with k = 0 and suppose that 


After the kth iteration of the while loop in Dijkstra’s algorithm, <—P(k) 
(1) Tis a tree, and (2) for every vertex v in T, L(v) is the length cele 


of a shortest path in G from a to v. 


(continued on page 756) 
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We must show that 


After the (k + 1)st iteration of the while loop in Dijkstra’s 
algorithm, (1) T is a tree, and (2) for every vertex v in T, <—P(k+1) 
L(v) is the length of a shortest path in G from a to v. 


Suppose that after the (k + 1)st iteration of the while loop in Dikjstra’s algorithm, the 
vertex v and edge {x, v} have been added to T, where x is in V(T). Clearly the new 
value of T is a tree because adding a new vertex to a tree along with the edge leading 
to it neither creates a circuit nor disconnects the tree. By inductive hypothesis, for 
each vertex y that is in the tree before the addition of v, L(y) is the length of a shortest 
path from a to y. So it remains only to show that L(y) is the length of a shortest path 
from a to v. 

Now, according to the algorithm, the final value of L(v) = L(x) + w(x, v). Con- 
sider any shortest path from a to v, and let {s, t} be the first edge in this path to leave 
T, where s E V(T) and t € V(T). This situation is illustrated below. 


Let LSP(a, v) be the length of a shortest path from a to v, and let LSP(a, s) be the 
length of a shortest path from a to s. Observe that 


LSP(a, v) = LSP(a, s) + w(s, t) because the path from f to v has length = 0 
= L(s) + w(s, À by inductive hypothesis because s is a vertex in T 


t is in the fringe of the tree, and so if L(s) + w(s, f) 
= L(x) + w(x, v) were less than L(x) + w(x, v) then t would have 
been added to T instead of r. 


On the other hand, 


because L(x) + w(x, v) is the length of a path from 
L(x) + w(x, v) = LSP(a, v) a to v and so it is greater than or equal to the length 
of the shortest path from a to v. 


Because both LSP(a, v) = L(x) + w(x, v) and L(x) + w(x, v) = LSP(a, v), we have 
that 


LSP(a, v) = L(x) + w(x, v). 
And since it is also the case that 
Liv) = L(x) + w(x, v), 
we conclude that 
L(v) = LSP(a, v). 


Therefore, L(y) is the length of a shortest path from a to v, which completes the proof 
by mathematical induction. 
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The algorithm terminates as soon as z is in T, and, since we have proved that the 
label of every vertex in the tree gives the length of the shortest path to it from a, then, 
in particular, L(z) is the length of a shortest path from a to z. 


TEST YOURSELF 


1. A spanning tree for a graph G is 


2. A weighted graph is a graph for which ___., and 
the total weight of the graph is 


3. A minimum spanning tree for a connected, 
weighted graph is ____. 


4. In Kruskal’s algorithm, the edges of a connected, 
weighted graph are examined one by one in order 
of starting with —— 


EXERCISE SET 10.6 


5. In Prim’s algorithm, a minimum spanning tree is 
built by expanding outward from an ina 
sequence of ___ 


6. In Dijkstra’s algorithm, a vertex is in the fringe if 
it is vertex in the tree that is being built up. 


7. At each stage of Dijkstra’s algorithm, the vertex 
that is added to the tree is a vertex in the fringe 
whose label is a ___ 


Find all possible spanning trees for each of the graphs in 
land 2. 


ts a b E vo v 
d c V3 Wy 


Find a spanning tree for each of the graphs in 3 and 4. 


3. b d 
a 


G e 


: f 


Use Kruskal’s algorithm to find a minimum spanning tree 
for each of the graphs in 5 and 6. Indicate the order in 
which edges are added to form each tree. 


Use Prim’s algorithm starting with vertex a or Vo to find a 

minimum spanning tree for each of the graphs in 7 and 8. In- 

dicate the order in which edges are added to form each tree. 
7. The graph of exercise 5. 8. The graph of exercise 6. 


For each of the graphs in 9 and 10, find all minimum span- 
ning trees that can be obtained using (a) Kruskal’s algorithm 
and (b) Prim’s algorithm starting with vertex a or t. Indicate 
the order in which edges are added to form each tree. 


9. ; b 


Pos, 
eae ee 


10. t fi u 2 Vv 


11. A pipeline is to be built that will link six cit- 
ies. The cost (in hundreds of millions of dollars) 
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of constructing each potential link depends on 
distance and terrain and is shown in the weighted 
graph below. Find a system of pipelines to connect 
all the cities and yet minimize the total cost. 


Salt Lake City 15 Cheyenne 


1.9 


Amarillo 


Albuquerque 
Phoenix quer 


12. Use Dikstra’s algorithm for the airline route system 
of Figure 10.6.3 to find the shortest distance from 
Nashville to Minneapolis. Make a table similar to 
Table 10.6.1 to show the action of the algorithm. 

Use Dijkstra’s algorithm to find the shortest path from 

ato z for each of the graphs in 13-16. In each case make 

tables similar to Table 10.6.1 to show the action of the 
algorithm. 


13. 


14. bic id 


15. The graph of exercise 9 with a = a and z = f 
16. The graph of exercise 10 with a = u and z = w 


17. Prove part (2) of Proposition 10.6.1: Any two 
spanning trees for a graph have the same number 
of edges. 


18. Given any two distinct vertices of a tree, there ex- 
ists a unique path from one to the other. 
a. Give an informal justification for the above 
statement. 
* b. Write a formal proof of the above statement. 


19. Prove that if G is a graph with spanning tree T and 
e is an edge of G that is not in T, then the graph 
obtained by adding e to T contains one and only 
one set of edges that forms a circuit. 
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20. 


21. 


H 22. 


23. 


24. 


H 25. 


* 26. 


* 27. 


28. 


29. 


30. 


Suppose G is a connected graph and T is a circuit- 
free subgraph of G. Suppose also that if any edge 
e of G not in T is added to 7, the resulting graph 
contains a circuit. Prove that T is a spanning tree 
for G. 


a. Suppose T, and T, are two different spanning 
trees for a graph G. Must T, and T, have an 
edge in common? Prove or give a counterex- 
ample. 

b. Suppose that the graph G in part (a) is simple. 
Must T, and T, have an edge in common? 
Prove or give a counterexample. 


Prove that an edge e is contained in every span- 
ning tree for a connected graph G if, and only if, 
removal of e disconnects G. 


Consider the spanning trees 7; and T in the proof 
of Theorem 10.6.3. Prove that w(T,) = w(7}). 


Suppose that T is a minimum spanning tree for a 
connected, weighted graph G and that G contains 
an edge e (not a loop) that is not in 7: Let v and 

w be the endpoints of e. By exercise 18 there is a 
unique path in T from v to w. Let e’ be any edge of 
this path. Prove that w(e’) = w(e). 


Prove that if G is a connected, weighted graph 
and e is an edge of G (not a loop) that has smaller 
weight than any other edge of G, then e is in every 
minimum spanning tree for G. 


If G is a connected, weighted graph and no two 
edges of G have the same weight, does there 
exist a unique minimum spanning tree for G? 
Use the result of exercise 19 to help justify your 
answer. 


Prove that if G is a connected, weighted graph and 
e is an edge of G that (1) has greater weight than 
any other edge of G and (2) is in a circuit of G, then 
there is no minimum spanning tree T for G such 
that e is in T. 


Suppose a disconnected graph is input to Kruskal’s 
algorithm. What will be the output? 


Suppose a disconnected graph is input to Prim’s 
algorithm. What will be the output? 


Modify Algorithm 10.6.3 so that the output 
consists of the sequence of edges in the shortest 
path from a to z. 
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31. Prove that if a connected, weighted graph G is input 
to Algorithm 10.6.4 (shown below), the output is a 
minimum spanning tree for G. 


Algorithm 10.6.4 


Input: G [a connected graph] 
Algorithm Body: 
1.7:=G. 


2. E := the set of all edges of G, m := the number of 
edges of G. 
3. while (m > 0) 
3a. Find an edge e in E that has maximal weight. 
3b. Remove e from E and set m := m- 1. 
3c. if the subgraph obtained when e is removed from 
the edge set of T is connected then remove e from 
the edge set of T 


end while 


Output: T [a minimum spanning tree for G] 


ANSWERS FOR TEST YOURSELF 


1. a subgraph of G that contains every vertex of G and total weight compared to all other spanning trees for the 
isatree. 2. each edge has an associated positive real graph 4. weight; an edge of least weight 5. initial vertex; 
number weight; the sum of the weights of all the edges of adjacent vertices and edges 6.adjacenttoa 7. minimum 
the graph 3. a spanning tree that has the least possible among all those in the fringe 
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ANALYSIS OF ALGORITHM 
EFFICIENCY 


In 1637 the French mathematician and philosopher René Descartes published his great 
philosophical work Discourse on Method. An appendix to this work, called “Geometry,” 
laid the foundation for the subject of analytic geometry, in which geometric methods are 
applied to the study of algebraic objects, such as functions, equations, and inequalities, 
and algebraic methods are used to study geometric objects, such as straight lines, circles, 
and half-planes. 

The analytic geometry of Descartes provides the foundation for the main topic of this 
chapter: analyzing algorithm efficiency using the big-O, big-Omega, and big-Theta notations. 
In Section 11.1 we briefly discuss certain properties of graphs of real-valued functions of a 
real variable that are needed to understand these notations. In Section 11.2 we define the nota- 
tions, discuss why they are useful in the analysis of algorithms, and apply them to power and 


Bettmann/Getty Images 


René Descartes 
(1596-1650) polynomial functions. Then in Section 11.3 we show how to use the notations to compare the 


efficiencies of various algorithms designed to do the same job. Because the analysis of algo- 
rithms often involves logarithmic and exponential functions, we develop the needed properties 
of these functions in Section 11.4 and use them to analyze several algorithms in Section 11.5. 


EAM Real-Valued Functions of a Real Variable 
and Their Graphs 


The first precept was never to accept a thing as true until I knew it as such without a 
single doubt. —René Descartes, 1637 


A Cartesian plane or two-dimensional Cartesian coordinate system is a pictorial represen- 
tation of R X R, obtained by setting up a one-to-one correspondence between ordered pairs 
of real numbers and points in a Euclidean plane. To obtain it, two perpendicular lines, called 
the horizontal and vertical axes, are drawn in the plane. Their point of intersection is called 
the origin, and a unit of distance is chosen for each axis. An ordered pair (x, y) of real numbers 
corresponds to the point P that lies |x| units to the right or left of the vertical axis and | y| units 
above or below the horizontal axis. On each axis the positive direction is marked with an arrow. 


Definition 


A real-valued function of a real variable is a function from one set of real numbers 
to another. If fis a real-valued function of a real variable, then for each real number 


x in the domain of f there is a unique corresponding real number f(x). The graph of f 
is the set of all points (x, y) in the Cartesian coordinate plane with the property that 
x is in the domain of f and y = f(x). 


760 
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The definition of graph (see Figure 11.1.1) means that for each x in the domain of f: 


y=f(x) < the point (x, y) lies on the graph of f. 


(x, f) 


f(x) = the height of 
the graph of f at x 


Graph of f ———> 


ees 


FIGURE 11.1.1 Graph of a Function f 


Note that if f(x) can be written as an algebraic expression in x, the graph of the function 
fis the same as the graph of the equation y = f(x) where x is restricted to lie in the domain 


of f. 


Power Functions 


A function that sends a real number x to a particular power, x“, is called a power function. 
For applications in computer science, we are almost invariably concerned with situations 
where x and a are nonnegative, and so we restrict our definition to these cases. 


Definition 


Let a be any nonnegative real number. Define p,, the power function with exponent 
a, as follows: 


Pa(x) =x" for each nonnegative real number x. 


Graphs of Power Functions 


Sketch the graphs of the power functions po, p1/2, Pı, and pz on the same coordinate axes. 


Solution Because the power function with exponent zero satisfies po(x) = x? = 1 for 
every nonnegative number x,* all points of the form (x, 1) lie on the graph of py for every 
x 2 0. So the graph is just a horizontal half-line of height 1 lying above the horizontal 
axis. Similarly, p(x) = x for every nonnegative number x, and so the graph of pı consists 
of all points of the form (x, x) where x is nonnegative. The graph is therefore the half-line 
of slope 1 that emanates from (0, 0). 

Since for each nonnegative number x, pj /2(x) = x? = Vx, any point with coordinates 
(x,Vx), where x is nonnegative, is on the graph of p, /2. For instance, the graph of pı/2 con- 
tains the points (0, 0), (1, 1), (4, 2), and (9, 3). Similarly, since p(x) = x, any point with 


*As in Section 5.2 (see page 281), for simplicity we define Oo = 1. 
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coordinates (x, x”) lies on the graph of p2. Thus, for instance, the graph of p> contains the 
points (0, 0), (1, 1), (2, 4), and (3, 9). 
The graphs of all four functions are shown in Figure 11.1.2. 


FIGURE 11.1.2 Graphs of Some Power Functions | 


The Floor Function 


The floor and ceiling functions arise in many computer science contexts. Example 11.1.2 
illustrates the graph of the floor function. In exercise 6 at the end of this section you are 
asked to draw the graph of the ceiling function. 


Graph of the Floor Function 


Recall that each real number either is an integer itself or sits between two consecutive inte- 
gers: For each real number x, there exists a unique integer n such that n = x < n + 1. The 
floor of a number is the integer immediately to its left on the number line. More formally, 
the floor function F is defined by the rule 

For each real number x, 


F(x) = |x] 
= the greatest integer that is less than or equal to x 


= the unique integer n such thatn Sx <n+ 1. 


Sketch a graph of the floor function. 


Solution If n is any integer, then for each real number x in the interval n =x <n+1, 
the floor of x, |x|, equals n. Thus on each such interval, the graph of the floor function is 
horizontal; for each x in the interval, the height of the graph is n. 

It follows that the graph of the floor function consists of horizontal line segments, like 
a staircase, as shown in Figure 11.1.3. The open circles at the right-hand edge of each step 
are used to show that those points are not on the graph. 
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YA 
34 o_o 
2+ e—o 
y=lLx] 
1+ e—o 
ep H H H > 
5 -4 -3 -2 -l i- 2 ga 5 * 
oe 
-1 
e—o2 + 
oo 347 
FIGURE11.1.3 Graph of the Floor Function | 


Graphing Functions Defined on Sets of Integers 


Many real-valued functions used in computer science are defined on sets of integers rather 
than on intervals of real numbers. But if you know what the graph of a function looks like 
when it is given by a formula on an interval of real numbers, you can obtain the graph of 
the function that is defined on the integers in the interval using the same formula by select- 
ing only the points on the known graph whose first coordinates are integers. For instance, 
if fis the function defined by the same formula as the power function p, but having as its 
domain the set of nonnegative integers, then f(n) = n for each nonnegative integer n. The 
graphs of pı, reproduced from Example 11.1.2, and fare shown side-by-side below. 


A A 
4+ 4+ g 
IF 3+ ° 
2+ 2+ ° 
1+ 1+ e 
i_—.—_1—_ 1 —_ s— tt ht 
1 2 3 4 1 2 3 4 


Graph of f where f(n) =n 
for each nonnegative integer n 


Graph of p, where p(x) =x 
for each nonnegative real number x 


Graph of a Function Defined on a Set of Integers 


Consider an integer version of the power function p;/2. In other words, define a function g 
by the formula g(ny=n! ? for each nonnegative integer n. Sketch the graph of g. 


Solution Look back at the graph of pı; in Figure 11.1.2. Draw the graph of g by repro- 
ducing only those points on the graph of p,/. with integer first coordinates. Thus for each 
nonnegative integer n, the point (n, ni! *) is on the graph of g. 


A 
4+ 
e 
3+ > 2 * 
e , . 
27 g% 
e 

l+ e 
D S S O S S E S S S E S E 

12 3 4 5 6 7 8 9 10 11 12 
Graph of g where g(n) = n!/? for each nonnegative integer n ra 
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Graph of a Multiple of a Function 


A multiple of a function is obtained by multiplying every value of the function by a fixed 
number. To understand the concept of O-notation, it is helpful to understand the relation 
between the graph of a function and the graph of a multiple of the function. 


Definition 


Let f be a real-valued function of a real variable and let M be any real number. The 
function M f, called the multiple of f by M or M times f, is the real-valued function 
with the same domain as f that is defined by the rule 


(M f) (x) = M-(f(x)) for each x E domain of f. 


If the graph of a function is known, the graph of any multiple can easily be deduced. Spe- 
cifically, if fis a function and M is a real number, the height of the graph of Mf at any real 
number x is M times the quantity f(x). To sketch the graph of Mf from the graph of f, you plot 
the heights M: (f (x)) on the basis of knowledge of M and visual inspection of the heights f(x). 


Graph of a Multiple of a Function 
Let f be the function whose graph is shown below. Sketch the graph of 2f. 


YA 


Solution At each real number x, you obtain the height of the graph of 2f by measuring 
the height of the graph of f at x and multiplying that number by 2. The result is the follow- 
ing graph. Note that the general shapes of f and 2f are very similar, but the graph of 2f is 
“stretched out”: the “highs” are twice as high and the “lows” are twice as low. 


Graph of 2f EN 
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Increasing and Decreasing Functions 
Consider the absolute value function, A, which is defined as follows: 


x ifx20 
A(x) = |x| = for each real number x. 
—x ifx<0 
When x = 0, the graph of A is the same as the graph of y = x, the straight line with slope 1 
that passes through the origin (0, 0). For x < 0, the graph of A is the same as the graph of 
y = —x, whichis the straight line with slope — 1 that passes through (0, 0). (See Figure 11.1.4.) 


y 


FIGURE 11.1.4 Graph of the Absolute Value Function 


Note that as you trace from left to right along the graph to the left of the origin, the 
height of the graph continually decreases. For this reason, the absolute value function is 
said to be decreasing on the set of real numbers less than 0. On the other hand, as you trace 
from left to right along the graph to the right of the origin, the height of the graph continu- 
ally increases. Consequently, the absolute value function is said to be increasing on the set 
of real numbers greater than 0. 

Since the height of the graph of a function f at a point x is f(x), these geometric concepts 
translate to the following analytic definition. 


Definition 


Let f be a real-valued function defined on a set of real numbers, and suppose the 
domain of f contains a set S. We say that fis increasing on the set S if, and only if, 


for all real numbers x, and x, in S, if x, < x then f(x;) < f(). 
We say that fis decreasing on the set S if, and only if, 
for all real numbers x, and x3 in S, if x, < x. then f(x1) > f %2). 


We say that fis an increasing (or decreasing) function if, and only if, fis increasing 
(or decreasing) on its entire domain. 


Figure 11.1.5 illustrates the analytic definitions of increasing and decreasing. 

It follows almost immediately from the definitions that both increasing functions and 
decreasing functions are one-to-one. You are asked to show this in exercise 10 at the end 
of this section. 


EU ae A Positive Multiple of an Increasing Function Is Increasing 


Suppose that fis a real-valued function of a real variable that is increasing on a set S of real 
numbers, and suppose M is any positive real number. Show that Mfis also increasing on S. 
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(x1, fD 


| 
l 
| (xp, fE) 
(2, f%2)) f(x) = the height 
of graph at x, Ji 
| f(x2) = the height 

l 

l 

l 


(x5) = the height 
foz j of graph at x3 


of graph at x» 


xı 


ry > — R 
f(x) = the height X2 s r > 
of graph at x, 
seem Fox) < flr») Fox) > for») 
An Increasing Function A Decreasing Function 
(a) i 
FIGURE 11.1.5 


Solution Suppose x; and x, are particular but arbitrarily chosen elements of S such that 
X < X. 


[We must show that (Mf)(x1) < (Mf) (x2)./ From the facts that x; < x and fis increasing, it 
follows that 


fœ) < fo). 
Then 
Mf (x1) < Mf(x2), 


since multiplying both sides of the inequality by a positive number does not change the 
direction of the inequality. Hence, by definition of Mf, 


(Mf) (x1) < Mp), 


and, consequently, Mf is increasing on S. a 


It is also true that a positive multiple of a decreasing function is decreasing, that a 
negative multiple of an increasing function is decreasing, and that a negative multiple of a 
decreasing function is increasing. You are asked to prove these facts in exercises 24—26 at 
the end of this section. 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. If fis a real-valued function of a real variable, then 4. Given a function f: R — R and a real number M, 
the domain and co-domain of fare both ___. the function M fis defined by ___. 

2. A point (x, y) lies on the graph of a real-valued 5. Given a function f: R — R, to prove that fis 
function of a real variable fif, and only if, increasing, you suppose that and then you 
ar show that 

3. If ais any nonnegative real number, then the 6. Given a function f: R — R, to prove that fis 
power function with exponent a, pa, is defined by decreasing, you suppose that and then you 
oe show that ; 
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11.1 


EXERCISE SET 11.1* 


REAL-VALUED FUNCTIONS OF A REAL VARIABLE AND THEIR GRAPHS 


767 


1. The graph of a function fis shown below. 
a. Is f(0) positive or negative? 
b. For what values of x does f(x) = 0? 


c. Find approximate values for x, and x, so that 


SQ) = fœ) = 1 but x, F x. 


d. Find an approximate value for x such that 


fi) = 15. 


e. As x increases from —3 to — 1, do the values of 


fincrease or decrease? 


f. As x increases from 0 to 4, do the values of f 5. 


increase or decrease? 


. Sketch the graphs of the power functions p,/; and 


P\/4 On the same set of axes. When 0 < x < 1, 
which is greater: x!’ or x!/42 When x > 1, which 


is greater: x) or x^? 


. Sketch the graphs of the power functions p; and p4 


on the same set of axes. When 0 < x < 1, which 
is greater: x° or x“? When x > 1, which is greater: 
x? orx’? 


Sketch the graphs of y = 2|x| and y = [2x] for each 
real number x. What can you conclude from these 
graphs? 


Sketch a graph for each of the functions defined in 6-9 
below. 


6. 


2. The graph of a function g is shown below. 
a. Is g(0) positive or negative? 
b. Find an approximate value of x so that 
g(x) = 0. 


c. Find approximate values for x, and x, so that 


gx) = g(x) = 1 but x) Fx. 


8. 
9. 


g(x) = [x] for each real number x (Recall that the 
ceiling of x, [x], is the least integer that is greater 
than or equal to x. That is, [x] = the unique integer 
n such thatn—-1<x =n.) 


h(x) = [x|—|x] for each real number x 
F(x) = lx! 2 for each real number x 


G(x) = x — |x] for each real number x 


In each of 10-13 a function is defined on a set of integers. 
Sketch a graph for each function. 


10 


Tl. 


d. Find an approximate value for x such that 


g(x) = —2. 


e. As x increases from —2 to 1, do the values of g B. 


increase or decrease? 


f. As x increases from | to 3, do the values of g 


increase or decrease? 


12. 


14. 


f(n) = |n| for each integer n 

g(n) = (n/2) + 1 for each integer n 
h(n) = |n/2] for each integer n = 0 
k(n) = Ia” 2) for each integer n = 0 


The graph of a function fis shown below. Find the 
intervals on which fis increasing and the intervals 
on which fis decreasing. 


A Graph of f 


*For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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15. Show that the function f: R — R defined by the 
formula f(x) = 2x — 3 is increasing on the set of 
real numbers. 


16. Show that the function g: R — R defined by the 
formula g(x) = —(x/3) + 1 is decreasing on the 
set of real numbers. 


17. Let h be the function from R to R defined by the 
formula h(x) = x? for each real number x 
a. Show that h is decreasing on the set of real 
numbers less than zero. 
b. Show that h is increasing on the set of real 
numbers greater than zero. 


18. Let k: R — R be the function defined by the 
formula k(x) = (x — 1)/x for each real number 
x #0. 
a. Show that k is increasing for every real number 
x>0. 
b. Is k increasing or decreasing for x < 0? Prove 
your answer. 


19. Show that if a function f: R — R is increasing, 
then fis one-to-one. 


20. Given real-valued functions f and g with the same 
domain D, the sum of f and g, denoted f+ g, is 
defined as follows: 


For each real number x, (f+ g)(x) = f(x) + g(x). 


Show that if fand g are both increasing on a set S, 
then f+ g is also increasing on S. 


21. a. Let m be any positive integer, and define 
f(x) = x” for each nonnegative real number x. 
Use the binomial theorem to show that fis an 
increasing function. 
b. Let m and n be any positive integers, and let 
g(x) = x"/" for each nonnegative real number 
x. Prove that g is an increasing function. 
Note: The results of exercise 21 are used in the 


exercises for Sections 11.2 and 11.4. 


22. Let fbe the function whose graph follows. Sketch 
the graph of 3f. 


ANSWERS FOR TEST YOURSELF 


Graph of f 24 


23. Let h be the function whose graph is shown below. 
Sketch the graph of 2 h. 
A 


g 
2-4 Graph of h 


24. Let fbe a real-valued function of a real variable. 
Show that if fis decreasing on a set S and if M is any 
positive real number, then Mf is decreasing on S. 


25. Let fbe a real-valued function of a real variable. 
Show that if fis increasing on a set S and if M is 
any negative real number, then Mf is decreasing 
on S. 


26. Let fbe a real-valued function of a real variable. 
Show that if fis decreasing on a set S and if M is 
any negative real number, then M/ is increasing 
on S. 


In 27 and 28, functions f and g are defined. In each case 
sketch the graphs of f and 2g on the same set of axes and 
find a number xo so that f(x) = 2g(x) for all x > x. You 
can find an exact value for xy by solving a quadratic equa- 
tion, or you can find an approximate value for xo by using 
a graphing calculator or computer. 


27. f(x) = x’ + 10x +11 and g(x) = x? for each real 
number x = 0 


28. f(x) = x? + 125x + 254 and g(x) = x’ for each real 
number x = 0 


1. sets of real numbers 2. y = f(x) 3. pa) = x“ 
for each real numberx 4. (Mf)@) = M- fx) for each 
xER 5.x and x, are any real numbers such that x, < x; 
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fœ) <f(%2) 6. x; and x2 are any real numbers such that 
xı < xz; f(x1) > f) 


11.2 BIG-O, BIG-OMEGA, AND BIG-THETA NOTATIONS 769 


[IPS Big-O, Big-Omega, and Big-Theta Notations 


As soon as an Analytical Engine exists, it will necessarily guide the future course of 
the science. Whenever any result is sought by its aid, the question will then arise— 
by what course of calculation can these results be arrived at by the machine in the 
shortest time? —Charles Babbage, 1864 


Understanding the relative efficiencies of computer algorithms is of much more than aca- 
demic interest. In industrial and scientific settings, the choice of an efficient over an ineffi- 
cient algorithm can save a great deal of money or even make the difference between being 
able or not being able to do a project at all. 

The cost and feasibility of implementing a computer algorithm are most affected by 
Charles Babbage the length of computer time and the amount of computer memory the algorithm requires. 
(1729-1871) While both are important, this chapter concentrates on basic techniques for calculating 
time efficiency, which is usually the more significant of the two. Occasionally, however, 
one algorithm may make more efficient use of time but less efficient use of memory than 
another, forcing a trade-off based on the resources available to the user. 

Charles Babbage’s Analytical Engine was similar in many respects to a modern com- 
puter. The quotation at the beginning of this section shows that he anticipated the impor- 
tance of analyzing the time efficiencies of computer algorithms almost a hundred years 
before the first computer was actually built. 

The main objects of analysis in this chapter will be algorithms that take a data array and 
either search it to find a particular element or sort it into ascending or descending order. As 
a simple example, imagine running an algorithm to search an array of data for a particular 
element. In the best case the algorithm might happen to find the element in its very first 
step; in the worst case it might have to check every element before ending. 

The best and worst cases cannot be predicted in advance because they depend on the 
nature of the data being processed, so, when comparing two algorithms, it is reasonable to 
want control over worst-case situations. Although the results involve approximation, analy- 
sis can reveal dramatic differences among algorithms designed to do the same job, at least 
for large data sets and in the worst cases. For example, the graph in Figure 11.2.1 gives a 
range of worst-case execution times for two algorithms used to sort sets of data: insertion 
sort and merge sort. As the length of the data set becomes larger and larger, the difference 
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between the ranges becomes dramatically greater. For example, sorting a data set with 10 
million items could take just a fraction of a second with merge sort but approximately 30 
minutes with insertion sort. 

The symbols © and Q are the uppercase Greek letters theta and omega. The ©-notation 
introduced in this section is determined by ranges such as those shown in Figure 11.2.1. It 
is related to two other notations: Q-notation and O-notation. The oldest of the notations, 
O-notation (read “big-O notation”), was introduced by the German mathematician Paul 
Bachmann in 1894 in a book on analytic number theory, and it was the first to be used to 
compare efficiencies of algorithms. As you will see, however, using O-notation alone can 
produce ambiguous or even misleading results. As a response, in 1976 Donald Knuth, a 
pioneer in the analysis of algorithms, introduced the Q- and ©-notations so that the growth 
of functions could be compared with greater precision. 

The idea of the notations is this: Suppose fand g are real-valued functions of an integer 
variable n and suppose g(n) = 0 for every integer n greater than some positive real number. 


1. If, for sufficiently large values of n, the values of f are greater than those of a positive 
multiple of g, then fis of order at least g, written “f(n) is O(g(n)).” 


2. If, for sufficiently large values of n, the values of f are positive and less than those of a 
positive multiple of g, then fis of order at most g, written “f(n) is O(g(n)).” 


3. If, for sufficiently large values of n, the values of fare bounded both above and below by 
those of positive multiples of g, then fis of order g, written “f(n) is © (g()).” 


These relationships are illustrated in Figure 11.2.2. 
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When they were originally defined, the order notations referred to functions defined on 
continuous intervals of real numbers rather than to functions defined on sets of integers. 
However, the important variable in the analysis of algorithm efficiency is the size of the 
problem the algorithm is designed to solve, which is an integer. For example, it might be 
the number of items in a data array to be sorted or the number of nodes in a graph where a 
minimum spanning tree is to be found. 

Now when a function satisfies a property for all the real numbers in an interval, then it 
satisfies the property for all the integers in the interval, and the proofs in this chapter are 
valid if the phrase “for every real number x” is substituted in place of “for every integer 
n.” Thus, although we restrict the discussion to functions defined on sets of integers, func- 
tions defined on intervals of real numbers are in the background. This is reflected in the 
fact that we show graphs as continuous curves rather than as sets of discrete points. As 
in Example 11.1.5, you can imagine how the graphs would look if they were defined for 
integer values only. 


Definition 


Let fand g be real-valued functions defined on the same set of nonnegative integers, 
with g(n) = 0 for every integer n = r, where r is a positive real number. Then 


1. fis of order at least g, written f(n) is O(g(@)) (f of n is big-Omega of g of n), if, 
and only if, there exist positive real numbers A and a = r such that 


Ag(n) =f(n) for every integer n = a. 


2. fis of order at most g, written f (n) is O(g(n)) (f of n is big-O of g of n), if, and only 
if, there exist positive real numbers B and b = r such that 


0 = f(n) = Bg(n) for every integer n = b. 


3. fis of order g, written f(n) is O(g(n)) (f of n is big-Theta of g of n), if, and only 
if, there exist positive real numbers A, B, and k = r such that 


Ag(n) S f(n) = Bg(n) for every integer n = k. 


Remark on Notation: In Section 7.1 we stated that we would carefully distinguish between 
a function f and a value f(n) of the function. The traditional use of the order notation vio- 
lates this principle. For instance, in the statement “f(n) is @(g(n)),” the symbols f(n) and 
g(n) are understood to refer to the functions f and g defined by the expressions f(n) and g(n), 
respectively. For example, the statement 


3n°+4n+5 is O(n’) 


means that fis of order g where f and g are defined by the formulas f(n) = 3n’ +4n+5 
and g(n) = n’ for every integer n = 1. 


Translating to ©-Notation 


Use ©-notation to express the statement 


4n < 17n — 45n? +2n +8 = 30n? for every integer n = 3. 


Solution Let A = 4, B = 30, and k = 3. Then the statement translates to 


An® < 17n — 45n? +2n+8 < Bn® for every integer n = k. 
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So, by definition of ®-notation, 


17n — 45n? +2n+8 is @(n°). E 


ZcEWJGILP YA Translating to Q- and O-Notations and Deducing ©-Notation 
a. Use Q-notation to express the statement 


iis aN oa , 
an =3 4 +5n° for every integer n = 2. 
b. Use O-notation to express the statement 


2 
Os AE) +50 <6n? for every integer n = 1. 


2 
c. Justify the statement: (z) +5n? is @(n’). 


Solution 


a. Let A = a and a = 2. The statement in (a) translates to 
2 n|\ 2 : 
An = l) +5n° for every integer n = a. 
So, by definition of Q-notation, 
n N2 
A) +57? is O(n’). 
4 
b. Let B = 6 and b = 1. The statement in (b) translates to 
n N2 
Os l) +5n? < Bn? for every integer n = b. 
So, by definition of O-notation, 


2 
A) +5n? is O(n’). 
4 
11 


c. Let A = 7.4 = 2, B = 6, and b = I, and let k be the larger of a and b. Then when 
n = k, both inequalities in parts (a) and (b) are satisfied, and so 


2 n|\” 2 2 ; 
An = A) +5n = Bn’ for every integer n = k. 


Hence by definition of @-notation, 


2 
l) +5n? is OMD. , 
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Part (c) of Example 11.2.2 illustrates the fact that if you know both that fis of order at 
most g and that fis of order at least g, then you may take k to be the larger of the numbers 
a and b promised in the definitions for big-Omega and big-O and conclude that f is of 
order g. Conversely, if fis of order g, then both a and b may be taken to be the number k 
promised in the definition for big-Theta to show that fis of order at most g and fis of order 
at least g. You are asked to summarize this discussion in a formal proof in exercise 19 at 
the end of this section. 


Theorem 11.2.1 Relation among O-, ©-, and ©-Notations 


If f and g are real-valued functions defined on the same set of nonnegative integers, 
and if f(n) = 0 and g(n) = 0 for every integer n = r, where r is a positive real number, 


then fn) is O(g(n)) if, and only if, f(n) is O(g(n)) and f(n) is O(g(n)). 


Orders of Power Functions 


The functions that are most commonly used for comparing algorithm efficiencies are 
power functions, such as ni! z n, n, and n, and combinations involving a power func- 
tion and an exponential or logarithmic function, such as 2”, log(n), n log(n), and n? log(n). 
These functions arise naturally in the analysis of algorithms, but, for large values of n, 
their sizes are dramatically different. In this section we focus on power functions, while in 
Section 11.4 we discuss functions that involve logarithms. 

Observe that if 


l <n, 
then 
nan by multiplying both sides 
by n (which is positive) 
and so 
2 3 ac A 
n =n by multiplying again by n. 


Thus if n = 1, then 


lsns n? = n? by transitivity of order. 


The following theorem generalizes this result. Exercises 46 and 47 at the end of this 
section provide an outline for the proof. 


Theorem 11.2.2 For any positive rational numbers r and s and any integer 
n=1, 


ifr=s, then n’ =n’. 


The relation among the graphs of various positive power functions of n for n = 1 is 
shown graphically in Figure 11.2.3. 


Orders of Polynomial Functions 


The following two examples show how to use Theorem 11.2.2 to find a big-Q and a big-O 
for some polynomial functions. 
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YA 
44 

If r < s, the graph of 

y =n’ lies underneath the 
FF graph of y=n* forn > 1. 
2+ 


| > 
n 


i 
T 
1 2 3 4 


FIGURE 11.2.3 Graphs of Powers of n for n = 1 


Finding a Big-Omega and a Big-O for a Polynomial Function with Nonnegative 
Coefficients 


a. Show that 157° + 11n? +9 is Q(n’). 
b. Without using part (a), show that 15n? + 11n? +9 is O(n’). 


Solution 


a. To show that 157? + 11n? +9 is Qn’), you need to show that 157° + 11n? +9 is greater 
than or equal to a positive multiple of n’ for all values of n that are sufficiently large. Now 


15n? < 15n°+11n?+9 for every integer n = 1 

because when n = 1, 117? +9 is positive. Thus you can let A = 15 and a = 1 to obtain 
An? < 15n +11n?+9 for every integer n = a, 

and conclude, by definition of Q-notation, that 15n? +117? +9 is A(n>). 


b. To show that 15n* + 11n? +9 is O(n), you need to show that 157° + 11n? +9 is greater 
than or equal to 0 and less than or equal to some positive multiple of n? for all values of n 
that are sufficiently large. First note that because all terms of 157° + 11n? +9 are positive, 


0 < 15n?+11n*+9 for every integer n = 1. 


Next observe that for every integer n = 1, 


15n + 11n? +905 15n° + 11n? + 9n? because, by Theorem 11.2.2, 
since n = 1, then n S n? and 
t= ae and so lin < 11n? 


and 9 < 9n? 
= 35n because 15+ 11+9 = 35. 
Thus, by transitivity of order and equality, 
0 =< 15n°+11n+7 <35n° for every integer n = 1. 
Let B = 35 and b = 1. Then 


0 < 15n?+11n?+9 < Bn? for every integer n = b, 
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and so, by definition of O-notation, 


150? +117? +9 is O(n’). E 


Finding a Big-Omega and a Big-O for a Polynomial Function 
with Some Negative Coefficients 


a. Show that nf — 5n — 8 is Q(n*). 
b. Show that nf — 5n — 8 is O(n’). 


Solution 


a. To show that nf — 5n — 8 is Qn), you need to find positive real numbers A and a such that 
Ant < n*—5n—-8 for every integer n = a. 


Because n* — 5n — 8 contains negative terms, you need to use a different technique 
from the one illustrated in Example 11.2.3(a). Two methods are shown below. The first 
relies on ad hoc calculations and the second describes a general procedure. 


Method 1 (Using ad hoc calculations): Let <> stand for the words “if, and only if,” 
and observe that 


1 
(*) xt 5n —5n-8 


n because adding or subtracting 5n + 8 — In’ 
to both sides of an inequality preserves 
the direction of the inequality 
(FR) <> =a kt = because dividing or multiplying both sides 
of an inequality by 2n>, which is positive, 


preserves the direction of the inequality. 


Because all the inequalities are equivalent (that is, each inequality is true if, and only 
if, all the others are true), any value of n that makes inequality (**) true makes in- 
equality (*) true also. A little trial and error shows that inequality (**) is true for every 
integer n = 3: 


< 


1 
Ifn = 3, then 3 a we = 32 3 27 


Hence, by transitivity of order and equality, 


n 


which is inequality (**). Therefore, inequality (*) is also true for n = 3: 


1 
z“ <n'—5n-8 for every integer n = 3. 


Let A = Ż and a = 3. Then 
Anf S nt—5n—8 for every integer n = a, 
and so, by definition of Q-notation, nt —5n—8 is Qn’). 
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Method 2 (Using a general procedure): 


Let m be a nonnegative integer, let P(n) be a polynomial of degree m, and suppose 
the coefficient a„ of n” is positive. 


To find big-Omega for P(n): Let A = Lap and let a be the number obtained as 
follows: 


1. Find the sum of the absolute values of all the coefficients of P(n) except 
foran. 


2 
2. Multiply the result of step 1 by a 
3. Let a be the larger of the number | and the result of step 2. 


Show that An” = P(n) for every integer n = a. 


To use the general procedure to show that n — 5n — 8 is Qn’), observe that the coef- 
ficient of its highest power is 1 and the sum of the absolute values of its other coeffi- 
cients is |—5| + |—8|. Thus you would take 


> wi w= Ala 
gw an = — = = 
2 a 


and note that 


2 
ae (|—5| +|—8]) = 26, which is greater than 1. 


Requiring n = a means that 


and multiplying both sides by in gives 


1 
z = (d=5] + |=8x 


= 5n? + 8n? 


= ín+8 because n = 1 and so 57° = 5n and 8n? = 8. 


Hence, by transitivity of order and equality, 
1 
Pi =5n+8 for every integer n = a. 


Subtracting the right-hand side from the left-hand side and adding in‘ to both sides 
gives 


1 ; 
n'—5n—-8 = =n’ for every integer n = a. 


Thus since A = L 
n'—5n—-8=An' for every integer n = a, 


and so, by definition of Q-notation, nt — 5n- 8 is Qn’). 
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b. To show that nf — 5n — 8 is O(n’), observe that for every integer n = 1, 


nî —5n-8 < nt +5n+8 because when n = 1, 5n + 8 is positive 


= nî + 5nÎ + 8n! by Theorem 11.2.2, since n = 1, then n S n 
andi = n’, and so 5n = 5n‘ and 8 < 8n* 
= 14n* because 1+5+8 = 14. 
Thus, by transitivity of order and equality, 
n*—5n—8 < 14n* for every integer n = 1. 
In addition, by part (a) 


1 : 
-nf =n'—5n—-8 for every integer n = 3, 


so since 0 = ini, transitivity of order gives that 

0 <n'—5n-8 < 14n for every integer n = 3. 
Let B = 14 and b = 3. Then 

0 <n*—5n—8 < Bn? for every integer n = b, 


and hence, by definition of O-notation, n'—5n-8is O(n’). E 


The results of Examples 11.2.3 and 11.2.4 can be used to find big-Thetas for 
15n? + 11n? +9 and n — 5n- 8. 


Finding a Big-Theta for a Polynomial Function 


a. Show that 157° + 11n? +9 is O(n’). 
b. Show that nf — 5n — 8 is O(n’). 


Solution 

a. By Example 11.2.3, 15n? + 11n? +9 is O(n*) and by Example 11.2.4, 15n? + 11n? +9 
is Q(n°). Thus by Theorem 11.2.1, 157° + 11n? +9 is OM’). 

b. By Example 11.2.4, nf — 5n — 8 is both Q (ní) and O(n). Thus, by Theorem 11.2.1, 
n — 5n — 8 is O(n’). E 


Theorem 11.2.3 A Limit on What Can Be Inferred from Big-O 


For any function f and positive real numbers r and s with r < s, 


if fn) is O(n’) then f(n) is O(n"). 


Proof: Suppose r and s are real numbers with r < s and f is a function such that 
f(n) is O(n”). By definition of O-notation, there exist positive real numbers B and b 
such that 


0 <f(n) S Bn” for every integer n = b. 


(continued on page 778) 
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Now by Theorem 11.2.2, 
Bn = Bn’ for every integer n = 1. 


Let b; be the larger of b and 1. Then 


0 < f(n) S Bn’ for every integer n = b}, 


and thus f(n) is O(n®). 


It follows from Theorem 11.2.3 that knowing a function is big-O of another function 
gives only partial information about how the function behaves. 


A Caution about O-notation 


Suppose a person finds that f(n) is O(n’) and that g(n) is O(n’). Because 4 < 5, this person 
might conclude that the graph of g lies below the graph of f for large values of n. Show that this 
is not necessarily the case by showing that there exist functions f and g such that f(n) is O(n’) 
and g(n) is own’), yet for large values of n the graph of f lies below, not above, the graph of g. 


Solution Let f(n) = 157° + 11n? +9 and g(n) = n*—5n—8. Since f(n) is O(n’), it fol- 
lows from Theorem 11.2.3 that f(n) is O(n’). Also by Example 11.2.3, g(n) is O(n’). How- 
ever, Figure 11.2.4, which shows both fand g, suggests that for values of n = 16, the graph 
of g lies above the graph of f. 


A Graph of g 


Graph of f 


3 T5 10 12.5 15 17.5 20 


FIGURE 11.2.4 


This result can be confirmed analytically by noting that 


(*) g(n) = fin) 
< n'—5n—8 = 15n +11? +9 
yee n= 15n?+11n?+5n+17 
11 5 17 
(**) <> Dg age 
n n n 
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So any value of n that satisfies inequality (**) also satisfies the inequality (*). Now when 
n= 16, then i = i: and so inequality (**) is satisfied because 
EN ee ee ee f int = 16 
—+ 5+ 58 at te or every integer n = 16. 
np w 16 167 16° ete 
Therefore 
n'—5n—8 = 15n +11n?+9 for every integer n = 16. 


Thus it is possible to find bands around each graph so that the band around the graph of 
g(n) lies entirely above the band around the graph of f(n) for every integer n = 16. In fact, 
the second derivative test from calculus shows that g(n) is growing at a faster rate than f(n), 
which explains why the graph of g(n) bends upward more steeply than the graph of f(n). 
Readers who have studied calculus can check this result themselves. E 


A related feature of O-notation is illustrated by Example 11.2.7. 


Showing That One Function Is Not Big-O of Some Other Function 


Let e(n) = nt — 5n- 8. By Example 11.2.4, g(n) is Qn’). Show g(n) is not O(n’) for any 
positive real number r < 4. 


Solution 
Suppose by way of contradiction that there exists a positive real number r such that r < 4 
and g(n) is O(n’). Then there exist positive real numbers B and b such that 


0S g(n) Bn” for every integer n = b. 
Since it is also the case that g(n) is Qn’), there exist positive real numbers A and a such that 
Ant = g(n) for every integer n = a. 
Let t be the larger of a and b. Then 
An* < g(n) < Bn” for every integer n = t. 


Let = stand for the words “which implies that.” It follows from the inequalities above that 
for every integer n = t, 


An* < Br’ by transitivity of order 
nt B a : : 
=> ee by dividing both sides by An 
n A 
= B 
> ns- by algebra 
A 
4-r 
> n= V B/A by taking the (4 — r)th root of both sides. (*) 


Since r < 4, then 4 — r > 0, which implies that ey, B/A is a fixed positive real number. But 
n can be greater than any fixed number, which implies that condition (*) is contradictory. 
In other words, the supposition results in a contradiction, and hence it is false. Therefore, 
for each positive real number r with r < 4, g(n) is not O(7’). B 


The next theorem generalizes the result of Example 11.2.7. The proof can be modeled on 


the solution to Example 11.2.7 and is left as exercise 21 at the end of this section. 
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Theorem 11.2.4 Showing That a Big-O Relationship Does Not Hold 


If f is a real-valued function defined on a set of nonnegative integers and f(n) is 
O(n”), where m is a positive integer, then f(n) is not O(n”) for any positive real num- 
ber p < m. 


Because ©-notation, unlike O-notation, gives “tight bounds” on function values, you 
can count on it to give precise results for comparing function growths. In Example 11.2.5 
@-notations were found for 157° + 11n? + 9 and nt — 5n — 8. In both cases the order of the 
polynomial function was the power function corresponding to the highest power in the 
polynomial. The theorem on polynomial orders shows that this result is not an accident. 
The simplest proof uses the idea of limit from calculus and is included below. Exercise 25 
at the end of the section asks for a proof based on the techniques from Example 11.2.4, 
which does not rely on calculus. 


Theorem 11.2.5 On Polynomial Orders 


If m is any integer with m = 0 and do, aj, ao, . . . , Am are real numbers with a,, > 0, 
then a,,n”" + a,_,—\n" |+ +++ +a;n+agis O(n”). 


Proof (using limits): Suppose m is an integer with m = O and suppose do, 41, d2,... , 
am are real numbers with a,, > 0. Because lim, sehh = 0 for every integer i = 1, 


m 
n 


n=% 


=i -7 
i (“= Nn" + dy" tapat +++> + ayant 2) 
im 


= m 
n=% n m-l n 


ay- m= a a 
fin ag + Es BE +S) 


= am 


By definition of limit, this implies that for any real number e > 0, there exists an 
integer K such that 


Am-1 Am—2 aj ag : 
a, =e Sat F tees e < ampte for every integer n > K. 
m m n n? n" 1 n” m 


. am : ; 
In particular, when £ = >, there is an integer k such that 


a Am-1 Am-2 ay ado a é 
Am7 a< an F ee F-n Say — for every integer n > k. 
2 n n 2 


n yt! 


Combining like terms and multiplying all parts of the inequality by n” gives that 


m 


a _ 3a . 
(2) < apn” +ap n"! +-+ an+ ag < ( a p” for every integer n > k. 


m 3 m 
Let A = “ and B = 5 . Then 


An” < apn” + Gyn” | ++ antag < Bn” for every integer n > k. 


Therefore, by definition of ©-notation, 


—1 . 
Ay" + Ayn" + + an + agis O(n". 
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Calculating Polynomial Orders Using the Theorem on Polynomial Orders 


Use the theorem on polynomial orders to find orders for the functions given by the follow- 
ing formulas. 


a. f(n) = In +5n3>—n+4 foreach positive integer n. 


(n— 1)\(n+1) tga de 
b. g(n) = EE aa for each positive integer n. 


Solution 


a. By direct application of the theorem on polynomial orders, Tn? + 5n? —n +4 is Om’). 


(n—1)(n+1) 
S O 
1 
= 0w- 
eS p 
a" 4 y algebra. 
Thus g(n) is O(n’) m 


An Order for the Sum of the First n Integers 


Sums of the form 1 +2 +3 + +- +n arise in the analysis of computer algorithms such as 
selection sort. Show that for a positive integer variable n, 


1+2+3+ = +nis On”). 


Solution According to the formula for the sum of the first n integers (see Theorem 5.2.1), 
for each positive integer n, 


7 n(n + 1) 


1+24+34---+n 3 


Now 


nnt+1) 1g 1 
=—n'+—n by basic algebra. 
2 2 2 


And, by the theorem on polynomial orders, 


Hence 


14+24+3+-::4+n is OM). m 


We end this section by stating some theorems that give useful properties of order nota- 
tions. These will be applied in Section 11.4. Two sample proofs are given, with the rest 
being left to the exercises at the end of the section. 
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Theorem 11.2.6 Reciprocal Relationship between Q- and O-notations 


Let fand g be real-valued functions defined on the same set of nonnegative integers, 
and suppose there is a positive real number r such that f(n) = 0 and g(n) = 0 for 
every integer n = r. Then: 

a. If f(n) is O(g(@™), then g(n) is Of). 

b. If g(n) is O(f (n), then f (n) is Qe). 


Theorem 11.2.7 Reflexive, Symmetric, and Transitive Properties 
of ©-notation 


Let f, g, and h be real-valued functions defined on the same set of nonnegative inte- 
gers, and suppose there is a positive real number r such that f(n) = 0, g(n) = 0 and 
h(n) = 0, for every integer n = r. Then: 

a. fn) is Ofn). 

b. Iffin) is O(g(n)), then g(n) is Of). 

c. Iff(n) is O(g(n)) and g(n) is O(A(~)), then f(n) is Ohn). 


Theorem 11.2.8 Effect of Constants on Order Notations 


Let fand g be real-valued functions defined on the same set of nonnegative integers, 
and suppose there is a positive real number r such that f(n) = 0 and g(n) = 0 for 
every integer n = r. 

Then for every positive real number c: 
a. If An) is O(g(@™), then cf(n) is O(g(n)); 
b. Iffin) is Olen), then cf(n) is O(g(n)); 
c. If fn) is O(g(n)), then cfm) is O(g(@). 


Theorem 11.2.9 Orders of Sums and Products of Functions 


Let fi». f2, 81, and g, be real-valued functions defined on the same set of nonnegative 

integers, and suppose there is a positive real number r such that fi(n) = 0, f(n) = 0, 

g,(n) = 0, and go(n) = 0 for every integer n = r. Then: 

a. If fi(n) is O(g) and faln) is O(g), then fin) + fan) is O(g(n)). 

b. If fi(n) is O(g\(n)) and fo(n) is O(820), then Cif) is O(gi() go). 

c. Iffin) is O(n) and f(n) is O(g.(n)) and if there is a real number s so that 
g\(n) S g,(n) for every integer n = s, then (fi(n) + fo(n)) is O(g2(n)). 


Proof of Theorem 11.2.6(a) 


Let f and g be real-valued functions defined on the same set of nonnegative integers 
and suppose there is a positive real number r such that g(n) = 0 for every integer 
n = r. Suppose also that f(n) is O.(g(m)). We will show that g(n) is O(f (n). By defini- 
tion of Q-notation, there are positive real numbers A and a such that a = r, and 


Ag(n) = fn) for every integer n = a. 
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Divide both sides by A to obtain 


for every integer n = a. 


gn) = i (n) 
In addition, since a = r, 
0 <= g(n) for every integer n = a. 
Let B = 1/A and b = a. Then B and b are positive real numbers and 
0 S g(n) = Bf(n) for every integer n = b, 


and so g(n) is O(f(n)) by definition of O-notation /as was to be shown]. 


Proof of Theorem 11.2.7(c) 


Suppose f, g, and h are real-valued functions defined on the same set of nonnegative 
integers, and suppose there is a positive real number r such that f(n) = 0, g(n) = 0, 
and h(n) = 0, for every integer n = r. Suppose also that f(n) is O(g(n)) and g(n) is 
O(h(n)). We will show that fn) is O(i(~)). By definition of @-notation, there exist 
positive real numbers A, B4, kı, A2, Bo, and ky with kj = r and k = r, and 


Ajg(n) = fin) = Byg(n) for every integer n = kı 
and 
A,h(n) = g(n) = Byh(n) for every integer n = ky. 


Let A = A,A>, B = B,B,, and k = max(k,, k2). Then, by transitivity of order and 
equality, for every integer n = k, 


Ah(n) = A\(Ajh(n)) = Aig) = fin) = Byg(n) = By (Byh(n)) = Bh(n), 


and so, by definition of ®-notation, f(n) is O(A(n)) [as was to be shown]. 


TEST YOURSELF 


1. A sentence of the form “Ag(n) = f(n) for every 
n = a” translates into Q-notation as ___ 


2. A sentence of the form “0 = f(n) = Bg(n) for 
every n = b” translates into O-notation as 


3. A sentence of the form “Ag(n) = f(n) = Bg(n) for 
every n = k” translates into ©-notation as 


. Whenn= l,n 


S 


2 2 
n andn n. 


. According to the theorem on polynomial orders, if 


p(n) is a polynomial in n, then p(n) is O(n”), where 
mis 


. Ifnis a positive integer, then 1+2+3+ °°: +n 


has order 
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EXERCISE SET 11.2 


1. The following is a formal definition for Q- 
notation, written using quantifiers and variables: 
f(n) is O(g(n)) if, and only if, 3 positive real 
numbers a and A such that Vn = a, 


Ag(n) = f(n). 


a. Write the formal negation for the definition 
using the symbols V and 3. 

b. Restate the negation less formally without us- 
ing the symbols V and J or the words “for any,” 
“for every,” or “there exists.” 


2. The following is a formal definition for O-notation, 
written using quantifiers and variables: f(n) is 
O(g(n)) if, and only if, 3 positive real numbers b 
and B such that Y n = b, 


0 = f(n) = Bgn). 


a. Write the formal negation for the definition 
using the symbols V and 3. 

b. Restate the negation less formally without us- 
ing the symbols V and 4 or the words “for any,” 
“for every,” or “there exists.” 


3. The following is a formal definition for @-notation, 
written using quantifiers and variables: f(n) is 
@(g(n)) if, and only if, 3 positive real numbers k, A, 
and B such that Vn = k, 


Ag(n) = f(n) = Bg(n). 


a. Write the formal negation for the definition 
using the symbols V and 3. 

b. Restate the negation less formally without us- 
ing the symbols V and 4 or the words “for any,” 
“for every,” or “there exists.” 


In 4-9, express each statement using Q-, O-, or 
Q-notation. 


1 
4. z” £ m= l+ 1 for every integer n = 1. 


(Use Q-notation.) 


5 0=n [24 1 <n for every integer n = 3. 
(Use O-notation.) 


6. n <3n(n—2) = 4n? for every integer n = 3. 
(Use O-notation.) 
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10. 


1. 


12. 


13. 


14. 


“2°. 3 


1 > _ n@Gn-2) : 
—n = ——— for every integer n = 3. 


(Use ()-notation.) 


n(3n — 2) 2 ; 
= Ta =n’ for every integer n = 1. 


(Use O-notation.) 


w fel 3 j 
6 =n 3 1} <n’ for every integer n = 2. 
(Use @-notation.) 


a. Show that for any integer n = 1, 
0 S 2n*+15n+4 = 21n. 

b. Show that for any integer n = 1, 
2n* < 2n? +15n+4. 

c. Sketch a graph to illustrate the results of parts 
(a) and (b). 

d. Use the O- and Q-notations to express the 
results of parts (a) and (b). 

e. What can you deduce about the order of 
2n? + 15n+ 4? 


a. Show that for any integer n = 1, 
0 < 23n'+8n°+4n < 35n’. 

b. Show that for any integer n = 1, 
23n* < 23n* + 8n + 4n. 

c. Sketch a graph to illustrate the result of parts 
(a) and (b). 

d. Use the O- and (-notations to express the 
results of parts (a) and (b). 

e. What can you deduce about the order of 


23n* + 8n? + 4n? 


a. Show that for any integer n = 1, 
0 < 7n? + 10n? +3 < 207°. 

b. Show that for any integer n = 1, 
Tn? < Tn? + 10n? +3. 

c. Sketch a graph to illustrate the result of parts 
(a) and (b). 

d. Use the O- and Q-notations to express the 
results of parts (a) and (b). 

e. What can you deduce about the order of 


Tn? + 10n? +3? 


Use the definition of @-notation to show that 
5n? + 65n + 30 is @(n°). 


Use the definition of @-notation to show that 
n? + 100n + 88 is O(n”). 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


Use the definition of ©-notation to show that 


bi + H is O(n). 


Use the definition of ©-notation to show that 


n+ J is O(n). 


Use the definition of ©-notation to show that 


z is @(n). (Hint: Show that if n = 4, then 


= 1> in) 
Prove Theorem 11.2.7(b): If fand g are real-valued 
functions defined on the same set of nonnegative 
integers and if f(n) = 0 and g(n) = 0 for every 
integer n = r, where r is a positive real number, 
then if f(n) is O(g(@), then g(n) is Of). 


Prove Theorem 11.2.1: If fand g are real-valued 
functions defined on the same set of nonnegative 
integers and if f(n) = 0 and g(n) = 0 for every 
integer n = r, where r is a positive real number, 
then f(n) is O(g() if, and only if, f(n) is O(g@™) 
and f(n) is O(g(n)). 


Without using Theorem 11.2.4 prove that n’ is not 
O(n’). 


Prove Theorem 11.2.4: If fis a real-valued func- 
tion defined on a set of nonnegative integers and 
fn) is O(n”), where m is a positive integer, then 
f(n) is not O(n”) for any positive real number 
p<m. 


a. Use one of the methods of Example 11.2.4 to 
show that 2n* — 90n? + 3 is Q^). 

b. Show that 2n*—90n? + 3 is O(n’). 

c. Justify the conclusion that 2n* — 907? + 3 is 
On’). 

a. Use one of the methods of Example 11.2.4 to 
show that n? — 42n — 8 is O(n’). 

b. Show that $n? — 42n — 8 is O(n’). 

c. Justify the conclusion that in —42n— 8 is 
@(n’). 

a. Use one of the methods of Example 11.2.4 to 
show that n° — 507°? + 3n + 12 is Qn’). 

b. Show that n° — 507° +3n+ 12 is O(n’). 


c. Justify the conclusion that im —50n? + 3n+ 12 
is O(n’). 
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25. Suppose 
P(n) = apn” + dyn | + +++ + an? + ayn +t ap, 


where all the coefficients ao, ay, . . 

numbers and a,, > 0. 

a. Prove that P(n) is O(n”) by using the general 
procedure described in Example 11.2.4. 

b. Prove that P(n) is O(n"). 

c. Justify the conclusion that P(n) is O(n”). 


. , Am are real 


Use the theorem on polynomial orders to prove each of 
the statements in 26-31. 


26. 


27. 


28. 


29. 


30. 


31. 


eae) a =) GP) 


3a 1) is @(n?) 


n(n — 1) 


+3n is @(n’) 


n(n—1)(2n+ 1). 
——— a i 
n(n + 1) 
lee 


s Om’) 


2 
l is O(n’) 


n(n + 1) 
2 T 


2(n 


1)+ qe 1) 


5 is O(n’) 


Prove each of the statements in 32-39. Use the theorem 


on 


polynomial orders and results from the theorems and 


exercises in Section 5.2 as appropriate. 


32. 
33. 
34. 
35. 


36. 


37. 


H 38. 


39. 


40 


174+274+37+ +n? is O0?) 
P23 + --- +n is ONS 
24+44+6+ =- +2nis O(n’) 
5+104+15+20+25+ --- +5nis O(n’) 


S (4i—9) is O(n’) 
i=1 


Sk+ 3) is O(n’) 
k=1 
SiG+ 1) is O°) 


i=1 
SY & = 2k) is O(n’) 
k=3 


. a. Prove: If c is a positive real number and if f 
is a real-valued function defined on a set of 
nonnegative integers with f(n) = 0 for every 
integer n greater than or equal to some positive 
real number, then cf(n) is O(f(n)). 
b. Use part (a) to show that 3n is O(n). 
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41. Prove: If c is a positive real number and f(n) = c and there is a real number r such that f(n) = 0 for 
for every integer n = 1, then f(n) is O(1). every integer n = r, then f(n) is O(f(n)). 
42. What can you say about a function f with the prop- 50. Prove Theorem 11.2.8: 
erty that fn) is O(1)? a. Let fand g be real-valued functions defined 
on the same set of nonnegative integers, 
Use Theorems 11.2.5-11.2.9 and the results of exercises and suppose there is a positive real number 
15-17, 40, and 41 to justify the statements in 43-45. r such that f(n) = 0 and g(n) > 0 for every 
n+l integer n = r. If f(n) is O(g(n)) and c is any 
43. =] +3n is O(n) positive real number, then cf(n) is O(g(n)). 
b. Let fand g be real-valued functions defined 
n(n—=1) |n , j on the same set of nonnegative integers, and 
44. -> F zl +1 is O(n") suppose there is a positive real number r such 


that f(n) = 0 and g(n) = 0 for every integer 
n , n =r. If f(n) is O(g(n)) and c is any positive 

45. B +4n+3 is O(n) real number, then cf(n) is O(g(n)). 

c. Let fand g be real-valued functions defined 
on the same set of nonnegative integers, and 
suppose there is a positive real number r such 
that f(n) = 0 and g(n) = 0 for every integer 
n =r. If f(n) is O(g(n)) and c is any positive 
real number, then cf(n) is O(g(n)). 


46. a. Use mathematical induction to prove that if n 
is any integer with n = 1, then for every inte- 
germ = 1, n” = 1. 
b. Prove that if n is any integer with n = 1, then 
n” <n’ for all integers r and s with r Ss. 
51. Prove Theorem 11.2.9: 
Ha. Let fi, fo, and g be real-valued functions defined 
n on the same set of nonnegative integers, and 


integer m= 1 „ifx = l thenx = 1. n suppose there is a positive real number r such 
H b. Explain how it follows from part (a) that if x is that fi(n) = 0, f(n) = 0, and g(n) = 0 for 
any positive real number, then for every integer i 


47. a. Let x be any positive real number. Use math- 
ematical induction to prove that for every 


aes every integer n = r. If fi(n) is O(g(n)) and fa) is 
lie aries ea, a @(g(n), then (fi(n) + fa(n)) is (gin). 

c. Explain how it follows from part (b) that if x is b. Let f;,f>, gı, and g% be real-valued functions 
any positive real number, then for every integer defined on the same set of nonnegative inte- 


: 1/m 
m= l, ifx > 1 then x f ie 1. gers, and suppose there is a positive real num- 
d. Let p, q, r, and s be positive integers, and sup- ber r such that fi(n) = 0, f(n) = 0, gin) = 0, 


pose p/q > r/s. Use part (c) and the result of and g,(n) = 0 for every integer n = r. If fi(n) is 


exercise 40 to prove Theorem 11.2.2. In other O(g) and f,(n) is OCg20), then (FDF) is 
words, show that for any integer n, if n > 1 Oleg Nn). 


plq ris 
then n”1 > n”. c. Let fi, fo, g1, and g, be real-valued func- 


tions defined on the same set of nonnega- 
tive integers, and suppose there is a positive 
real number r such that fi(n) = 0, f(n) = 0, 
g\(n) = 0, and g2(n) = 0 for every integer 
n = r. If fi(n) is @(g,(n)) and f,(n) is O(g2(n)) 


48. Prove Theorem 11.2.6(b): If f and g are real- 
valued functions defined on the same set of 
nonnegative integers, and if there is a positive 
real number r such that f(n) = 0 and g(n) = 0 for 


every integer n = r, and if gln) is O), then and if there is a real number s so that 


fn) is Mata). g\(n) = g»(n) for every integer n = s, then 
49. Prove Theorem 11.2.7(a): If fis a real-valued F + AHN) is O(g2(n)). 
function defined on a set of nonnegative integers 


ANSWERS FOR TEST YOURSELF 
1. fM is Qen) 2.f(n) is Olm 3.ft) is Olen) 4.5; = 5. the degree of p(n) 6. n? 
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EEE Application: Analysis of Algorithm Efficiency | 


It is convenient to have a measure of the amount of work involved in a computing 
process, even though it be a very crude one. ... We might, for instance, count the 
number of additions, subtractions, multiplications, divisions, recording of numbers... 
—Alan Turing, 1948 


Starting in the late 1940s a number of mathematicians and computer scientists contributed 
to developing formal techniques for analyzing computer algorithms. As indicated by the 
quotation at the beginning of this section, Alan Turing may have been the first to suggest 
Alan Turing a concrete way for doing this. In the early 1960s, Donald Knuth began the process of ex- 
(1912-1954) panding upon his own work and the work of others into a series of volumes titled The Art 

of Computer Programming.’ The first three volumes are in their third edition, a fourth vol- 
Note For more about the ume is being published in parts, and the fifth through seventh volumes are in preparation. 
work of Alan Turing, see The books are providing a solid and extensive foundation for computer science that is both 
peeHOn seer an bo) elegant and mathematically rigorous. 


Science History Images/Alamy Stock Photo 


The Sequential Search Algorithm 


The object of a search algorithm is to hunt through an array of data in an attempt to find 
a particular item x. In a sequential search, x is compared to the first item in the array, 
then to the second, then to the third, and so on. The search is stopped if a match is found 
at any stage. On the other hand, if the entire array is processed without finding a match, 
then x is not in the array. An example of a sequential search is shown diagrammatically in 
Figure 11.3.1. 


Bbva/Notimex/Newscom 


Donald Knuth 
(born 1938) 


a[l] a[2] a{3] a4] a{5] a6] a{7] 
m=: no a=: no a=: no a=: no 
yes 


FIGURE 11.3.1 Sequential Search of a[1], a[2],..., a[7] for x where x = a[5] 


Best-, Worst-, and Average-Case Orders for Sequential Search 


Find best, worst-, and average-case orders for the sequential search algorithm from among 
the set of power functions. 


Solution Suppose the sequential search algorithm is applied to an input array a[l], 
a[2],..., a[n] to find an item x. In the best case, the algorithm requires only one comparison 
between x and the items in a[1], a[2],..., a[n]. This occurs when x is the first item in the array. 


*Quarterly Journal of Mechanics and Applied Mathematics, vol. 1 (1948), pp. 287-308. 


‘Donald E. Knuth, The Art of Computer Programming, vol. 1: Fundamental Algorithms, 3rd ed. (1997); vol. 2: 
Seminumerical Algorithms, 3rd ed. (1997); vol. 3: Searching and Sorting, 2nd ed. (1998); vol. 4a: Combinato- 
rial Algorithms Part 1, 1st ed. (2011); vol. 4: Fascicle 6: Satisfiability, 1st ed. (2015) (Addison-Wesley). 
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Thus in the best case, the sequential search algorithm is @(1). (Note that ©(1) = Qn”), 
In the worst case, however, the algorithm requires n comparisons. This occurs when x = a[n] or 
when x does not appear in the array at all. Thus in the worst case, the sequential search algorithm 
is O(n). Finally, because x is as likely to be in the first half of the array as in the second half, the 
algorithm requires an average of 5 comparisons. Since 2 is ©(n), the algorithm’s average-case 
performance is also O(n). a 


Measuring the Efficiency of an Algorithm 


Two aspects of algorithm efficiency are important: the amount of time required to execute 
the algorithm and the amount of memory space needed when it is run. In this chapter we 
introduce basic techniques for calculating time efficiency. Similar techniques exist for 
calculating space efficiency. Occasionally, one algorithm may make more efficient use of 
time but less efficient use of memory space than another, forcing a trade-off based on the 
resources available to the user. 

How can the time efficiency of an algorithm be calculated? The answer depends on 
several factors. One is the size of the set of data that is input to the algorithm; for ex- 
ample, it takes longer for a sort algorithm to process 1,000,000 items than 100 items. 
Consequently, the execution time of an algorithm is generally expressed as a function of 
its input size. 

Roughly speaking, the analysis of an algorithm for time efficiency begins by trying 
to count the number of elementary operations that must be performed when the algo- 
rithm is executed with an input of size n (in the best case, worst case, or average case). 
What is classified as an “elementary operation” may vary depending on the nature 
of the problem the algorithms being compared are designed to solve. For instance, to 
compare two algorithms for evaluating a polynomial, the crucial issue is the number 
of additions and multiplications that are needed, whereas to compare two algorithms 
for searching a list to find a particular element, the important distinction is the num- 
ber of comparisons that are required. As is common, we will classify the following as 
elementary operations: addition, subtraction, multiplication, division, and compari- 
sons that are indicated explicitly in an if-then statement using one of the relational sym- 
bols <, =, >, =, =, or +. 

When algorithms are implemented in a particular programming language and run on 
a particular computer, some operations are executed faster than others, and, of course, 
there are differences in execution times from one machine to another. In certain practical 
situations these factors are taken into account when deciding which algorithm or which 
machine to use to solve a particular problem. In other cases, however, the machine is fixed, 
and rough estimates are all that are needed to determine the clear superiority of one algo- 
rithm over another. Since each elementary operation is executed in time no longer than the 
slowest, the time efficiency of an algorithm is approximately proportional to the number 
of elementary operations required to execute the algorithm. 

Consider the example of two algorithms, A and B, designed to do a certain job. Sup- 
pose that for an input of size n, the number of elementary operations needed to perform 
algorithm A is between 10n and 20n (at least for large n), and the number of elementary 
operations needed to perform algorithm B is between 2n? and 4n*. We say that in the worst 
case, algorithm A is O(n) (or has worst-case order n) and that in the worst case, algorithm 
Bis @(n’) (or has worst-case order n). 

To compare the efficiencies of A and B, let Max A be the maximum number of opera- 
tions needed to execute algorithm A, and let Min B be the minimum number of operations 
needed to execute algorithm B. Then Max A is 20n and Max B is 2n*. Table 11.3.1 shows 
the differences between algorithms A and B for larger and larger values of n. 
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TABLE 11.3.1 Number of Elementary Operations for Algorithms A and B 


Max A Min B Max B — Max A 
n 20n 2n? 2n? — 20n 
1,000 20,000 2,000,000 1,980,000 
10,000 200,000 200,000,000 199,800,000 
1,000,000 20,000,000 2,000,000,000,000 1,999,980,000,000 


For example, to accomplish a job with an input size of 1,000,000, algorithm B requires 
almost 2 trillion more elementary operations than algorithm A. One way to look at this is 
that algorithm B has to perform 100,000 operations for each single operation performed by 
algorithm A. 


Definition 


Let A be an algorithm. 


1. Suppose the number of elementary operations performed when A is executed for 
an input of size n depends on n alone and not on the nature of the input data; say 
it equals f(n). If f(n) is O(g(n)), we say that A is O(g(m)) or A is of order g(n). 


2. Suppose the number of elementary operations performed when A is executed for 
an input of size n depends on the nature of the input data as well as on n. 


a. Let b(n) be the minimum number of elementary operations required to execute 
A for all possible input sets of size n. If b(n) is O(g(n)), we say that in the best 
case, A is O©(g(n)) or A has a best-case order of g(n). 

b. Let w(n) be the maximum number of elementary operations required to ex- 


ecute A for all possible input sets of size n. If w(n) is O(g(n)), we say that in the 
worst case, A is ©(g(n)) or A has a worst-case order of g(n). 


Some of the orders most commonly used to describe algorithm efficiencies are shown 
in Table 11.3.2. As you see from the table, differences between the orders of various types 
of algorithms can be more than astronomical. For instance, the time required for an algo- 
rithm of order 2” to operate on a data set of size 100,000 is approximately 10°06 times the 
estimated 13.8 billion years since the universe began (according to one theory of cosmol- 
ogy). On the other hand, an algorithm of order log, n needs at most a fraction of a second 
to process the same data set. 


TABLE 11.3.2 Time Comparisons of Some Algorithm Orders 


Approximate Time to Execute f(n) Operations Assuming One Operation per Nanosecond* 
fn) n = 10 n = 1,000 n = 100,000 n = 10,000,000 
log, n 3.3 X 107° sec 107° sec 1.7 X 10 8 sec 2.3 X 107° sec 
n 1078 sec 10 ° sec 0.0001 sec 0.01 sec 
n log, n 3.3 X 107° sec 1077 sec 0.0017 sec 0.23 sec 
n’ 107” sec 0.001 sec 10 sec 27.8 hr 
n? 10 ° sec 1 sec 11.6 days 31,688 yr 
ar 10~° sec 3.4 x 10°84 yr 3.1 X 10° yr 2.9 X 100083 yr 


“one nanosecond = 107° second, one year = 365.25 days 
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Computing an Order of an Algorithm Segment 


Assume n is a positive integer and consider the following algorithm segment: 


p:=0,x:=2 
for i := 2 ton 

p:=(pti-x 
next i 


a. Compute the actual number of elementary operations that are performed when this 
algorithm segment is executed. 


b. Use the theorem on polynomial orders to find an order for this algorithm segment. 


Solution 


a. This algorithm segment has just one loop, which goes from 2 to n, and so there are as 
many iterations of the loop as there are integers from 2 ton, namely n-2+1 =n-1." 
During each iteration, one multiplication and one addition are performed. Thus twice 
as many elementary operations are performed as there are iterations of the loop, and 
hence 2(n — 1) = 2n — 2 elementary operations are performed when the algorithm 
segment is executed. 


b. By the theorem on polynomial orders, 
2n-2 is O(n), 


and so this algorithm segment is O(7). E 


The next example looks at an algorithm segment that contains a nested loop. 


An Order for an Algorithm with a Nested Loop 
Assume 7 is a positive integer and consider the following algorithm segment: 
s:=0 
for i:= 1 ton 
for j:= 1 toi 


J 
next j 


it “fj = ithens:= 541 


next 7 


a. Compute the actual number of elementary operations that are performed when this 
algorithm segment is executed. 

b. Use the theorem on polynomial orders to find an order for this algorithm segment. 

Solution 


a. Each iteration of the inner loop requires one division and one multiplication, so the 
total number of elementary operations is twice the number of iterations of the inner 
loop. Now the inner loop is iterated 


one time when i = 1 


two times when i = 2 


*See “Counting the Elements of a List” in Section 9.1 to review this fact. 
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three times when i = 3 


n times when i = n. 


You can see this easily if you construct a table that shows the values of i and j for 
which the statements in the inner loop are executed. Each column in the table repre- 
sents one iteration. 


l 1 2 >| 3 si A > n a 
J 1 1 2 1 2 3 1 2 3 4 1 2 3 n 
wA aA M es SS, Iaat | m 
1 2 3 4 n 


Hence the total number of iterations of the inner loop is 


_ n(n + 1) 


1+2+3+ +n 7 


by Theorem 5.2.1, 


and so the number of elementary operations performed is 


gT =n(n+ 1). 


An alternative method for computing the number of columns of the table uses an ap- 
proach discussed in Example 9.6.3. Observe that the number of columns in the table is the 
same as the number of ways to place two X’s in n categories, 1, 2, . . ., n, where the loca- 
tion of the <’s indicates the values of į and j with j = i. By Theorem 9.6.1, this number is 


n—-1+2\ [ntl (n+ 1)! (n+1)n(n— 1)! ntn+1) 
2 N2 J MAn)!  An=D 2 


Although, for this example, the alternative method is more complicated than the one pre- 
ceding it, it is simpler when the number of loop nestings exceeds two. (See exercise 19.) 


b. By the theorem on polynomial orders, n(n + 1) = n? +n is O(n’), and so this algo- 
rithm segment is O(n’). E 


When the Number of Iterations Depends on the Floor Function 
Assume n is a positive integer and consider the following algorithm segment: 
for i := |n/2] ton 
a:=n—i 
next i 


a. Compute the actual number of elementary operations that are performed when this 
algorithm segment is executed. 


b. Use the theorem on polynomial orders to find an order for this algorithm segment. 


Solution 

a. Each iteration of the loop requires one subtraction, and the loop is iterated as many 
times as there are integers from |n/2] to n, namely, n — |5] + 1 times. If n is even, then 
[5] = me and so the number of elementary operations performed is 


n n n+2 
n tl=n +1 = : 
2, 2 2 
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If n is odd, then |5] = a L and so the number of elementary operations performed is 


n n-1 2n-—(n-1)+2 n+3 
n +l=n +1 = = : 
2 2 2 2 


b. By the theorem on polynomial orders, 


n+2 


is O(n) and à is O(n) 


also. Hence, regardless of whether n is even or odd, this algorithm segment is ©(n). E 


The Insertion Sort Algorithm 


Insertion sort is an algorithm for arranging the items in an array into ascending order. Ini- 
tially, the second item is compared to the first. If the second item is less than the first, their 
values are interchanged, and as a result the first two array items are in ascending order. 
The idea of the algorithm is gradually to lengthen the section of the array that is known 
to be in ascending order by inserting each subsequent array item into its correct position 
relative to the preceding ones. When the last item has been placed, the entire array is in 
ascending order. 

Figure 11.3.2 illustrates the action of step k of insertion on an array a[1], a[2], a[3],..., 
ajn]. 


sorted subarray 


a[l], a[2], a[3] P alk- 1, alk], a[k+1],..., aln] 


t 


Step k: Insert the value of a[k] into its proper position relative to 
AT AZ); sses a({k—1]. At the end of this step a[1], a[2],..., a[k] is sorted. 


FIGURE 11.3.2 Step k of Insertion Sort 


The following is a formal algorithm for insertion sort. 


Algorithm 11.3.1 Insertion Sort 


[The aim of this algorithm is to take an array a{\], a[2], a[3],..., a[n], where n = 1, 
and reorder it. The output array is also denoted a[l], a[2], a[3],..., a[n]. It has the 
same values as the input array, but they are in ascending order. In the kth step, a[l], 
a(2], a[3],..., a[k—1] is in ascending order, and a[k] is inserted into the correct posi- 
tion with respect to it.] 


Input: n [a positive integer], a[1], a[2], a[3],..., a[n] [an array of data items capable 
of being ordered] 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


11.3 APPLICATION: ANALYSIS OF ALGORITHM EFFICIENCY | 793 


Algorithm Body: 
for k := 2 ton 


[Compare a[k] to previous items in the array a[1], a[2], a[3],..., a[k— 1], starting from 
the largest and moving downward. Whenever afk] is less than a preceding array item, 
the indexes of a[k] and the preceding item are switched. As soon as alk] is greater than 
or equal to an array item, the value of afk] is left unchanged.] 

x := afk] 

j:=k-l1 

while (j # 0) 

if x < a[j] then 


a[j+ 1] := a[j] 
al j] :=x 
j:=j—1 
else j := 0 
end if 
end while 


next k 


Output: a[l], a[2], a[3], ..., a[n] [in ascending order] 


Figure 11.3.3 shows the result of each step when insertion sort is applied to the particu- 
lar array 


a[l] =6, a[2]=3, al3]=5, al4)=7, al5] =2. 


Initial 


a[l] 


a[3] a[4] 


7 


a[5] 
2 


a[2] 
3 


The top row of the table shows the initial values of the 


Result of step 1 3 6 5 7 2 array, and the bottom row shows the final values. The result 
Result of step 2 3 5 6 7 2 of each step is shown in a separate row. For each step, 
the sorted section of the array is shaded. 
Result of step 3 3 5 6 7 2 
Result of step 4 2 3 5 6 7 


Example 11.3.5 


FIGURE 11.3.3 Action of Insertion Sort on an Array 


Example 11.3.5 develops a trace table for the action of insertion sort on a particular array. 


A Trace Table for Insertion Sort 
Construct a trace table showing the action of insertion sort on the array 
a[l] = 6, a[2] = 3, a[3] = 5, a[4] = 7, a[5] = 2. 


Solution 


The first column shows the state of the variables before the first iteration of the for-next 
loop. When the for-next loop is first iterated, k is assigned the value 2; x the value of a[2], 
which is 3; and j the value of k — 1, which is 1. Because j # 0, the while loop is entered 
and the condition for the if-then-else statement is tested. Because a[1] > x, then a[2] is 
assigned the value of a[1], which is 6, j is assigned the value of j — 1, which is 0, and a[1] is 
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assigned the value of x, which is 3. The condition governing the while loop is tested again, 
but since j = 0, it is not satisfied, and so the while loop is not entered. Then the value of 
k is incremented by 1 (so that it equals 3), and the for-next loop is entered a second time. 
This process continues until the value of k has been incremented to 6. Because 6 is greater 
than the top value in the for-next loop, execution of the algorithm ceases, and the array 
items are in ascending order. 


a 

EN 

— 
=lolnlnialalwla]u 

a 


Finding a Worst-Case Order for Insertion Sort 


a. When insertion sort is applied to the array a[1], a[2], a[3],..., a[n], what is the maxi- 
mum number of comparisons if those that control the while and for-next loops are 
included in the count? 


b. Use the theorem on polynomial orders to find a worst-case order for insertion sort. 


Solution 

a. In each iteration of the while loop, one explicit comparison is made to test whether 
alj] > x. During the time that a[k] is put into position relative to a[1], a[2],..., 
a(k — 1], the maximum number of attempted iterations of the while loop is k. This 
happens when afk] is less than every a[1], a[2],..., a[k— 1] and results in k— 1 
comparisons. Then when the kth iteration is attempted, a comparison results in set- 
ting j = 0 and so the condition of the while loop is not satisfied. Thus the maximum 
number of comparisons for a given value of k is k— 1. Now because k goes from 2 to 
n, the maximum total number of comparisons occurs when the items in the array are 
in reverse order, and it equals 


24+34+:--+n=(1424+34+:::+n)-1 by adding and subtracting 1 
n(n + 1) 
= a l by Theorem 5.2.1 
_ nn) r 
2 
= 7? +—n-1 by algebra. 
2 2 


b. By the theorem on polynomial orders, in T in — 1 is @(n”), and so the insertion sort 
algorithm has worst-case order O(n). E 
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The definition of expected value that was introduced in Section 9.8 can be used to find 
an average-case order for insertion sort. 


ZEWJGIEYA Finding an Average-Case Order for Insertion Sort 


a. What is the average number of comparisons that are performed when insertion sort is 
applied to the array a[1], a[2], a[3],..., a[n]? 


b. Use the theorem on polynomial orders to find an average-case order for insertion sort. 


Solution 
a. For any positive integer n, let E,, be the average, or expected, number of comparisons 
used to sort a[1], a[2],..., a[n] with insertion sort. Then for each integer k = 2, 3,..., n, 


the expected number of 
comparisons used to 
sort a[1], a[2],..., a[k] 


the expected number of the expected number of comparisons 
= | comparisons used to +] used to place a[k] into position 
sort a[1], a[2],..., a[k—1] relative to a[1], a[2],..., a[k—1] 


Thus 


the expected number of comparisons 
E; = Ex, + | used to place a[k] into position 
relative to a[1], a[2],..., a[k—1] 


Also, E; = 0 because when there is just one item in the array, n = 1 and no iterations 
of the outer loop are performed. 

Now at the time the while loop is used to place a[k] relative to a[1], a[2],..., 
a[k — 1], on average the value of a[k] is equally likely to belong in any one of the first 
k positions. Thus the probability of its belonging in any particular position is 1/k. If 
it actually belongs in position j, then k — j + 1 comparisons will be used to move it, 
because there will be k —j + 1 attempted iterations of the while loop and, as noted in 
Example 11.3.6, there is one comparison per attempted iteration. 

According to the definition of expected value given in Section 9.8, the expected 
number of comparisons used to place a[k] relative to a[1], a[2],..., a[k — 1] is therefore 


k 1 1 k 
Zig a ro aa a 1) by Theorem 5.1.1 


Jol 


= nik +(k—1)+:::-+34+2+1] by writing the summation in 


expanded form 


1/k(k+1) 
= by Theorem 5.2.1 
k 2 
ae by algeb 
3 y algebra. 


Hence 


+1 
E; = Ez- + = for every integer k = 2, and 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


796 CHAPTER11 ANALYSIS OF ALGORITHM EFFICIENCY 


In exercise 27 at the end of the section you are asked to solve this recurrence relation 
to show that 


n+3n—4 . 
n= —z for each integer n = 1. 
; w+3n-4 12,3 ; 2 
b. By the theorem on polynomial orders, z = qn +łn-— 1 is O(n’), and so the 
average-case order of insertion sort is also O(n’). E 
TEST YOURSELF 
1. When an algorithm segment contains a nested through _______ elements of the array before it 
for-next loop, you can find the number of times terminates. 


the loop will iterate by constructing a table in 


; 3. The worst-case order of the insertion sort algo- 
which each column represents 


rithm is , and its average-case order is 
2. In the worst case for an input array of length 
n, the sequential search algorithm has to look 


EXERCISE SET 11.3 


1. Suppose a computer takes 1 nanosecond (=10 ° 
second) to execute each operation. Approximately 
how long will it take the computer to execute 
the following numbers of operations? Convert 
your answers into seconds, minutes, hours, days, 
weeks, or years, as appropriate. For example, 
instead of 2° nanoseconds, write 13 days. 

a. log, 200 b. 200 c. 200 log, 200 
d. 200° e. 200° f. 2° 


c. By what factor will the number of operations 
increase when the input size is increased by a 
factor of ten? 

Exercises 4-5 explore the fact that for relatively small 
values of n, algorithms with larger orders can be more ef- 
ficient than algorithms with smaller orders. 


4. Suppose that when run with an input of size n, 
algorithm A requires 2n? operations and algorithm 


B requires 807? operations. 


, $ 2 á ‘ 
2. Suppose an algorithm requires cn“ operations a. What are orders for algorithms A and B from 
when performed with an input of size n (where c is among the set of power functions? 


a constant). b. For what values of n is algorithm A more 


a. How many operations will be required when 
the input size is increased from m to 2m (where 
m is a positive integer)? 

b. By what factor will the number of operations 
increase when the input size is doubled? 

c. By what factor will the number of operations 
increase when the input size is increased by a 
factor of ten? 


‘ : 3 í 
. Suppose an algorithm requires cn” operations 


when performed with an input of size n (where c is 

a constant). 

a. How many operations will be required when 
the input size is increased from m to 2m (where 


efficient than algorithm B? 
c. For what values of n is algorithm B at least 
100 times more efficient than algorithm A? 


. Suppose that when run with an input of size n, al- 


gorithm A requires 10°n? operations and algorithm 

B requires n operations. 

a. What are orders for algorithms A and B from 
among the set of power functions? 

b. For what values of n is algorithm A more 
efficient than algorithm B? 

c. For what values of n is algorithm B at least 100 
times more efficient than algorithm A? 


For each of the algorithm segments in 6-19, assume that 
n is a positive integer. (a) Compute the actual number of 
elementary operations (additions, subtractions, multipli- 
cations, divisions, and comparisons) that are performed 


m is a positive integer)? 
b. By what factor will the number of operations 
increase when the input size is doubled? 
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when the algorithm segment is executed. For simplicity, 
however, count only comparisons that occur within if- 
then statements; ignore those implied by for-next loops. 
(b) Use the theorem on polynomial orders to find an order 


for the algorithm segment. 


6. fori :=3ton—1 
a:=3:n+2:i-1 
next i 
7. max := afl] 
for i:= 2 ton 
if max < a[i] then max := afi] 
next i 
8. a:=0 
for i := 1 to|n/2| 
a:=at3 
next i 
9. s:=0 
for i := 1 ton 
for j := 1 to 2n 
Si=stiy 
next j 
next i 
10. fork := 2 ton 
for j := 1 to 3n 
x := alk] — b[j] 
next j 
next k 
11. for k:= l1 ton- 1 
forj:= 1 tok+1 
x:= alk] + b[j] 
next j 
next k 
12. for k:= l ton—1 
max := alk] 
for i:= k+1ton 
if max < aļi] then max := afi] 
next 7 
a[k] := max 
next k 
13. fori := 1ton—1 


forj:= iton 
if a[ j | > a[i] then do 


temp := ali] 

ali] := aly] 

alj] := temp 

end do 
next j 


next i 


14. 


15. 


16. 


17. 


18. 


H* 19. 


20. 


21. 
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t:=0 

for i := 1 ton 
s:=0 
for j := | toi 

s:= stat] 

next j 
ti=tts’ 

nexti 

fori := 1 ton—1 


p:=1 
q:=1 
forj:=i+1ton 
p := p'clj] 
q := q (elj)? 
next j 
ri= ptq 
next i 
for i: = 1 ton 
s:=0 
for j:= | toi—1 
ss=stpeG-j+) 
next j 
ri=s* 
next i 


for i := 1 ton 
for j := 1 to |(é+1)/2! 
a:= (n—i):(n—j) 
next j 
next i 


for i := 1 ton 
for j := |(i + 1)/2] ton 
x:= ij 
next j 
next i 


for i := 1 ton 
for j:= | toi 
for k := 1 toj 
x= ijk 
next k 
next j 
next i 


Construct a table showing the result of each step 
when insertion sort is applied to the array a[1] = 6, 
a[2] = 2, a[3] = 1, a[4] = 8, and a[5] = 4. 


Construct a table showing the result of each step 
when insertion sort is applied to the array a[l] = 7, 
a[2] = 3, a[3] = 6, a[4] = 9, and a[5] = 5. 
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22. Construct a trace table showing the action of nee? 


insertion sort on the array of exercise 20. if IndexOfMin + k then 
23. Construct a trace table showing the action of Temp := alk] 

insertion sort on the array of exercise 21. alk] := a[IndexOfMin| 

alIndexOfMin] := Temp 

24. How many comparisons between values of a[j] next k 

and x actually occur when insertion sort is applied Output: a[1], a[2], a[3],..., a[n] [in ascending order] 

to the array of exercise 20? The action of selection sort can be represented pictori- 
25. How many comparisons between values of af j] ally as follows: 

and x actually occur when insertion sort is applied a[l] af2]-*-/alk] lajk +1] °° afn] 


to the array of exercise 21? 


. . kth step: Find the index of the array element with 
26. According to Example 11.3.6, the maximum num- r A 


: i : minimum value from among a[k + 1],..., a{n]. 
ber of comparisons needed to perform insertion If the value of this array element is less than the 
sort on an array of length five is 52 +5-2 = 28. value of a[k], then its value and the value of a[k] 
Find an array of length five that requires the are interchanged. 
maximum number of comparisons when insertion 
sort is applied to it. 28. Construct a table showing the interchanges that 
H 27. Consider the recurrence relation that arose in occur when selection sort is applied to the ar- 
Example 11.3.7: E; = 0 and E, = E,_, +, for ray a[1] = 7, a[2] = 3, a[3] = 8, a[4] = 4, and 
each integer k = 2. a[5] = 2. 
a. Use iteration to find an explicit formula for the 29. Construct a table showing the interchanges that 
sequenge, PO ; ; occur when selection sort is applied to the ar- 
b. Use mathematical induction to verify the cor- ray a[1] = 6, a[2] = 4, a[3] = 5, a[4] = 8, and 
rectness of the formula. a[5] = 1. 
Exercises 28-35 refer to selection sort, which is another 30. Construct a trace table showing the action of 
algorithm to arrange the items in an array in ascending selection sort on the array of exercise 28. 
order. 


31. Construct a trace table showing the action of 
selection sort on the array of exercise 29. 


Algorithm 11.3.2 Selection Sort 
[Given an array a[1], a[2], a[3], ..., a[n], this algorithm 32 
selects the smallest element and places it in the first 

position, then selects the second smallest element and 


. When selection sort is applied to the array of exer- 
cise 28, how many times is the comparison in the 


es os i if-then statement performed? 
places it in the second position, and so forth, until the P 


entire array is sorted. In general, for each k = | ton—~|1, 33. When selection sort is applied to the array of exer- 
the kth step of the algorithm selects the index of the array cise 29, how many times is the comparison in the 
item with minimum value from among a[k + 1], a[k+ 2], if-then statement performed? 

a[k + 3],..., a[n]. Once this index is found, the value of ` 


the corresponding array item is interchanged with the 34. When selection sort is applied to an array a[1], 
value of a[k] unless the index already equals k. At the end a2], a[3], a[4], how many times is the comparison 
of execution the array elements are in order.] in the if-then statement performed? 

Input: n [a positive integer], a[1], a[2], a[3],..., a[n] [an 


35. Consider applying selection sort to an array a[1], 

a{2], a[3],..., a[n]. 

a. How many times is the comparison in the 
IndexOfMin := k if-then statement performed when a[1] is com- 
fori kt tton pared to each of a[2], a[3],..., a[n]? 

if ali] < aUndexOfMin]) b. How many times is the comparison in the 
then IndexOfMin := i if-then statement performed when a[2] is 
compared to each of a[3], a[4],..., a[n]? 


array of data items capable of being ordered] 


Algorithm Body: 
for k:= 1ton—-1 
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How many times is the comparison in the 
if-then statement performed when afk] is 
compared to each of a[k— 1], a[k+2],..., 
a(n]? 

Using the number of times the comparison 
in the if-then statement is performed as a 
measure of the time efficiency of selection 
sort, find a worst-case order for selection 
sort. Use the theorem on polynomial 
orders. 


Hd. 


Exercises 36-39 refer to the following algorithm to com- 
pute the value of a real polynomial. 


Algorithm 11.3.3 Term-by-Term Polynomial Evaluation 
[This algorithm computes the value of a polynomial 
aln] + a[n— Ux) +--+ a{2|x? + afl x + a[0] by 
computing each term separately, starting with a[0], and 
adding it to an accumulating sum.] 


Input: n /a nonnegative integer], a[0], a[1], a[2],..., a[n] 
[an array of real numbers], x [a real number] 
Algorithm Body: 

polyval := a0] 

fori := 1 ton 


term := a{i] 
for j:= 1 toi 
term := term-x 
next j 
polyval := polyval + term 
next i 
[At this point 


polyval = a[n]x" + a[n— 1x"! 
+- + [2]? + afl ]x + af0]./ 
Output: polyval [a real number] 


36. Trace Algorithm 11.3.3 for the input n = 3, 
a[0] = 2, a[l] = 1, a[2] = —1, a[3] = 3, and 
x=2., 


37. Trace Algorithm 11.3.3 for the input n = 2, 


a{O] = 5, a[1] = —1, a[2] = 2, and x = 3. 


38. Let s, = the number of additions and multiplica- 
tions that are performed when Algorithm 11.3.3 is 
executed for a polynomial of degree n. Express s, 


as a function of n. 


ANSWERS FOR TEST YOURSELF 


11.3 APPLICATION: ANALYSIS OF ALGORITHM EFFICIENCY | 


799 


39. Use the theorem on polynomial orders to find an 


Ex 
kn 


order for Algorithm 11.3.3. 


ercises 40-43 refer to another algorithm, 
own as Horner’s rule, for finding the value of a 


polynomial. 


Algorithm 11.3.4 Horner’s Rule 

[This algorithm computes the value of a polynomial 
aln] + a[n— Ux +++ + a{2|x? + afl ]x + a[0] by nest- 
ing successive additions and multiplications as indicated 
in the following parenthesization: 


(C-e (a{n]x + afn — 1])x + afn — 2])x 
+ +++ +a[2])x + a[1])x + a0]. 


At each stage, starting with a[n], the current value of 
polyval is multiplied by x and the next lower coefficient of 
the polynomial is added to it.] 


Input: n [a nonnegative integer], a[0], a[1], a[2],..., 
a[n] [an array of real numbers], x [a real number] 


Algorithm Body: 
polyval := ajn] 
for i:= 1 ton 

polyval := polyval:x + a[n — i] 
next i 
[At this point 
polyval = a{n]x" + a[n — 1]x""! 
a[2]x° 4 


Output: polyval [a real number] 


a{1]x + a[0]./] 


40. 


41. 


H 42. 


43. 


Trace Algorithm 11.3.4 for the input n = 3, a[0] = 2, 
a[1] = 1, a[2] = —1, a[3] = 3, and x = 2. 


Trace Algorithm 11.3.4 for the input n = 2, 
a[0] = 5, a[1] = —1, a[2] = 2, and x = 3. 


Let t, = the number of additions and multiplica- 
tions that are performed when Algorithm 11.3.4 is 
executed for a polynomial of degree n. Express t, 
as a function of n. 


Use the theorem on polynomial orders to find an 
order for Algorithm 11.3.4. How does this order 
compare with that of Algorithm 11.3.3? 


1. one iteration of the innermost loop 2.n 3. nn? 
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HEJ Exponential and Logarithmic Functions: 
Graphs and Orders 


Although this may seem a paradox, all exact science is dominated by the idea of 
approximation. —Bertrand Russell, 1872-1970 


Exponential and logarithmic functions are of great importance in mathematics in gen- 
eral and in computer science in particular. Several important computer algorithms 
have execution times that involve logarithmic functions of the size of the input data 
(which means they are relatively efficient for large data sets), and some have execution 
times that are exponential functions of the size of the input data (which means they are 
extremely inefficient for large data sets). In addition, since exponential and logarith- 
mic functions arise naturally in the descriptions of many growth and decay processes 
and in the computation of many kinds of probabilities, these functions are used in 
the analysis of computer operating systems, in queuing theory, and in the theory of 
information. 


Graphs of Exponential Functions 


As defined in Section 7.2, the exponential function with base b > 0 is the function that 
sends each real number x to b*. The graph of the exponential function with base 2 (together 
with a partial table of its values) is shown in Figure 11.4.1. Note that the values of this 
function increase with extraordinary rapidity. If we tried to continue drawing the graph 
using the scale shown in Figure 11.4.1, we would have to plot the point (10, 2!°) more than 
6 meters above the horizontal axis. And the point (30, 230) would be located more than 
982,228 kilometers higher—well beyond the moon! 


x 2” 

0 29=1 

1 2l = 

2 ?=4 

3 23 = 
-1 21=05 
-2 2-2 = 0.25 
3 2-3 = 0.125 

0.5 205 = 1.414 
-0.5 | 2-25 = 0.707 


FIGURE 11.4.1 The Exponential Function with Base 2 


The graph of any exponential function with base b > 1 has a shape that is simi- 
lar to the graph of the exponential function with base 2. If 0 < b < 1, then 1/⁄b > 0 
and the graph of the exponential function with base b is the reflection across the 
vertical axis of the exponential function with base 1/b. These facts are illustrated in 
Figure 11.4.2. 
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Graph of the exponential function Graph of the exponential function 
with base b > 1 with base b where 0 < b < 1 
(a) (b) 


FIGURE 11.4.2 Graphs of Exponential Functions 


Graphs of Logarithmic Functions 


Logarithms were first introduced by the Scotsman John Napier. Astronomers and naviga- 
tors found them so useful for reducing the time needed to do multiplication and division 
that they quickly gained wide acceptance and played a crucial role in the remarkable devel- 
opment of those areas in the seventeenth century. Nowadays, however, electronic calcula- 
tors and computers are available to handle most computations quickly and conveniently, 
and logarithms and logarithmic functions are used primarily as conceptual tools. 

Recall the definition of the logarithmic function with base b from Section 7.1. We state 
it formally below. 


Bettmann/Getty Images 


(ey abae 


John Napier Definition 
(1550-1617) 


If b is a positive real number not equal to 1, then the logarithmic function with 
base b, log,: R* — R, is the function that sends each positive real number x to the 
number log, x, which is the exponent to which b must be raised to obtain x. 


The logarithmic function with base b is, in fact, the inverse of the exponential function 
with base b. (See exercise 10 at the end of this section.) It follows that the graphs of the 
two functions are symmetric with respect to the line y = x. The graph of the logarithmic 
function with base b > | is shown in Figure 11.4.3. 

If its base b is greater than 1, the logarithmic function is increasing. Analytically, this 
means that 


if b > 1, then for all positive numbers x, and x», 


if x) < x7, then log,(x)) < log, (x2). 11.4.1 


Note As examples, Corresponding to the rapid growth of the exponential function, however, is the very 
log,(1,024) is only 10 and slow growth of the logarithmic function. Thus you must go very far out on the horizontal 
logo(1,048,576) is just 20. axis to find points whose logarithms are large numbers. 
The following example shows how to make use of the increasing nature of the logarith- 
mic function with base 2 to derive a remarkably useful property. 
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YA 


logy x2 
14 


FIGURE 11.4.3 The Graph of the Logarithmic Function with Base b > 1 


Base 2 Logarithms of Numbers between Two Consecutive Powers of 2 


Prove the following property: 


a. 
If k is an integer and x is a real number with 


2% < x <2**! then |log; x] = k. 11.4.2 


b. Describe property (11.4.2) in words and give a graphical interpretation of the property 
for x > 1. 


Solution 


a. Suppose that k is an integer and x is a real number with 
eee! 
Because the logarithmic function with base 2 is increasing, this implies that 
log,(2") = logy x < log,(2‘*?). 


But log,(2") = k [the exponent to which you must raise 2 to get 2* is k] and 
log,(2**!) = k+ 1 [fora similar reason]. Hence 


k=logyax<k+1. 
By definition of the floor function, then 
logs x] = k. 


b. Recall that the floor of a positive number is its integer part. For instance, [2.82] = 2. 
Hence property (11.4.2) can be described in words as follows: 


If x is a positive number that lies between two consecutive integer powers of 2, the 
floor of the logarithm with base 2 of x is the exponent of the smaller power of 2. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


11.4 EXPONENTIAL AND LOGARITHMIC FUNCTIONS: GRAPHS AND ORDERS 803 


A graphical interpretation follows: 


y=logyx 


then log, x lies in here: 


~v- 


= 3. se 8 k 
1 2 2*=4 <2 =8 2 = 
If x lies in here 


One consequence of property (11.4.2) does not appear particularly interesting in its own 
right but is frequently needed as a step in the analysis of algorithm efficiency. 


J 
nkt 1 


When |log.(n—1)] = [logan] 


Prove the following property: 


For any odd integer n > 1, [logo(n — 1)| = [logs n]. 11.4.3 


Solution If nis an odd integer that is greater than 1, then 7 lies strictly between two suc- 
cessive powers of 2: 


2% <n<2*! for some integer k > 0. 11.4.4 
It follows that 2* = n— 1 because 2‘ < n and both 2‘ and n are integers. Consequently, 
en=1 <2" 11.4.5 
Applying property (11.4.2) to both inequalities (11.4.4) and (11.4.5) gives 
[logan] = k and also |logy(n — 1)| = k. 
Hence [logy] = [log,(n — 1). E 


Application: Number of Bits Needed to Represent 
an Integer in Binary Notation 


Given a positive integer n, how many binary digits are needed to represent n? To answer 
this question, recall from Section 5.4 that any positive integer n can be written in a unique 
way as 


n= Atera ltet Co 


where k is a nonnegative integer and each Co, C4, Co, . . . Cg—1 is either 0 or 1. Then the binary 
representation of n is 


1cy—1Cx—2* + * C2C1C0, 


and so the number of binary digits needed to represent n is k+ 1. 
What is k + 1 as a function of n? Observe that since each c; = 1, 


n= Mey yp 2 t-te Pte 2+ep StH 14-4 V4241. 
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Now, by the formula for the sum of a geometric sequence (Theorem 5.2.2), 


k+l _ 


ooh 4-477 42415 =e 2+1], 
Hence, by transitivity of order, 
nat =, 11.4.6 
In addition, because each c; = 0, 
Hate e lt beg +ey-2t+eo =n. 11.4.7 


Putting inequalities (11.4.6) and (11.4.7) together gives the double inequality 
Jano", 
Then, by property (11.4.2), 
k = |logzn], 


and so the number of binary digits needed to represent n is [logyn|+ 1. 


Se Meee Number of Bits in a Binary Representation 
How many binary digits are needed to represent 52,837 in binary notation? 


Solution If you compute the logarithm with base 2 using the formula in part (a) of Theo- 
rem 7.2.1 and a calculator or computer that gives you approximate values of logarithms 
with base 10, you find that 


logio(52,837) _ 4722938151 _ i57 
logio(2)  0.3010299957 ~ 


log2(52,837) = 


Thus the binary representation of 52,837 has |15.7)+ 1 = 15+ 1 = 16 binary digits. E 


Application: Using Logarithms to Solve Recurrence Relations 


In Chapter 5 we discussed methods for solving recurrence relations. A class of recur- 
rence relations that is very important in computer science has solutions that can be 
expressed in terms of logarithms. One such recurrence relation is discussed in the next 
example. 


elu wee A Recurrence Relation with a Logarithmic Solution 
Define a sequence qj, do, a3, . . . recursively as follows: 
4 = 1 
ak = 2akj2} for each integer k = 2. 
a. Use iteration to guess an explicit formula for this sequence. 


b. Use strong mathematical induction to confirm the correctness of the formula obtained 
in part (a). 
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Solution 


a. Begin by iterating to find the values of the first few terms of the sequence. 


4 = 1 j=2" 

ay 22/2 2a, 2°1 } =o) 
Po, 2a3/2) = 2a, = 2-1 =2 

a4 2a 4/2 2a 2°2 4 sao? 
Pt 2ays/2| = 2ay = 2:2 = 4 

de 246/2 2a3 2:2 4 

a7 = 27/2) = 2a; = 2:2 =4 

ag 2ag/2 2d4 2-4 8 g= 
Hi Mae) = 244 = 2:4 = 8 

4i5 = 2a 15/2] = 2a, =2:4=8 


Pa = 2q 16/2; = 2ag = 2:8 = S, 16 = 2" 


Note that in each case when the subscript n is between two powers of 2, a, equals the 
smaller power of 2. More precisely: 


If2i=n<2'*! then a, = 22. 11.4.8 
Now since n satisfies the inequality 
<n< it) 
then (by property 11.4.2) 
i = [logy n|. 
Substituting into statement (11.4.8) suggests that 


i= piles. n) 


b. The following proof shows that the recursively defined sequence defined in this 
example does in fact satisfy the formula obtained in part (a). In other words, if a), a2, 
a3, . . . is a sequence of numbers that satisfies 


a =l, and ay = 2a 4/2, foreach integer k = 2, 
then the sequence satisfies the formula 
a, = 202”! for every integer n = 1. 


Proof: 
Let a4, a2, a3, . . . be the sequence defined by specifying that a, = 1 and ap = 2an; for 
each integer k = 2, and let the property P(n) be the equation 


a, = oen e P(n) 


We will use strong mathematical induction to prove that for every integer n = 1, P(n) 
is true. 


Show that P(1) is true: By definition of a1, ay, a3, . . . , we have that a; = 1. Now it is 
also the case that 22® "! = 2° = 1. Thus a, = 2®°® " and P(1) is true. 
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Show that for every integer k = 1, if P(i) is true for each integer i from 1 through k, 
then P(k +1) is also true: Let k be any integer with k = 1, and suppose that 


qa; = glee 4 for each integer i with 1 S i S k. <— inductive hypothesis 


We must show that 


üi = qlogs(k+ DI <— P(k+1) 
Consider the two cases: k is even and k is odd. 
Case 1 (k is even): In this case, k + 1 is odd, and 
ak+1 = 2a(k+ 1)/2 by definition of a), ay, 43, . . . 
= 2/2 because |(k + 1)/2| = k/2 since k + 1 is odd 
= 2 qlioge (k/2)) by inductive hypothesis because, since k is even, k = 2, and 
so k/2 = 1 

=2 loga(/2)|+ 1 by the laws of exponents from algebra (7.2.1) 

= 208k — log:2)1+ 1 by the identity log, (x/y) = log,x —log,y from Theorem 7.2.1 

= qlloesk—1+1 since log)2 = 1 

= iogkl-1+1 by substituting x = log» k into the identity 

= 28k lx— 1J = |x]— 1 derived in exercise 15 of Section 4.6 

logo(k+ 1) 


= by property (11.4.3). 


Case 2 (k is odd): The analysis of this case is very similar to that of case 1 and is left 
as exercise 51 at the end of the section. 


logo(k + 1)] 


Thus in either case, a, = 2 , as was to be shown. E 


Exponential and Logarithmic Orders 


Now consider the question “How do graphs of logarithmic and exponential functions com- 
pare with graphs of power functions?” It turns out that for large enough values of x, the graph 
of the logarithmic function with any base b > 1 lies below the graph of every power function 
with a positive exponent, and the graph of the exponential function with any base b > 1 lies 
above the graph of each of these power functions. In analytic terms, this says the following: 


For all real numbers b and r with b > 1 and r > 0, there is a positive real number s 
such that 


log, x Sx” for every real number x = s. 11.4.9 


and x" = b* for every real number x = s. 11.4.10 


These statements have the following implications for O-notation. 


For all real numbers b and r with b > 1 and r > 0, 
log,n is O(n’) 11.4.11 
and n” is O(b") 1114:12 


Another important function in the analysis of algorithms is the function f defined by 
the formula 


f(x) =xlog,x for every real number x > 0. 
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For large values of x, the graph of this function fits in between the graph of the identity 
function and the graph of the squaring function. More precisely: 


For every real number b with b > 1, there is a positive real number s such that for 
every real number x = s, 


x=xlog,x =x. 11.4.13 


The O-notation versions of these facts are as follows: 


For every real number b > 1, 


n is O(mlog,n) and nlog,n is O(n’). 11.4.14 


Although proofs of some of these facts require calculus, some cases can be verified us- 
ing the algebra of inequalities. (See the exercises at the end of this section.) Figure 11.4.4 
illustrates the relationships among some power functions, the logarithmic function with 
base 2, the exponential function with base 2, and the function defined by the formula 
x — x log, x. Note that different scales are used on the horizontal and vertical axes. 


YA 


15+ 


10+ 


FIGURE 11.4.4 Graphs of Some Logarithmic, Exponential, and Power Functions 


Examples 11.4.5 and 11.4.6 use properties of logarithms together with theorems from 
Section 11.2 to derive orders for combinations that involve logarithmic and exponential 
functions. 


Orders That Involve Logarithmic Functions 


a. Show that if b and c are any real numbers with b > 1 and c > 1, then log, n and log, n 
have the same order. 


b. Show that n + n log, n is O(n logy n). 
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Solution 
a. By part (d) of Theorem 7.2.1, 


log.n 1 
log,n = = logen 
log..b log.b 
1 
Since both b > 1 and c > 1, then log.b > 0, and so > 0. Thus, by Theorem 
11.2.8, O8cb 


1 
(st }ieen is @dog,n), and so log,n is O©(logen). 
log.b 


Exactly the same reasoning with b and c interchanged gives that 
logen is @(logpn). 
Thus logn and log, n have the same order. 


b. By Theorem 11.2.7, n is O(n) and n logzn is O(n log,n). In addition, by property 
11.4.13, there is a positive real number r such that for every integer n = r, n =n logyn. 
Thus it follows from Theorem 11.2.9(c) that n + n logyn is O(n logan). E 


An Exponential Order 


Show that n! is O(n"). 


Solution 
For any positive integer n, 


n! =n: (n— l) (n—2)2-1 
—— —-” 
n factors 
<nnnenn=n" 
ie Le os eh 


n factors 


because (n— 1) =n, (n—2) =n,...,2 Sn, and 1 =n. Let B = 1 and b = 1. Then, since 
n! =0Owhenn= b, 


O0Sn!<n” for every integer n = b. 


Thus n! is O(n”) by definition of O-notation. E 


Example 11.4.7 shows how a logarithmic order can arise from the computation of a 
certain kind of sum. It requires the following fact from calculus: 


The area underneath the graph of y = 1/x between x = | and x = n equals In n, where 
Inn = log, n. This fact is illustrated in Figure 11.4.5. 


YA 


Graph of y = l 


Area of shaded region = Inn 


> 
1 n A 


1 
FIGURE 11.4.5 Area under graph of y = — between x = 1 and x =n 
x 
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Senmieewa” Order of a Harmonic Sum 


1 1 
Sums of the form | + 7 prege 7 are called harmonic sums. They occur in the analysis of 


1 1 
various computer algorithms such as quick sort. Show that 1 + > F 3 +++ +—is Odnn) 
by performing the steps shown below: n 


a. Interpret Figure 11.4.6 to show that 


and 


1 
Ings Ae aa, 
2 3 n 


b. Show that if n is an integer that is at least 3, then 1 < Inn. 


c. Deduce from (a) and (b) that if the integer n is greater than or equal to 3, then 


1 1 
Inns 1+>+>+ + 
2 3 


=2Inn. 


a [e 


d. Deduce from (c) that 


1 1 L 
1+=+=+4+---+=— is @dnn). 
2 3 n 


Solution 


a. Figure 11.4.6(a) shows rectangles whose bases are the intervals between each pair of 
integers from 1 to n and whose heights are the heights of the graph of y = 1/x above 
the right-hand endpoints of the intervals. Figure 11.4.6(b) shows rectangles with the 
same bases but whose heights are the heights of the graph above the left-hand end- 
points of the intervals. 


Graph of y = 1 Graph of y = 1 


Total area under graph 
from | ton=Inn 


Total area under graph (1, 1) 


from 1 ton=Inn 


d, 1) 


1 
(3.3) (4,2) G-La) (ai ae ale) Ge 


FIGURE 11.4.6 


Now the area of each rectangle is its base times its height. Since all the rectangles 
have base 1, the area of each rectangle equals its height. Thus in Figure 11.4.6(a), 
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1 
the area of the rectangle from 1 to 2 is z; 


1 
the area of the rectangle from 2 to 3 is 3° 
: 4 
the area of the rectangle from n — 1 to n is T 


1 1 1 
So the sum of the areas of all the rectangles is a +F 3 seen = From the picture it is 


clear that this sum is less than the area underneath the graph of f between x = 1 and 
x = n, which is known to equal In n. Hence 
1 1 1 
= + = + eae + = 
2 3 n 
A similar analysis of the areas of the combined blue and gray rectangles in Figure 
11.4.6(b) shows that 


= Inn. 


1 1 1 
InnS1+—4+=4+-°°° 45 
2 3 n 


b. Suppose n is an integer and n = 3. Since e = 2.718, then n = e. Now the logarithmic 
function with base e is strictly increasing. Thus since e = n, then 1 = Ine = Inn. 


c. By part (a), 


and by part (b), 
1=Ina. 


Adding these two inequalities together gives 


i ee 1 
Lg ge 2lnn for each integer n = 3. 
n 


d. Combining the results of parts (a) and (c) leads to the conclusion that for every integer 
n= 3, 
1 1 1 
Inn Ss lot ot et SS 2 In. 
2 3 n 
Let A = 1, B = 2, and k = 3. Then Inn > 0 for n = 3 and 


1 1 1 
Alnns a a as +) = Blinn for every integer n > k. 


Hence by definition of @-notation, 


1 1 1, 
1+—+—+-:-+-— is Onn). a 
2 3 n 


Now by Example 11.4.5(a), all logarithmic functions have the same order, and thus for 
each real number b > 1, 


itih pocti poi 
2'3 z 8 (log, n). 
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TEST YOURSELF 


1. 


EXERCISE SET 11.4 


11.4 EXPONENTIAL AND LOGARITHMIC FUNCTIONS: GRAPHS AND ORDERS 


811 


The domain of any exponential function is 
and its range is 


The domain of any logarithmic function is 
and its range is 


. If kis an integer and 2% < x < 247! then 


logs x) = —__. 


. If bis areal number with b > 1, then there is a 


positive real number s with the property that for 

any real number x that is greater than s, when the 
quantities x, x, log x, and x log x are arranged in 
order of increasing size, the result is 


If n is a positive integer, then 1 + 5 + tor 4t 
has order 


Graph each function defined in 1-8. 


1. 


Sa fo » PF ww ON 


10. 


Tl. 


f(x) = 3° for each real number x 

g(x) = (Gy for each real number x 

h(x) = log, x for each positive real number x 
k(x) = log, x for each positive real number x 
F(x) = |log; x] for each positive real number x 
G(x) = |log; x] for each positive real number x 
A(x) = x log, x for each positive real number x 
K(x) = x log, x for each positive real number x 


The scale of the graph shown in Figure 11.4.1 

is 6.35 millimeters to each unit. If the point (2, 
26$) is plotted on the graph of y = 2*, how many 
kilometers will it lie above the horizontal axis? 
What is the ratio of the height of the point to 

the distance of the earth from the sun? 

(The earth is approximately 150,000,000 kilometers 
from the sun on average.) 


a. Use the definition of logarithm to show that 
log, b* = x for every real number x. 

b. Use the definition of logarithm to show that 
bS = x for every positive real number x. 

c. By the result of exercise 28 in Section 7.3, 
if f: X — Y and g: Y — X are functions and 
g° f= Ixand f° g = ly, then f and g are 
inverse functions. Use this result to show that 
log, and exp, (the exponential function with 
base b) are inverse functions. 


Let b> 1. 

a. Use the fact that u = log, v + v = b" to show 
that a point (u, v) lies on the graph of the loga- 
rithmic function with base b if, and only if, 


12. 


14. 


15. 


H 16. 


17. 


(v, u) lies on the graph of the exponential func- 
tion with base b. 

b. Plot several pairs of points of the form (u, v) 
and (v, u) on a coordinate system. Describe the 
geometric relationship between the locations of 
the points in each pair. 

c. Draw the graphs of y = logy x and y = 2". 
Describe the geometric relationship between 
these graphs. 


Give a graphical interpretation for property 
(11.4.2) in Example 11.4.1(a) forO < x < 1. 


. Suppose a positive real number x satisfies the inequal- 


ity 10” = x < 10”! where mis an integer. What 
can be inferred about |logj, x}? Justify your answer. 


a. Prove that if x is a positive real number and k is 
a nonnegative integer such that 2l ae ea 
then flog; x] = k. 

b. Describe in words the statement proved in 
part (a). 


If n is an odd integer and n > 1, is 
flog.(n — 1)] = [log(n)|? Justify your answer. 


If n is an odd integer and n > 1, is 
flogo(n + 1)] = Jlog,(7)]? Justify your answer. 


If n is an odd integer and n > 1, is 
Hlog.(m + 1)| = [logs(n)J? Justify your answer. 


In 18 and 19, indicate how many binary digits are needed 
to represent the numbers in binary notation. Use the 
method shown in Example 11.4.3. 


18. 
19. 


148,206 
5,067,329 
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20. It was shown in the text that the number of binary 
digits needed to represent a positive integer n is 


log, n|+ 1. Can this also be given as [logs n]? Why 


or why not? 


In each of 21 and 22, a sequence is specified by a recur- 
rence relation and initial conditions. In each case, (a) use 
iteration to guess an explicit formula for the sequence; 
(b) use strong mathematical induction to confirm the cor- 
rectness of the formula you obtained in part (a). 


21. dy = agnt 2, for each integer k = 2 


4&4 = il 
22. b= bwt 1, foreach integer k = 2 
by =1. 
H 23. Define a sequence cy, C2, C3, . . . recursively as 
follows: 
cy 0 


Ck = Gu2)+k, for each integer k = 2. 


Use strong mathematical induction to show that 
Cy = n’ for every integer n = 1. 


*H 24. Use strong mathematical induction to show that 
for the sequence of exercise 23, c, <= n log, n, for 
every integer n = 4. 


Exercises 25 and 26 refer to properties 11.4.9 and 


11.4.10. To solve them, think big! 
25. Find a real number x > 3 such that logy x < zt 


26. Find a real number x > 1 such that x? < 2". 


Use Theorems 11.2.7-11.2.9 and properties 11.4.11, 
11.4.12, and 11.4.13 to derive each statement in 27-30. 


27. 2n+ logy nis O(n) 
28. n? + 5n log, n is O(n”) 
29. n? +2" is O2” 

H 30. 2"*! is OQ” 

H 31. Show that 4” is not O(2”). 


Prove each of the statements in 32-37, assuming n is an 
integer variable that takes positive integer values. Use 
identities from Section 5.2 as needed. 


32. 1+42+2 +22 + +2 is OQA. 
H 33. 4+4 +4 +- +4" is OQ m, 
34. 243293 aia 42-32" iš @(32”). 
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1 4 # 4" 
5. sa a" TER tori 


is O(1). 


36. nto+— +--+ + is O(n) 
par r zn is O(n). 


E an an an at Om) 
be ee ee es O(n). 
3: 3 2? a 
38. Quantities of the form 
kin+k,nlogn for positive integers kı, k2, and n 


arise in the analysis of the merge sort algorithm 
in computer science. Show that for any positive 
integer k, 
kın + kən logy n is O(n logy n). 
39. Calculate the values of the harmonic sums 


1 1 1 
1+5+>+ + 
2 3 n 


40. Use part (d) of Example 11.4.7 to show that 


for n = 2, 3, 4, and 5. 


non n 
n+ —+—+---+— is O(n Inn). 
2 3 n 


41. Show that log, n] is O(log, n). 
42. Show that flog, n| is O(log, n). 


43. Prove by mathematical induction that n = 10" for 
every integer n = 1. 


H 44. Prove by mathematical induction that log, n = n 
for every integer n = 1. 


H 45. Show that if n is a variable that takes positive 
integer values, then 2” is O(n!). 


46. Let n be a variable that takes positive integer 


values. 
a. Use Example 11.4.6 to show that log,(n!) is 
O(n log, n). 


H b. Show that n” < (n!) for every integer n = 1. 
c. Use part (b) to show that log, (n!) is Q(n log, n). 
d. Use parts (a) and (c) to find an order for log,(n!). 


47. For each positive real number u, log, u < u. Use 
this fact and the result of exercise 21 in Section 
11.1 to prove the following: For every integer 

n = 1, if xis any real number with x > (2n)*”, 


then log, x < x!/”, 


48. Use the result of exercise 47 above to prove the 
following: For every integer n = 1, if x is any real 
number with x > (2n)*", then x” < 2°. 


11.5 APPLICATION: ANALYSIS OF ALGORITHM EFFICIENCY II 813 


Exercises 49 and 50 use L’H6pital’s rule from calculus. 50. a. Let b be any real number greater than 1. Use 
49. a. Let b be any real number greater than 1. Use P ca ha ae prove: Mabtor even eee 
L Hôpital’s rule and mathematical induction to vs 
prove that for every integer n = 1, | log,x _ 
a lim 1/n 
x’ x 
lim a = 0. b. Use the result of part (a) and the definitions of 


limit and of O-notation to prove that log, x is 


b. Use the result of part (a) and the definitions of O(<'") for any integer n = 1. 


limit and of O-notation to prove that x" is O(b") 
for any integer n = 1. 51. Complete the proof in Example 11.4.4. 


ANSWERS FOR TEST YOURSELF 


1. the set of all real numbers; the set of all positive real of all real numbers 3.k 4. log, x< x< x log, x <x? 


numbers 2. the set of all positive real numbers; the set 5. In x (or, equivalently, log x) 


Application: Analysis of Algorithm Efficiency ll 


In almost every computation a great variety of arrangements for the succession of 
the processes is possible, and various considerations must influence the selections 
amongst them for the purposes of a calculating engine. One essential object is to 
choose that arrangement which shall tend to reduce to a minimum the time necessary 
for completing the calculation. —Ada Augusta, Countess of Lovelace, 1843 


Have you ever played the “guess my number” game? A person thinks of a number between 
two other numbers—say, | and 10 or 1 and 100 for example—and you try to figure out 
what it is, using the least possible number of guesses. Each time you guess a number, the 
person tells you whether you are correct, too low, or too high. 

Lady Lovelace If you have played this game, you have probably already hit upon the most efficient 
(19191832) strategy: Begin by guessing a number as close to the middle of the two given numbers as 
possible. If your guess is too high, then the number is between the lower of the two given 
numbers and the one you first chose. If your guess is too low, then the number is between 
the number you first chose and the higher of the two given numbers. In either case, you 
take as your next guess a number as close as possible to the middle of the new range in 
which you now know the number lies. You repeat this process as many times as necessary 
until you have found the person’s number. 

The technique described previously is an example of a general strategy called divide 
and conquer, which works as follows: To solve a problem, reduce it to a fixed number of 
smaller problems of the same kind, which can themselves be reduced to the same fixed 
number of smaller problems of the same kind, and so forth until easily resolved problems 
are obtained. In this case, the problem of finding a particular number in a given range of 
numbers is reduced at each stage to finding a particular number in a range of numbers 
approximately half as long. 

It turns out that algorithms using a divide-and-conquer strategy are generally quite 
efficient and nearly always have orders involving logarithmic functions. In this section we 
define the binary search algorithm, which is the formalization of the “guess my number” 


Hulton Archive/Getty Images 
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game described previously, and we compare the efficiency of binary search to the sequen- 
tial search discussed in Section 11.3. Then we develop a divide-and-conquer algorithm for 
sorting, merge sort, and compare its efficiency with that of insertion sort and selection sort, 
which were also discussed in Section 11.3. 

Almost a hundred years before the first computer was built and working only from a 
description of how such a machine might function, Ada Augusta, Countess of Lovelace, 
foresaw both the general nature of the problems it could address and the importance of 
designing an efficient arrangement of computations to solve them. 


Binary Search 


Whereas a sequential search can be performed on an array whose elements are in any 
order, a binary search can be performed only on an array whose elements are arranged in 
ascending (or descending) order. Given an array a[1], a[2], . . . , a[n] of distinct elements 
arranged in ascending order, consider the problem of trying to find a particular element x 
in the array. 

To use binary search, first compare x to the “middle element” of the array. If the two 
are equal, the search is successful. If the two are not equal, then because the array ele- 
ments are in ascending order, comparing the values of x and the middle array element 
narrows the search either to the lower subarray (consisting of all the array elements below 
the middle element) or to the upper subarray (consisting of all array elements above the 
middle element). 

The search continues by repeating this basic process over and over on successively 
smaller subarrays. It terminates either when a match occurs or when the subarray to 
which the search has been narrowed contains no elements. The efficiency of the algo- 
rithm is a result of the fact that at each step, the length of the subarray to be searched 
is roughly half the length of the array of the previous step. This process is illustrated in 
Figure 11.5.1. 


left subarray middle element right subarray 
cr š ` ¢ A + 


afr] see a[mid — 1] a[mid | a[mid + 1] see als] 


Compare x to a[mid]. If the two 
are equal, the search ends. 


x < a[{mid] x > a[mid| 
Search the left subarray Search the right subarray 
a{r],..., a[mid — 1] for x. a[mid + 1],..., a[s] for x. 


FIGURE11.5.1 One Iteration of the Binary Search Process 


To write down a formal algorithm for binary search, we introduce a variable index 
whose final value will tell us whether or not x is in the array and, if so, will indicate the 
location of x. Since the array goes from a[1] to a[n], we initialize index to be 0. If and 
when x is found, the value of index is changed to the subscript of the array element equal- 
ing x. If index still has the value 0 when the algorithm is complete, then x is not one of 
the elements in the array. Figure 11.5.2 shows the action of a particular binary search. 

Formalizing a binary search algorithm also requires that we be more precise about 
the meaning of the “middle element” of an array. (This issue was side-stepped by careful 
choice of n in Figure 11.5.2.) If the array consists of an even number of elements, there are 
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a[l] = a{2] = a{3] = a{4] = a{5] z a[6] a a{7] 


no: x > a[4] 


no: x < a[6] 


yes 


index = 


FIGURE11.5.2 Binary Search of a[1], a[2], . . . , a[7] for x (for x = a[5]) 


two elements in the middle. For instance, both a[6] and a[7] are equally in the middle of 
the following array. 


a3] al4] al5] al6] al7] a8] af9] a{10] 


three elements two middle three elements 
elements 


In a case such as this, the algorithm must choose which of the two middle elements to 
take, the smaller or the larger. The choice is arbitrary—either would do. We will write the 
algorithm to choose the smaller. The index of the smaller of the two middle elements is the 
floor of the average of the top and bottom indexes of the array. That is, if 

bot = the bottom index of the array, 

top = the top index of the array, and 


mid = the lower of the two middle indexes of the array, 
then 


bot + to, 
ni =| | 


In this case, bot = 3 and top = 10, so the index of the “middle element” is 


mid = ? = = p =]|6.5] = 6. 


The following is a formal algorithm for a binary search. 


Algorithm 11.5.1 Binary Search 


[The aim of this algorithm is to search for an element x in an ascending array of ele- 
ments a[|1], a[2],..., a[n]. If x is found, the variable index is set equal to the index of the 
array element where x is located. If x is not found, index is not changed from its initial 
value, which is 0. The variables bot and top denote the bottom and top indexes of the 
array currently being examined. ] 


Input: n [a positive integer], a[1], a[2],..., a[n] [an array of data items given in as- 
cending order], x [a data item of the same data type as the elements of the array] 


(continued on page 816) 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


816 


CHAPTER11 ANALYSIS OF ALGORITHM EFFICIENCY 


Algorithm Body: 
index := 0, bot := 1, top:=n 
[Compute the middle index of the array, mid. Compare x to a{mid]. If the two are 


equal, the search is successful. If not, repeat the process either for the lower or for 
the upper subarray, either giving top the new value mid —1 or giving bot the new 
value mid +1. Each iteration of the loop either decreases the value of top or in- 
creases the value of bot. Thus, if the looping is not stopped by success in the search 
process, the value of top eventually becomes less than the value of bot, which stops 
the looping process and shows that x is not an element of the array.] 

while (top = bot and index = 0) 


bot +t 
mam |222] 


if a[mid] = x then index := mid 
if a[mid] > x 
then top := mid — 1 
else bot := mid + 1 
end while 
[If index has the value 0 at this point, then the while loop was not entered because 
top < bot, and so x is not in the array. Otherwise, index gives the index of the array 
where x is located.] 


Output: index [a nonnegative integer] 


Tracing the Binary Search Algorithm 


Trace the action of Algorithm 11.5.1 on the variables index, bot, top, mid, and the values of 
x given in (a) and (b) below for the input array 


a[l] = Ann, a[2] = Dawn, a[3] = Erik, a[4] = Gail, a[5] = Juan, 
a{6] = Matt, a[7] = Max, a[8] = Rita, a[9] = Tsuji, a[10] = Yuen, 


where alphabetical ordering is used to compare elements of the array. 


a. x = Erik b. x = Sara 
Solution 
a. | index 0 3 b. index 0 
bot 1 3 bot 1 6 9 
top 10 top 10 8 
mid 5 2 3 mid 5 8 
x Erik x Sara 


The Efficiency of the Binary Search Algorithm 


Here briefly is the idea for how to derive the efficiency of the binary search algorithm. At 
each stage of the binary search process, the length of the new subarray to be searched is 
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approximately half that of the previous one, and in the worst case, every subarray down 
to a subarray with a single element must be searched. Consequently, in the worst case, the 
maximum number of iterations of the algorithm’s while loop is 1 more than the number 
of times the original input array can be cut approximately in half. Now if the length n 
of this array is a power of 2 (n = 2* for some integer k), then n can be halved exactly 
k = logy n = [logs n] times before an array of length 1 is reached. If n is not a power of 2, 
then n = 2‘ + m for some integer k (where m < 2%), and thus n can be split approximately 
in half k times also, and so in this case, k = |log n] also. Thus in the worst case, the 
maximum number of iterations of the while loop in the binary search algorithm (which 
is proportional to the number of comparisons required to execute it) is |log,n|+ 1. The 
derivation is concluded by noting that [logs n|+ 1 is O(log, n). 

The details of the derivation are developed in Examples 11.5.2—11.5.6. Throughout the 
derivation, for each integer n = 1, let 


w, = the number of iterations of the while loop 
in a worst-case execution of the binary search 
algorithm for an input array of length n. 


The first issue to consider is this. If the length of the input array for one iteration of 
the while loop is known, what is the greatest possible length of the array input to the next 
iteration? 


The Length of the Input Array to the Next Iteration of the Loop 


Prove that if an array of length k is input to the while loop of the binary search algorithm, 
then after one unsuccessful iteration of the loop, the input to the next iteration is an array 
of length at most |k/2). 


Solution Consider what occurs when an array of length k is input to the while loop in 
the case where x # a[mid]: 


albot], albot+1],..., almid—1], alid], almid+ 1],..., altop — 1], a[top]. 
Se EE 
new input to the while “middle new input to the while 
loop if x < a[mid] element” loop if x > a[mid] 


Since the input array has length k, the value of mid depends on whether k is odd or even. 
In both cases we match up the array elements with the integers from 1 to k and analyze the 
lengths of the left and right subarrays. In case k is odd, both the left and the right subar- 
rays have length [k/2). In case k is even, the left subarray has length |k/2|— 1 and the right 
subarray has length |k/2). The reasoning behind these results is shown in Figure 11.5.3. 

Because the maximum of the numbers |k/2| and |k/2|— 1 is |k/2|, in the worst case this 
will be the length of the array input to the next iteration of the loop. E 


To find the order of the algorithm, an explicit formula for w1, w2, w3, . . . is needed. The 
next example derives a recurrence relation for the sequence. 


Sen weve A Recurrence Relation for w,, W2, W3,... 
Prove that the sequence w4, w2, w3, . . . satisfies the recurrence relation and initial condition 


w,= 1, 


wg = 1+Wy,2, for every integer k > 1. 
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kodd:  a{bot] tee a[mid — 1] a[mid | a(mid + 1] tee a{top] 
t + t ¢ ¢ 
k+1_ k+1 k+1 
1 ides =a 1 5 a +1 k 
> 8 =. 
ak+1_j_k-1_|k _p_fk+1 _k-1_|{k 
length ar l= 7 -|5| length = k ( 7 + 1) +1 7 [5 
keven: aļbot] tee a[mid — 1] a[mid] a[mid + 1] tee a(top] 
t ¢ ¢ $ t 
k k k 
Ž_-1 x X41 gig 
i 2 2 2 k 
oth = * —1-|£]_ -= k ak lk 
length = 1=|5| 1 length = k ( + i+ 1=k [$| 


FIGURE 11.5.3 Lengths of the Left and Right Subarrays 


Solution In Example 11.5.2 it was shown that given an input array of length k to the while 
loop, the worst that can happen is that the next iteration of the loop will have to search an 
array of length |k/2]. Hence the maximum number of iterations of the loop is 1 more than 
the maximum number necessary to execute it for an input array of length |k/2). In symbols, 


Wk = 1+ Wi k/2|+ 
Also w,=1 


because for an input array of length 1 (bot = top), the while loop iterates only one time. E 


Now that a recurrence relation for w, w2, w3, . . . has been found, iteration can be used 
to come up with a good guess for an explicit formula. 


Sen wieeee An Explicit Formula for w4, W2, w3,... 


Apply iteration to the recurrence relation found in Example 11.5.3 to conjecture an explicit 
formula for w4, w2, W3,... 


Solution Begin by iterating to find the values of the first few terms of the sequence. 


wo= ieJ isg+i 
Pig lroa l+w =1+1 =2':2=141 


W3 = 1+ wg/2 l+w, 1+1 l 
| 4=2:3=2+1 


Pu ae: 1 +w 12 
we = 1 + w62 1 +w; 1+2 
w7 = 1tw)= 1tw3=14+2 
We= 1 +wg/2 1 +w, 1+3 

Hyg S14 Hoe 1 +w, 14+3 


Wa= 1+ war 1+w, 1+2 
8=2:4=3+1 


-BGOwwwn@®Qne@e 


wis = 1+ wisy = 1+w, = 143 e 
w@ = 1+ w16/2 1 + wg 1+4 


isp Segt 
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Note that in each case when the subscript n is between two powers of 2, w, is 1 more than 
the exponent of the lower power of 2. In other words: 


If2i'<n<2'*!, thenw, =i+1. 11.5.1 
Now if Panag tl 
then [by property (11.4.2) of Example 11.4.1] 


i = log, n|. 
Substituting into statement (11.5.1) gives the conjecture that 


w, = log, n|+ 1 a 
Mathematical induction can then be used to verify the correctness of the formula found 
in Example 11.5.4. 
Verifying the Correctness of the Formula 


Use strong mathematical induction to show that if w4, w2, w3, . . . is a Sequence of numbers 
that satisfies the recurrence relation and initial condition 


w,=1 and wg = l +w for each integer k > 1, 


then w4, W2, W3, . . . satisfies the formula 


w, = llog.n|+1 for every integer n = 1. 


Solution Let wı, w2, w3, . . . be the sequence defined by specifying that w, = 1 and 
wg = 1 + wgy2j for each integer k = 2, and let the property P(n) be the equation 


w, = logzn]+ 1. <— Pn) 


We will use strong mathematical induction to prove that for every integer n = 1, P(n) is 
true. 


Show that P(1) is true: By definition of w, w2, w3, . . . , we have that w, = 1. But 
logs 1]+ 1 =0+1 = 1 also. Thus w = [log 1]+ 1 and P(/) is true. 


Show that for every integer k > 1, if P(i) is true for each integer i from 1 through k, then 
P(k +1) is also true: Let k be any integer with k = 1, and suppose that 


w; =|log,iJ)+1 foreach integer i with 1 S i S k. <— inductive hypothesis 


We must show that 
wg+1 = lloga (k+ 1)|+ 1. < P(k+1) 


Consider the two cases: k is even and k is odd. 
Case 1 (k is even): In this case, k + 1 is odd, and 


Wer = 1+ Wests) by definition of w1, W2, w3,.. - 
=1+ Wik/2| because |(k + 1)/2| = k/2 since k + 1 is odd 


=]+ ((logs(k/2)| +1) by inductive hypothesis because, since k is 
even, k = 2, and so 1 S|k/2)Sk/2 < k 


= |log(k) — log22|+2 by substituting into the identity log,(x/y) = log, x — log, y 


from Theorem 7.2.1 


= [logo(k) — 1]+2 since logs 2 = 1 
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= (Logs (k)|— 1) +2 by substituting x = log,(k) into the identity 
[x — 1] = |x]— 1 derived in exercise 15 of Section 4.6 
= [logo (k+ 1)|+ 1 by property (11.4.3) in Example 11.4.2. 


Case 2 (k is odd): In this case, it can also be shown that w; = [logs k| + 1. The analysis is 
very similar to that of case | and is left as exercise 16 at the end of the section. 
Hence regardless of whether k is even or k is odd, 

Wi+1 = llogx(k + 1)]+ 1, 


as was to be shown. [Since both the basis and the inductive steps have been demonstrated, the 
proof by strong mathematical induction is complete.] a 


The final example shows how to use the formula for w4, w2, w3, . . . to find a worst-case 
order for the algorithm. 


The Binary Search Algorithm Is Logarithmic 
By Example 11.5.5, 
w, =|log.n|+ 1 for every integer n = 1. 
Use this result to show that in the worst case, the binary search algorithm is O(log, n). 


Solution Let = stand for the words “which implies that.” Then for each integer n = 2, 


w, = [log.n|+ 1 by Example 11.5.5 
= lognsw, Slogsntl because x < |x]+ 1 and |x] < x for 
every real number x 


=> logy n=w,= logy n+ logs n since the logarithm with base 2 is increasing, 
when 2 < n, then 1 = logy 2 = log) n. 
=> logn S w, S 2logon. 


Let A = 1, B = 2, and k = 2. Then 
A logn S w, S Blog,n for every integer n = k. 
Hence by definition of O-notation, 
w, is O(log, n). 


Finally, because w,, the number of iterations of the while loop, is proportional to the num- 
ber of comparisons performed when the binary search algorithm is executed, the binary 
search algorithm is O(log; n). E 


Examples 11.5.2—11.5.6 show that in the worst case, the binary search algorithm has 
order logy n. As noted in Section 11.3, in the worst case the sequential search algorithm 
has order n. This difference in efficiency becomes increasingly more important as n gets 
larger and larger. If one loop iteration is performed each nanosecond, then performing n 
iterations for n = 100,000,000 requires 0.1 second, whereas performing log, n iterations 
requires 0.000000027 second. For n = 100,000,000,000 the times are 1.67 minutes and 
0.000000037 second, respectively. And for n = 100,000,000,000,000 the respective times 
are 27.78 hours and 0.000000047 second. 


Merge Sort 


Note that it is much easier to write a detailed algorithm for sequential search than for binary 
search. Yet binary search is much more efficient than sequential search. Such trade-offs 
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often occur in computer science. Frequently, the straightforward “obvious” solution to a 
problem is less efficient than a clever solution that is more complicated to describe. 

In the text and exercises for Section 11.3, we gave two methods for sorting, insertion sort 
and selection sort, both of which are formalizations of methods human beings often use in 
ordinary situations. Can a divide-and-conquer approach be used to find a sorting method 
more efficient than these? It turns out that the answer is an emphatic “yes.” In fact, over the 
past few decades, computer scientists have developed several divide-and-conquer sorting 
methods all of which are somewhat more complex to describe but are significantly more 
efficient than either insertion sort or selection sort. 

One of these methods, merge sort, is obtained by thinking recursively. Imagine that 
an efficient way for sorting arrays of length less than k is already known. How can such 
knowledge be used to sort an array of length k? One way is to suppose the array of length 
k is split into two roughly equal parts and each part is sorted using the known method. Is 
there an efficient way to combine the parts into a sorted array? Sure. Just “merge” them. 

Figure 11.5.4 illustrates how a merge works. Imagine that the elements of two ordered 
subarrays—2, 5, 6, 8 and 3, 6, 7, 9—are written on slips of paper (to make them easy to 
move around). Place the slips for each subarray in two columns on a tabletop, one at the 
left and one at the right. Along the bottom of the tabletop, set up eight positions into which 
the slips will be moved. Then, one-by-one, bring down the slips from the bottoms of the 
columns. At each stage compare the numbers on the slips currently at the column bottoms, 
and move the slip containing the smaller number down into the next position in the array 
as a whole. If at any stage the two numbers are equal, take, say, the slip on the left to move 
into the next position. And if one of the columns is empty at any stage, just move all the 
slips from the other column into the correct positions one-by-one in order. 


Tabletop 


NI WN |} o |} oo 
wi aD |} |} vo 


S 


2] [3] [s] [6] L6] [7] [s] [o 


00000000 


FIGURE 11.5.4 Merging Two Sorted Subarrays to Obtain a Sorted Array 


The process of merging is efficient because the total number of comparisons needed to 
merge two subarrays into an array of length k is just k — 1. You can see why by analyzing 
Figure 11.5.4. Observe that at each stage, the decision about which slip to move is made by 
comparing the numbers on the slips currently at the bottoms of the two columns—except 
when one of the columns is empty, in which case no comparisons are made at all. Thus in 
the worst case there will be one comparison for each of the k positions in the final array 
except the very last one (because when the last slip is placed into position, the other column 
is sure to be empty), or a total of k — 1 comparisons in all. 

The merge sort algorithm is recursive because its defining statements include references 
to itself. The algorithm is well defined, however, because at each stage the length of the 
array that is input to the algorithm is shorter than at the previous stage, so that, ultimately, 
the algorithm has to deal only with arrays of length 1, which are already sorted. Specifi- 
cally, merge sort works as follows. 
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Given an array of elements that can be put into order, if the array consists of a single 
element, leave it as it is. It is already sorted. Otherwise: 


1. Divide the array into two subarrays of as nearly equal length as possible. 
2. Use merge sort to sort each subarray. 


3. Merge the two subarrays together. 


Figure 11.5.5 illustrates a merge sort in a particular case. 


Initial array: 


Sorted array: 


FIGURE 11.5.5 Applying Merge Sort to the Array 5, 2, 4, 6, 1, 3, 2, 6 


As in the case of the binary search algorithm, in order to formalize merge sort 
we must decide at exactly what point to split each array. Given an array denoted 
by a[bot], a[bot+ 1], . . . , altop], let mid = |(bot + top)/2). Take the left subarray to 
be a[bot], a[bot + 1], ..., a[mid] and the right subarray to be a[mid + 1], a[mid+2],..., 
a{top]. The following is a formal version of merge sort. 


Algorithm 11.5.2 Merge Sort 


[The aim of this algorithm is to take an array of elements afr], a[r + 1],..., als] (where 
r < s) and to order it. The output array is denoted afr], a[r+1],... , als] also. It has 
the same values as the input array, but they are in ascending order. The input array is 
split into two nearly equal-length subarrays, each of which is ordered using merge sort. 
Then the two subarrays are merged together.] 


Input: r and s [positive integers with r = s], a[r], a[r + 1], . . . , als] [an array of data 
items that can be ordered] 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


11.5 APPLICATION: ANALYSIS OF ALGORITHM EFFICIENCY II 823 


Algorithm Body: 
bot := r, top := s 
while (bot < top) 

bot + to 
in| 
2 
call merge sort with input bot, mid, and 
albot|, a[bot+ 1],..., al[mid] 
call merge sort with input mid + 1, top and 
a[mid + 1], a[mid + 2], . . . , a[top] 
[After these steps are completed, both arrays a[bot], a[bot+ 1],... , a[mid] 
and a|[mid + 1], a[mid + 2], . . . , a[top] are in order.] 
merge a[bot], a[bot + 1], ..., almid] and a[mid + 1], a[mid + 2], . . . , a[top] 
[This step can be done with a call to a merge algorithm. To put the final ar- 
ray in ascending order, the merge algorithm must be written so as to take two 


arrays in ascending order and merge them into an array in ascending order.] 
end while 


Output: afr], alr + 1], . . . , a[s] [an array with the same elements as the input array 
but in ascending order] 


To derive the efficiency of merge sort, let 


m, = the maximum number of comparisons used 
when merge sort is applied to an array of length n. 


Then mı = 0 because no comparisons are used when merge sort is applied to an array of 
length 1. Also for any integer k > 1, consider an array a[bot], a[bot+ 1], . . . , a[top] of 
length k that is split into two subarrays, a[bot], a[bot+ 1], . . . , almid] and a[mid+ 1], 
almid + 2],..., a[top], where mid = |(bot + top)/2). In exercise 24 you are asked to show 
that the right subarray has length |k/2] and the left subarray has length [k/2]. From the 
previous discussion of the merge process, it is known that to merge two subarrays into an 
array of length k, at most k — 1 comparisons are needed. 
Consequently, 


the number of comparisons the number of comparisons 
when merge sort is applied | = | when merge sort is applied 
to an array of length k to an array of length |k/2| 


the number of comparisons the number of comparisons 
+ | when merge sort is applied | +| used to merge two subarrays 
to an array of length [k/2] into an array of length k. 


In other words, 


mg = My/2) + mgn + (K— 1) for every integer k > 1. 
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In exercise 25 you are asked to use this recurrence relation to show that 
7” logon =m, S 2n logon for every integer n = 1. 


It follows that merge sort is O(n log; n). 

In the text and exercises for Section 11.3, we showed that insertion sort and selection 
sort are both @(n?). How much difference can it make that merge sort is O(n log, n)? If 
n = 100,000,000 and a computer is used that performs one operation each nanosecond, the 
time needed to perform n log, n operations is about 2.7 seconds, whereas the time needed 
to perform n operations is over 115 days. 


Tractable and Intractable Problems 


At an opposite extreme from an algorithm such as binary search, which has logarithmic 
order, is an algorithm with exponential order. For example, consider an algorithm to direct 
the movement of each of the 64 disks in the Tower of Hanoi puzzle as they are transferred 
one by one from one pole to another. In Section 5.7 we showed that such a transfer requires 
2% —1 steps. If a computer took a nanosecond to calculate each transfer step, the total time 


to calculate all the steps would be 
1 1 
. = 584.5 years. 
24 


O] 


C a i 365.25 
numberof moves seconds minutes hours days 
moves per per per per per 

second minute hour day year 


Problems whose solutions can be found with algorithms whose worst-case order with 
respect to time is a polynomial are said to belong to class P. They are called polyno- 
mial-time algorithms and are said to be tractable. Problems that cannot be solved in 
polynomial time are called intractable. For certain problems, it is possible to check the 
correctness of a proposed solution with a polynomial-time algorithm, but it may not be 
possible to find a solution in polynomial time. Such problems are said to belong to class 
NP.* The biggest open question in theoretical computer science is whether every problem 
in class NP belongs to class P. This is known as the P vs. NP problem. The Clay Institute, 
in Cambridge, Massachusetts, has offered a prize of $1,000,000 to anyone who can either 
prove or disprove that P = NP. 

In recent years, computer scientists have identified a fairly large set of problems, called 
NP-complete, that all belong to class NP but are widely believed not to belong to class P. 
What is known for sure is that if any one of these problems is solvable in polynomial time, 
then so are all the others. One of the NP-complete problems, commonly known as the 
traveling salesman problem, was discussed in Section 10.1. 


A Final Remark on Algorithm Efficiency 


This section and the previous one on algorithm efficiency have offered only a partial 
view of what is involved in analyzing a computer algorithm. For one thing, it is assumed 
that searches and sorts take place in the memory of the computer. Searches and sorts on 


*Technically speaking, a problem whose solution can be verified on an ordinary computer (or determinis- 
tic sequential machine) with a polynomial-time algorithm can be solved on a nondeterministic sequential 
machine with a polynomial-time algorithm. Such problems are called NP, which stands for nondeterministic 
polynomial-time algorithm. 
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disk-based files require different algorithms, though the methods for their analysis are 
similar. For another thing, as mentioned at the beginning of Section 11.3, time efficiency is 
not the only factor that matters in the decision about which algorithm to choose. Although 
computer memory is now very inexpensive, the amount of memory space required may 
also be a factor. There are mathematical techniques to estimate space efficiency, which are 
very similar to those used to estimate time efficiency. Furthermore, as parallel processing 
of data becomes increasingly prevalent, current methods of algorithm analysis are being 
modified and extended to apply to algorithms designed for this new technology. 


TEST YOURSELF 


1. To solve a problem using a divide-and-conquer 
algorithm, you reduce it to a fixed number of 
smaller problems of the same kind, which can 
themselves be , and so forth until 


2. To search an array using the binary search algo- 
rithm in each step, you compare a middle element 
of the array to . If the middle element is less 
than ____, you , and if the middle element 
is greater than , you 


EXERCISE SET 11.5 


3. The worst-case order of the binary search algo- 
rithm is 


4. To sort an array using the merge sort algorithm, 
in each step until the last one you split the array 
into approximately two equal sections and sort 
each section using . Then you the two 
sorted sections. 


5. The worst-case order of the merge sort algorithm 
is 


1. Use the facts that log, 10 = 3.32 and that 
for each real number a, log»(10°) = a log, 10 
to find log(1,000), log>(1,000,000), and 
log>(1,000,000,000,000). 


2. Suppose an algorithm requires c|log, n] operations 
when performed with an input of size n (where c is 

a constant). 

a. By what factor will the number of operations 
increase when the input size is increased from 
m to m° (where m is a positive integer power 
of 2)? 

b. By what factor will the number of operations 
increase when the input size is increased from 
m to m” (where m is a positive integer power 
of 2)? 

c. When n increases from 128 (= 2”) to 
268,435,456 (= a) by what factor is clog, n] 
increased? 

Exercises 3 and 4 illustrate that for relatively small 
values of n, algorithms with larger orders can be more 
efficient than algorithms with smaller orders. Use a 
graphing calculator or computer to answer these 
questions. 


3. For what values of n is an algorithm that requires 
n operations more efficient than an algorithm that 
requires |50log, n] operations? 


4. For what values of n is an algorithm that requires 
\n”/10| operations more efficient than an algorithm 
that requires |n log, n] operations? 

In 5 and 6, trace the action of the binary search algorithm 
(Algorithm 11.5.1) on the variables index, bot, top, mid, 
and the given values of x for the input array a[1] = Chia, 
a[2] = Doug, a[3] = Jan, a[4] = Jim, a[5] = Jose, a[6] = 
Mary, a[7] = Rob, a[8] = Roy, a[9] = Sue, a[10] = Usha, 
where alphabetical ordering is used to compare elements 


of the array. 
5. a. x = Chia b. x = Max 
6. a. x = Amanda b. x = Roy 


7. Suppose bot and top are positive integers with 
bot = top. Consider the array 


a[bot], a[bot+ 1],..., atop]. 


a. How many elements are in this array? 
b. Show that if the number of elements in the ar- 
ray is odd, then the quantity bot + top is even. 
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c. Show that if the number of elements in the ar- 
ray is even, then the quantity bot + top is odd. 


Exercises 8-11 refer to the following algorithm segment. 
For each positive integer n, let a, be the number of itera- 
tions of the while loop. 
while (n > 0) 
n:=ndiv2 
end while 


8. Trace the action of this algorithm segment on n 
when the initial value of n is 27. 


9. Find a recurrence relation for a,. 
10. Find an explicit formula for a,,. 


11. Find an order for this algorithm segment. 
Exercises 12-15 refer to the following algorithm segment. 


For each positive integer n, let b, be the number of itera- 
tions of the while loop. 


while (n > 0) 
n:=ndiv3 


end while 


12. Trace the action of this algorithm segment on n 
when the initial value of n is 424. 


13. Find a recurrence relation for b,. 


H14. a. Use iteration to guess an explicit formula 
for b,. 
b. Prove that if k is an integer and x is a real num- 
ber with 3° = x < 3*, then [log3 x] = k. 
c. Prove that for every integer m = 1, 


log3(3m)| = [log3(3m+ 1)| = [log3(3m + 2)|. 


d. Prove the correctness of the formula you found 
in part (a). 


15. Find an order for the algorithm segment. 


16. Complete the proof of case 2 of the strong 
induction argument in Example 11.5.5. In other 
words, show that if k is an odd integer and 
w; = [logs i|+ 1 for every integer i with | S i S k, 
then wz+ı = [log.k+ 1J+ 1. 

For 17-19, modify the binary search algorithm (Algo- 

rithm 11.5.1) to take the upper of the two middle array 

elements in case the input array has even length. In other 
words, in Algorithm 11.5.1 replace 


K + L 


bot + to 
mid := oe 


with mid := | 
2 
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17. Trace the modified binary search algorithm for the 
same input as was used in Example 11.5.1. 


18. Suppose an array of length k is input to the 
while loop of the modified binary search algo- 
rithm. Show that after one iteration of the loop, 
if a[mid] # x, the input to the next iteration is an 
array of length at most |k/2). 


19. Let w, be the number of iterations of the while 
loop in a worst-case execution of the modified bi- 
nary search algorithm for an input array of length 
n. Show that w, = 1 + wyg for k = 2. 

In 20 and 21, draw a diagram like Figure 11.5.4 to show 

how to merge the given subarrays into a single array in 

ascending order. 


20. 3,5, 6, 9, 12 and 2, 4, 7, 9, 11 

21. F, K, L, R, U and C, E, L, P, W (alphabetical 
order) 

In 22 and 23, draw a diagram like Figure 11.5.5 to show 

how merge sort works for the given input arrays. 

22. R, G, B, U, C, F, H, G (alphabetical order) 

23. 5, 2, 3, 9, 7, 4, 3,2 


24. Show that given an array a[bot], a[bot+ 1],..., 
altop] of length k, if mid = |(bot + top)/2| then 
a. the subarray a[mid+ 1], a[mid+2],..., 
a{top] has length |k/2|. 
b. the subarray a[bot], a[bot + 1], ..., a[mid] has 


length [k/2]. 
H 25. The recurrence relation for mı, m2, m3,... , which 
arises in the calculation of the efficiency of merge 


sort, is 

m, = 0 

Mg = Mgt Mgt k=l. 
Show that for every integer n = 1, 


a. İn logn =m, 
b. m, S 2n log n 
26. It might seem that n — 1 multiplications are 
needed to compute x”, since 
Ke ee 

UA EA 

n— 1 multiplications 
But observe that, for instance, since 6 = 4+ 2, 

fados Re. 


Thus x° can be computed using three multiplica- 
tions: one to compute x, one to compute (°°), 
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and one to multiply (x) times x”. Similarly, since 
11=8+2+1, 


xt = 8291 = (A 


and so x'! can be computed using five multiplica- 
tions: one to compute x’, one to compute (x*)°, one 
to compute (yy, one to multiply (ESDH times 
x’, and one to multiply that product by x. 
a. Write an algorithm to take a real number x and 
a positive integer n and compute x” by 
(i) calling Algorithm 5.1.1 to find the binary 
representation of n: 


ANSWERS FOR TEST YOURSELF 


(r[k] r[k— 1] +> r[0])2, 


where each r{i] is 0 or 1; 
(ii) computing x, x”, 2, er ral by squaring, 
then squaring again, and so forth; 
(iii) computing x” using the fact that 
pom i+ + 22? + 1a! + r[0]2? 


= Te... g. a, oR 


b. Show that the number of multiplications per- 
formed by the algorithm of part (a) is less than 
or equal to 2|logz n]. 


1. reduced to the same finite number of smaller problems of 
the same kind; easily resolved problems are obtained 

2. the element you are looking for; the element you are 
looking for; apply the binary search algorithm to the lower 


half of the array; the element you are looking for; apply the 
binary search algorithm to the upper half of the array 

3. logy n, where n is the length of the array 4. merge sort; 
merge 5.nlog.n 
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REGULAR EXPRESSIONS 
AND FINITE-STATE 
AUTOMATA 


The theoretical foundations of computer science are derived from several disciplines: logic 
(the foundations of mathematics), electrical engineering (the design of switching circuits), 
brain research (models of neurons), and linguistics (the formal specification of languages). 

As discussed briefly in Sections 6.4 and 7.4, the 1930s saw the development of math- 
ematical treatments of basic questions concerning what can be proved in mathematics and 
what can be computed by means of a finite sequence of mechanized operations. Although 
the first digital computers were not built until the early 1940s, ten years earlier Alan Turing 
developed a simple abstract model of a machine, now called a Turing machine, by means of 
which he defined what it would mean for a function to be computable. 

Around the same time, somewhat similar models of computation were developed by 
the American logicians Alonzo Church, Stephen C. Kleene, and Emil Post (who was 
born in Poland but came to the United States as a child), but Church and others showed 
these all to be equivalent. As a result, Church formulated a conjecture, now known as 
the Church-Turing thesis, asserting that the Turing machine is universal in the sense 
that anything that can ever be computed on a machine can be computed with a Turing 
machine. If this thesis is correct—which is widely believed—then all computers that 
have been or will ever be constructed are theoretically equivalent in what they can do, 
although they may differ widely in speed and storage capacity. For instance, quantum 
computers may have the capability to compute certain quantities enormously faster than 
classical computers. But Church’s thesis implies that the theory of computation is likely 
to remain fundamentally the same, even though the enabling technology is subject to 
constant change. 

In the early 1940s, Warren S. McCulloch and Walter Pitts, working at the Massachusetts 
Institute of Technology (M.LT.), developed a model of how the neurons in the brain might 
work and how models of neurons could be combined to make “circuits” or “automata” ca- 
pable of more complicated computations. To a certain extent, they were influenced by the 
results of Claude Shannon, who also worked at M.LT. and had in the 1930s developed the 
foundations of a theory that implemented Boolean functions as switching circuits. In the 
1950s, Kleene analyzed the work of McCulloch and Pitts and connected it with versions of 
the machine models introduced by Turing and others. 

Another development of the 1950s was the introduction of high-level computer languag- 
es. During the same years, linguist Noam Chomsky’s attempts to understand the underly- 
ing principles by means of which human beings generate speech led him to develop a theo- 
ry of formal languages, which he defined using sets of abstract rules, called grammars, of 
varying levels of complexity. It soon became apparent that Chomsky’s theory was of great 
utility in the analysis and construction of computer languages. For computer science, the 
most useful of Chomsky’s language classifications are also the two simplest: the regular 
languages and the context-free languages. 


828 
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Note Automaton is 
pronounced aw-TAHM- 
uh-than. The plural of 
automaton is automata. 


12.1 


DPA Picture Alliance Archive/Alamy Stock Photo 


Noam Chomsky 
(born 1928) 
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Regular languages, which are defined by regular expressions, are used extensively 
for matching patterns within text (as in word processing or Internet searches) and for 
lexical analysis in computer language compilers. They are part of sophisticated text 
editors and a number of UNIX“ utilities, and they are also used in transforming XML’ 
documents. 

Using the Backus-Naur notation (introduced in Section 10.4), context-free languages 
can describe many of the more complex aspects of modern high-level computer languages, 
and they form the basis for a key part of compilers, which translate programs written in a 
high-level language into machine code suitable for execution. 

A remarkable fact is that all of the subjects referred to previously are related. Each 
context-free grammar turns out to be equivalent to a type of automaton called a pushdown 
automaton, and each regular expression turns out to be equivalent to a type of automaton 
called a finite-state automaton. In this chapter, we focus on the study of regular languages 
and finite-state automata, leaving the subject of context-free grammars and their equiva- 
lent automata to a later course in compiler construction or automata theory. 


Formal Languages and Regular Expressions 


The mind has finite means but it makes unbounded use of them and in very specific 
and organized ways. That’s the core problem of language that it became possible to 
face [by the mid-twentieth century]. —Noam Chomsky, circa 1998 


An English sentence can be regarded as a string of words, and an English word can be 
regarded as a string of letters. Not every string of letters is a legitimate word, and not every 
string of words is a grammatical sentence. We could say that a word is legitimate if it can 
be found in an unabridged English dictionary and that a sentence is grammatical if it satis- 
fies the rules in a standard English grammar book. 

Computer languages are similar to English in that certain strings of characters are le- 
gitimate words of the language and certain strings of words can be put together according 
to certain rules to form syntactically correct programs. A compiler for a computer lan- 
guage analyzes the stream of characters in a program—first to recognize individual word 
and sentence units (this part of the compiler is called a lexical scanner), then to analyze the 
syntax, or grammar, of the sentences (this part is called a syntactic analyzer), and finally to 
translate the sentences into machine code (this part is called a code generator). 

In computer science it has proved useful to look at languages from a very abstract point 
of view as strings of certain fundamental units and allow any finite set of symbols to be 
used as an alphabet. It is common to denote an alphabet by a capital Greek sigma: ÈX. (This 
just happens to be the same symbol as the one used for summation, but the two concepts 
have no other connection.) 

In Section 5.9 we used recursion to give a formal definition for strings over an alpha- 
bet, and we used structural induction to give rigorous proofs for their properties. The 
formal definition provides a solid foundation for picturing strings as juxtapositions of 
characters. 


“UNIX is an operating system that was developed in 1969 by Kenneth Thompson at Bell Laboratories. 
Originally written in assembly language, it was later rewritten in Dennis Ritchie’s C language, which was also 
developed at Bell Laboratories. 


*XML is a standard for defining markup languages used for Internet applications. 
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Note A Greek sigma is 


the traditional symbol Alphabet È: a finite set of characters 

used to denote an alpha- a o. 

bet in a formal language. String over È: (1) a finite juxtaposition of elements (called charac- 
It is smaller than the ters) of È or (2) the null string À 


Greek sigma used for a 


: Length of a string over =: the number of characters that made up the string, 
summation. 


with the null string having length 0 


Formal language over È: a set of strings over the alphabet 


Note that the empty set satisfies the criteria for being a formal language. Allowing the 
empty set to be a formal language turns out to be convenient in certain technical situations. 


Examples of Formal Languages 
Let the alphabet È = {a, b}. 


a. Define a language L, over È to be the set of all strings that begin with the character a 
and have length at most three characters. Find L4. 


b. A palindrome is a string that looks the same if the order of its characters is reversed. 
For instance, aba and baab are palindromes. Define a language L, over È to be the set 
of all palindromes obtained using the characters of 2. Write ten elements of Lp. 


Solution 
a. L, = {a, aa, ab, aaa, aab, aba, abb} 


b. L, contains the following ten strings (among infinitely many others): 


A, a, b, aa, bb, aaa, bab, abba, babaabab, abaabbbbbaaba E 


Let È be an alphabet. For each nonnegative integer n, let 


>” = the set of all strings over È that have length n, 
>* = the set of all strings over = that have length at least 1, and 
py 


= the set of all strings over È. 


University of Wisconsin Digital Collections Center 


Note that =” is essentially the Cartesian product of n copies of >. The language È* is 
called the Kleene closure of 2, in honor of Stephen C. Kleene (pronounced CLAY-nee). 


Stephen C. Kleene 


(1909-1994) >” is the set of all strings over È except for A and is called the positive closure of È. 
See The Languages >”, >", and >* 
Let È = {a, b}. 


a. Find es = > and >’, 
b. LetA = X’ US! and B = X? U °. Use words to describe A, B, and A U B. 


c. Describe a systematic way of writing the elements of =*. What change needs to be 
made to obtain the elements of >"? 
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Solution 
a. E° = {A}, E! = {a, b}, E? = {aa, ab, ba, bb}, and E? = {aaa, aab, aba, abb, baa, bab, 
bba, bbb} 


b. A is the set of all strings over È of length at most 1. 
B is the set of all strings over È of length 2 or 3. 
A U Bis the set of all strings over È of length at most 3. 


c. Elements of =* can be written systematically by writing all the strings of length 1, 
then all the strings of length 2, and so forth. 


dt: a, b, aa, ab, ba, bb, aaa, aab, aba, abb, baa, bab, bba, bbb, aaaa,.... 


Of course, the process of writing the strings in =* would continue forever, because =* 
is an infinite set. The only change that needs to be made to obtain È* is to place the 
null string at the beginning of the list. E 


Polish Notation: A Language Consisting of Postfix Expressions 


An expression such as a + b, in which a binary operator such as + sits between the two 
quantities on which it acts, is said to be written in infix notation. Alternative nota- 
tions are called prefix notation (in which the binary operator precedes the quantities on 
which it acts) and postfix notation (in which the binary operator follows the quantities 
on which it acts). In prefix notation, a + b is written + ab. In postfix notation, a + b is 
written ab +. 

Prefix and postfix notations were introduced in 1920 by the Polish mathematician Jan 
Łukasiewicz (pronounced Wu-ca-SHAY-vich). In his honor—and because some people 
have difficulty pronouncing his name—they are often referred to as Polish notation and 
reverse Polish notation, respectively. A great advantage of these notations is that they 
eliminate the need for parentheses in writing arithmetic expressions. For instance, in post- 
fix (or reverse Polish) notation, the expression 8 4 + 6/ is evaluated from left to right as 
follows: Add 8 and 4 to obtain 12, and then divide 12 by 6 to obtain 2. As another example, 
if the expression (a + b)-c in infix notation is converted to postfix notation, the result is 
ab+c-. 


a. If the expression ab-cd- + in postfix notation is converted to infix notation, what is 
the result? 


b. Let È = {4, 1, +, —}, and let L = the set of all strings over È obtained by writing 
either a 4 or a 1 first, then either a 4 or a 1, and finally either a+ or a —. List all ele- 
ments of L between braces, and evaluate the resulting expressions. 


Solution 

a. a-b+c-d 

b. L= {41+,41-,144+,14-,44+,44-,11+4+,11-—-} 
41+ =4+1=5, 41-=4-1=3, 14+=1+4=5, 
14-=1-4=—3, 44+=4+4 =8 44-=4-4=0, 
11+ =1+1=2, 11- =1-1 =0 E 


The following definition describes ways in which languages can be combined to form 
new languages. 
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Definition 


Let È be an alphabet. Given any strings x and y over È, the concatenation of x and y 
is the string obtained by writing all the characters of x followed by all the characters of 
y. For any languages L and L’ over È, three new languages can be defined as follows: 


The concatenation of L and L’, denoted LI’, is 
LL’ = {xy |x E Landy E L'}. 
The union of L and L’, denoted L U L’, is 
LUL' = ke E LE 
The Kleene closure of L, denoted L*, is 
L* = {x | x is a concatenation of any finite number of strings in L}. 


Note that À is in L” because it is regarded as a concatenation of zero strings in L. 


New Languages from Old 


Let L, be the set of all strings consisting of an even number of a’s (namely, A, aa, aaaa, 
aaaaaa,...), and let L, = {b, bb, bbb}. Find L,L5, L; U Ly, and (L; U L,)*. Note that the 
null string A is in L; because 0 is an even number. 

Solution 


L, Ly = the set of all strings that consist of an even number of a’s followed by b or by bb 
or by bbb. 


Lı U L = the set that includes the strings b, bb, bbb and any strings consisting of an even 
number of a’s. 


(Li U Ly)" = the set of all strings of a’s and b’s in which every occurrence of a is in a block 
consisting of an even number of a’s. | 


The Language Defined by a Regular Expression 


One of the most useful ways to define a language is by means of a regular expression, a 
concept first introduced by Kleene. We give a recursive definition for generating the set of 
all regular expressions over an alphabet. 


Definition 


Given an alphabet È, the following are regular expressions over >: 
I. Base: @, A, and each individual symbol in È are regular expressions over >. 


II. Recursion: If r and s are regular expressions over È, then the following are also 
regular expressions over =: 


(i) (rs) (i) (| 8) (ii) r”, 
where rs denotes the concatenation of r and s, r“ denotes the concatenation of r 


with itself any finite number (including zero) of times, and r | s denotes either one 
of the strings r or s. The regular expression r* is called the Kleene closure of r. 


II. Restriction: Nothing is a regular expression over È except for objects defined in 
(I) and (II) above. 
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As an example, one regular expression over È = {a, b, c} is 


a| | © | aby. 


If the alphabet È happens to include symbols—such as (, or |, or), or “—special provi- 
sions have to be made to avoid ambiguity. An escape character, usually a backslash, is 
added before the potentially ambiguous symbol. For instance, a left parenthesis would be 
written as \( and the backslash itself would be written as \\. 

To eliminate parentheses, an order of precedence for the operations used to define 
regular expressions has been introduced. The highest is *, concatenation is next, and | is the 
lowest. It is also customary to eliminate the outer set of parentheses in a regular expression 
when the order of precedence rules are sufficient for avoiding ambiguity. Thus 


(abo) = a(b" and (a | (bc)) =a | bc. 


Order of Precedence for the Operations in a Regular Expression 
a. Add parentheses to emphasize the order of precedence in the following expression: 
ab* | b*a. 
b. Use the rules about order of precedence to eliminate the parentheses in the following 
expression: (a | (b*)o))(a’)). 
Solution 
a. ((a(b*)) | (b*)a)) b. (| b*oa* E 


Given a finite alphabet, every regular expression over the alphabet defines a for- 
mal language. The function from regular expressions to formal languages is defined 
recursively. 


Definition 


For any finite alphabet È, the function L that associates a language to each regular 
expression over > is defined by (I)—(II]) below. For each such regular expression r, 
L(v) is called the language defined by r. 


I. Base: LØ) = Ø, L(A) = {A}, L(a) = {a} for every ain >. 


II. Recursion: If L(r) and L(r’) are the languages defined by the regular expressions 
rand r’ over >, then 


@LOT)=LOLY’) G)L¢|r7)=LO ULC’) ii) L@*) = (LOM). 


III. Restriction: The function L is completely determined by I and II above. 


Note that any finite language can be defined by a regular expression. For instance, the 
language {cat, dog, bird} is defined by the regular expression (cat | dog | bird). An important 
example is the following. 


Using Set Notation to Describe the Language Defined by a Regular Expression 


Let È = {a, b}, and consider the language defined by the regular expression (a | b)*. Use set 
notation to find this language, and describe it in words. 
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Solution The language defined by (a | b)“ is 
Lla | b)*) = (L(a| by)" 
= (L(a) U Lib)" 
= ({a} U {b})* 
={a, b}* by definition of operations on languages 
= the set of all strings of a’s and b’s 


=>* E 


In Section 5.9 we proved that concatenation of strings is associative. Taking unions of 
sets is also an associative operation. Thus for any regular expressions r, s, and t, 


L((rs)t) = L(r(st)), 
and 
Lr |S) |Ò = (Lir | s) U LÀ by definition of | 
= (L(r) U L(s)) U LY) by definition of | 
= L(r) U (L(s) U L(®)) by the associativity of union for sets 
= L(r) U (L(s|Ð) by definition of | 
= L(r|(s| 9) by definition of |. 


Because of these relationships, it is customary to drop the parentheses in “associative” 
situations and write 


rst = (rs)t = r(st) 
and 
rls|t=(|9|t=r|clod. 
As you become accustomed to working with regular expressions, you will find that you 


do not need to go through a formal derivation in order to determine the language defined 
by an expression. 


The Language Defined by a Regular Expression 
Let È = {0, 1}. Use words to describe the languages defined by the following regular ex- 
pressions over È. 
a. 0*1" | TO b. 00| D* 
Solution 


a. The strings in this language consist either of a string of 0’s followed by a string of 
Ps or of a string of 1’s followed by a string of 0’s. However, in either case the strings 
could be empty, which means that À is also in the language. 


b. The strings in this language have to start with a 0. The 0 may be followed by any 
finite number (including zero) of 0’s and 1’s in any order. Thus the language is the set 
of all strings of 0’s and 1’s that start with a 0. B 


Individual Strings in the Language Defined by a Regular Expression 


In each of (a) and (b), let È = {a, b} and consider the language L over È defined by the 
given regular expression. 


a. The regular expression is a“b(a | b)*. Write five strings that belong to L. 
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b. The regular expression is a” | (ab)*. Indicate which of the following strings belong to L: 


a b aaaa abba ababab 


Solution 
a. The strings b, ab, abbb, abaaa, and ababba are five strings from the infinitely many 
in L. 


b. The following strings are the only ones listed that belong to L: a, aaaa, and ababab. 
The string b does not belong to L because it is neither a string of a’s nor a string of 
possibly repeated ab’s. The string abba does not belong to L because any two b’s that 
might occur in a string of L are separated by an a. E 


A Regular Expression That Defines a Language 


Let È = {0, 1}. Find regular expressions over > that define the following languages: 


a. The language consisting of all strings of 0’s and 1’s that have even length and in which 
the 0’s and I’s alternate. 


b. The language consisting of all strings of 0’s and 1’s with an even number of 1’s. Such 
strings are said to have even parity. 

c. The language consisting of all strings of 0’s and ?’s that do not contain two consecu- 
tive L’s. 

Solution 


a. Ifa string in the language starts with a 1, the pattern 10 must continue for the length 
of the string. If it starts with 0, the pattern 01 must continue for the length of the 
string. Also, the null string satisfies the condition by default. Thus an answer is 


(10)" | O1*. 


b. Some basic strings with even parity are A, 0, and 10°1. Concatenation of strings with 
even parity also have even parity. Because such a string may start or end with a string 
of 0’s, one answer is 


(0 | 10*1)*. 
c. Note that a string may end in a 1, but any other 1 must be followed immediately by a 0. 


Thus, it is enough to enforce the rule that a 1 must be followed by a 0, unless the | is at 
the end of the string. A regular expression satisfying these conditions is 


(0 | 10)" | D. E 
Note that a given language may be defined by more than one regular expression. For 
example, both 
(a| b^" and (a|b* 


define the language consisting of the set of all strings of a’s and b’s. 


Deciding Whether Regular Expressions Define the Same Language 


In (a) and (b), determine whether the given regular expressions define the same language. 
If they do, describe the language. If they do not, give an example of a string that is in one 
of the languages but not the other. 


a. (a| A)" and a” b. 0* | 1* and (01)* 
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Solution 


a. Note that because the null string A has no characters, when it is concatenated with any 
other string x, the result is just x: for every string x, xà = Ax = x. Now L((a | X”) is the 
set of strings formed using a and A in any order, and so, because aA = Aa = a, this is 
the same as the set of strings consisting of zero or more a’s. Thus L((a | A)*) = La’). 


b. The two languages defined by the given regular expressions are not the same: 0101 is 
in the second language but not the first. E 


Practical Uses of Regular Expressions 


Many applications of computers involve performing operations on pieces of text. For 
instance, word and text processing programs allow us to find certain words or phrases 
in a document and possibly replace them with others. A compiler for a computer 
language analyzes an incoming stream of characters to find groupings that represent 
aspects of the computer language such as keywords, constants, identifiers, and opera- 
tors. And in bioinformatics, pattern matching and flexible searching techniques are 
used extensively to analyze the long sequences of the base pairs A, C, G, and T that 
occur in DNA. 

Through their connection with finite-state automata, which we discuss in the next sec- 
tion, regular expressions provide an extremely useful way to describe a pattern in order to 
identify a string or a collection of strings within a piece of text. Regular expressions make 
it possible to replace a long, complicated set of if-then-else statements with code that is 
easy both to produce and to understand. Because of their convenience, regular expressions 
were introduced into a number of UNIX utilities, such as grep (short for globally search 
for regular expression and print) and egrep (extended grep), in text editors, such as QED 
(short for Quick EDitor, the first text editor to use regular expressions), vi (short for visual 
interface), sed (short for stream editor and originally developed for UNIX but now used by 
many systems), and Emacs (short for Editor macros), and in the lexical scanner component 
of a compiler. The computer language Perl has a particularly powerful implementation for 
regular expressions, which has become a de facto standard. The implementations used in 
Java, Python, and .NET are similar. 

A number of shorthand notations have been developed to facilitate working with regu- 
lar expressions in text processing. When characters in an alphabet or in a part of an alpha- 
bet are understood to occur in a standard order, the notation [beginning character — ending 
character] is commonly used to represent the regular expression that consists of a single 
character in the range from the beginning to the ending character. It is called a character 
class. Thus 


[A-C] standsfor (A|B|C) 
and 
[0—9] standsfor (0|1|]2|3|4|5|6|7|8| 9). 


Character classes are also allowed to include more than one range of characters. For 
instance, 


[A—Cx-—z] standsfor (A|B|C|x|y|2 
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As an example, consider the language defined by the regular expression 


[A—Za-—z|((A—Za-—z]| [0—-9])*. 
The following are some strings in the language: 
Account Number, 223, jsmithl09, Draft2rev. 


In general, the language is the set of all strings that start with a letter sequence of digits or 
letters. This set is the same as the set of allowable number of computer languages. 
Other commonly used shorthands are 


[ABC] to stand for (A | B| C) 
and a single dot 
. to stand for an arbitrary character. 
Thus, for instance, if È = {A, B, C}, then 
A.C stands for (AAC|ABC| ACC). 


When the symbol ^is placed at the beginning of a character class, it indicates that a charac- 
ter of the same type as those in the range of the class is to occur at that point in the string, 
except for one of the specific characters indicated after the ^ sign. For instance, 


[* D—Z][0 — 9][0 — 9}° 


stands for any string starting with a letter of the alphabet different from D to Z, followed by 
any positive number of digits from 0 to 9. Examples are B3097, C0046, and so forth. If r is 
a regular expression, the notation r + denotes the concatenation of r with itself any positive 
finite number of times. In symbols, 


r+ = rr". 
For example, 
[A — Z]+ 


represents any nonempty string of capital letters. If is a regular expression, then 
r?=(ArA |r). 


That is, r? denotes either zero occurrences or exactly one occurrence of r. Finally, if m and 
n are positive integers with m = n, 


r{n} denotes the concatenation of r with itself exactly n times, 
and 
r{m,n} denotes the concatenation of r with itself anywhere from m through n times. 


Thus a check to help determine whether a given string could represent a local telephone 
number in the United States is to see whether it has the form 


[0 — 9][0 — 9][0 — 9]-[0 — 9][0 — 9][0 — 9][0 — 9], 
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or, equivalently, whether it has the form 


[0 — 9]{3}-[0 = 9]{4}. 


A Regular Expression for a Date 


People often write dates in a variety of formats. For instance, in the United States the fol- 
lowing all represent the fifth of February of 2050: 


Note In most of the rest 2/5/2050 2-5-2050 02/05/2050 02-05-2050 

of the world these expres- 

sions represent the second Write a regular expression that would help check whether a given string might be a valid 
of May of 2050: date written in one of these forms. 


Solution The language defined by the following regular expression consists of all strings 
that begin with one or two digits followed by either a hyphen or a slash, followed by either 
one or two digits, followed by either a hyphen or a slash, followed by four digits. 


[0 — 9]{1, 2}[-/][0 — 9]{1, 2}[0 — 9]{4} 


All valid dates of the given format are elements of the language defined by this expression, 
but the language also includes strings that are not valid dates. For instance, 09/54/1978 
is in the language, but it is not a valid date because September does not have 54 days, 
and 38/12/2184 is not valid because there is no 38th month. It is possible to write a more 
complicated regular expression that could be used to check all aspects of the validity of 
a date (see exercise 40 at the end of the section), but the kind of simpler expression given 
above is nonetheless useful. For instance, it provides an easy way to notify a user of an 
interactive program that a certain kind of mistake was made and that information should 
be reentered. a 


TEST YOURSELF 


Answers to Test Yourself questions are located at the end of each section. 


1. If x and y are strings, the concatenation of x and y 6. The function that associates a language to each 
is_ regular expression over an alphabet È is defined 
iian See ennes the Concannon okt recursively. The base for the definition is the state- 
and I! is ment that KØ) = ____, L(A) =______, and 

L(a) = _______ for every a in È. The recursion 
3. If Land L’ are languages, the union of L and L’ for the definition specifies that if L(r) and L(r’) are 
is_____. the languages defined by the regular expressions 
4. If Lis a language, the Kleene closure of L is a then Ler) =——Lr|r) = 
and L(r*) = ___ 


5. The set of regular expressions over an alphabet È 


is defined recursively. The base for the definition Be The toata [a CI ian example ora 


and denotes the regular expression 


is the statement that ____. The recursion for the 

definition specifies that if r and s are any regular 8. Use of a single dot in a regular expression stands 
expressions over È, then the following are also for 

regular expressions in the set: ——— — ,——— , ROPES 

e P 9. The symbol ^, placed at the beginning of a charac- 


ter class, indicates 
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10. If ris a regular expression, the notation r + 
denotes 


11. If ris a regular expression, the notation r? denotes 


EXERCISE SET 12.1" 
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12. If ris a regular expression, the notation r{n} 
denotes and the notation r{m, n} denotes 


In 1 and 2, let > = {x, y} be an alphabet. 


1. a. Let L be the language consisting of all strings 
over È that are palindromes and have length 
<= 4. List the elements of Lı between braces. 

b. Let L, be the language consisting of all strings 
over > that begin with an x and have length 
< 3. List the elements of Lp. 


2. a. Let L, be the language consisting of all strings 
over È of length < 3 in which all the x’s ap- 
pear to the left of all the y’s. List the elements 
of L; between braces. 

b. List between braces the elements of 54 the set 
of all strings of length 4 over È. 


c. Let A =>! US! and B = X? U 34. Describe 
A, B, and A U B in words. 


3. a. Ifthe expression ab + cd +- in postfix nota- 
tion is converted to infix notation, what is the 
result? 

H b. Let > = {1, 2, *, /} and let L be the set of 

all strings over È obtained by writing first a 
number (1 or 2), then a second number (1 or 2), 
which can be the same as the first one, and 
finally an operation (denoted * or /, where * in- 
dicates multiplication and / indicates division). 
Then L is a set of postfix, or reverse Polish, 
expressions. List all the elements of L between 
braces, and evaluate the resulting expressions. 

In 4-6, describe L,L,, Lı U Lz, and (Lı U L3)“ for the given 

languages L, and L3. 


4. L, is the set of all strings of a’s and b’s that start 
with an a and contain only that one a; L, is the set 
of all strings of a’s and b’s that contain an even 
number of a’s. 


5. Lis the set of all strings of a’s, b’s, and c’s that 
contain no c’s and have the same number of a’s as 
b’s; L, is the set of all strings of a’s, b’s, and c’s 
that contain no a’s or b’s. 


6. Lis the set of all strings of 0’s and 1’s that start 
with a 0; L, is the set of all strings of 0’s and 1’s 
that end with a 0. 


In 7-9, add parentheses to emphasize the order of prece- 
dence in the given expressions. 


7. (a | b*b)(a“ | ab) 
9. aly ox] oz) 


In 10-12, use the rules about order of precedence to 
eliminate the parentheses in the given regular expression. 


10. ((a(b*)) | (c(B*))) (ac) | (be)) 
Th. (1(1*)) | (0%) | (51) 
12. (xy)(((x*)y)*) | (Ox) | 90") 


In 13-15, use set notation to derive the language defined 
by the given regular expression. Assume > = {a, b, c}. 


13. A| ab 14. OIA 


8. 0°1| 0(0"L)* 


15. (a | b)c 


In 16-18, write five strings that belong to the language 
defined by the given regular expression. 


16. O10") 17. b* | b*ab* 18. x“ (yxxy | x)" 


In 19-21, use words to describe the language defined by 
the given regular expression. 


19. b‘ab*ab*a 20. 10|1)‘00 21. («| yy] y)" 


In 22-24, indicate whether the given strings belong to the 
language defined by the given regular expression. Briefly 
justify your answers. 


22. Expression: (b | A)a(a| břa(b | A), strings: aaaba, 
baabb 


23. Expression: (x*y | zy*)*, strings: zyyxz, zyyzy 


24. Expression: (01°2)*, strings: 120, 01202 


“For exercises with blue numbers or letters, solutions are given in Appendix B. The symbol H indicates that only a hint or a partial solution is 
given. The symbol * signals that an exercise is more challenging than usual. 
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In 25-27, find a regular expression that defines the given 
language. 


25. The language consisting of all strings of 0’s and 
Ps with an odd number of 1’s. (Such a string is 
said to have odd parity.) 


26. The language consisting of all strings of a’s and b’s 
in which the third character from the end is a b. 


27. The language consisting of strings of x’s and y’s in 
which the elements in every pair of x’s are sepa- 
rated by at least one y. 


Let r,s, and t be regular expressions over È = {a, b}. In 
28-30, determine whether the two regular expressions 
define the same language. If they do, describe the lan- 
guage. If they do not, give an example of a string that is in 
one of the languages but not the other. 


28. (r|s)t and rt| st 
29. (rs)* and r*s* 
30. (rs) and ((rs)*)* 


In 31-39, write a regular expression to define the given 
set of strings. Use the shorthand notations given in 
the section whenever convenient. In most cases, your 
expression will describe other strings in addition to the 
given ones, but try to make your answer fit the given 
strings as closely as possible within reasonable space 
limitations. 


31. All words that are written in lowercase letters and 
start with the letters pre but do not consist of pre all 
by itself. 


32. All words that are written in uppercase letters, and 
contain the letters BJO (as a unit) or INFO (as a unit). 


33. All words that are written in lowercase letters, end 
in ly, and contain at least five letters. 


ANSWERS FOR TEST YOURSELF 
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34. All words that are written in lowercase letters and 
contain at least one of the vowels a, e, 1, 0, or u. 


35. All words that are written in lowercase letters 
and contain exactly one of the vowels a, e, i, 0, 
or u. 


36. All words that are written in uppercase letters and 
do not start with one of the vowels A, E, I, O, or 
U but contain exactly two of these vowels next to 
each other. 


37. All United States social security numbers (which 
consist of three digits, a hyphen, two digits, 
another hyphen, and finally four more digits), 
where the final four digits start with a 3 and end 
with a 6. 


38. All telephone numbers that have three digits, then 
a hyphen, then three more digits, then a hyphen, 
and then four digits, where the first three digits 
are either 800 or 888 and the last four digits start 
and end with a 2. 


39. All signed or unsigned numbers with or without 
a decimal point. A signed number has one of the 
prefixes + or —, and an unsigned number does not 
have a prefix. Represent the decimal point as \. 
to distinguish it from the single dot symbol for an 
arbitrary character. 


H 40. Write a regular expression to perform a com- 
plete check to determine whether a given string 
represents a valid date from 1980 to 2079 written 
in one of the formats of Example 12.1.11. (Dur- 
ing this period, leap years occur every four years 
starting in 1980.) 


* 41. Write a regular expression to define the set of 
strings of 0’s and 1’s with an even number of 0’s 


and even number of 1’s. 


1. the string obtained by writing all the characters of x 
followed by all the characters of y 2. {xy |x € Land 
yELl'} 3.{s|seLorsEL} 
of any finite number of strings in L} 5. Ø, A, and each 


4. {t | t is a concatenation 


individual symbol in È are regular expressions over >; (rs); 
(| sO) 6. D; {Ak {a}; LOLE); LO) U Lr’); LO)" 
7. character class; (A | B | C) 8. an arbitrary character 
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9. a character of the same type as those in the range of the 
class is to occur at that point in the string except for one 

of the specific characters indicated after the ^ sign. 10. the 
concatenation of r with itself any positive finite number of 
11. (à |r) 12. the concatenation of r with itself 
exactly n times; the concatenation of r with itself anywhere 


times 


from m through n times 
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Finite-State Automata 


The world of the future will be an ever more demanding struggle against the 
limitations of our intelligence, not a comfortable hammock in which we can lie down 
to be waited upon by our robot slaves. —Norbert Wiener, 1964 


The kind of circuit discussed in Section 2.4 is called a combinational circuit. Such a circuit is 
characterized by the fact that its output is completely determined by its input/output table, or, in 
other words, by a Boolean function. Its output does not depend in any way on the history of pre- 
vious inputs to the circuit. For this reason, a combinational circuit is said to have no memory. 

Combinational circuits are very important in computer design, but they are not the only 
type of circuits used. Equally important are sequential circuits. For sequential circuits 
one cannot predict the output corresponding to a particular input unless one also knows 
something about the prior history of the circuit, or, more technically, unless one knows the 
state the circuit was in before receiving the input. The behavior of a sequential circuit is a 
function not only of the input to the circuit but also of the state the circuit is in when the 
input is received. A computer memory circuit is a type of sequential circuit. 

A finite-state automaton is an idealized machine that embodies the essential idea of a 
sequential circuit. Each piece of input to a finite-state automaton leads to a change in the 
state of the automaton, which in turn affects how subsequent input is processed. Imagine, 
for example, the act of dialing a telephone number. Dialing 1-800 puts the telephone circuit 
in a state of readiness to receive the final seven digits of a toll-free call, whereas dialing 
328 leads to a state of expectation for the four digits of a local call. Vending machines op- 
erate similarly. Just knowing that you put a quarter into a vending machine is not enough 
for you to be able to predict what the behavior of the machine will be. You also have to 
know the state the machine was in when the quarter was inserted. If 75¢ had already been 
deposited, you might get a beverage or some candy, but if the quarter was the first coin 
deposited, you would probably get nothing at all. 


A Simple Vending Machine 


A simple vending machine dispenses bottles of juice that cost $1 each. The machine accepts 
quarters and half-dollars only and does not give change. As soon as the amount deposited 
equals or exceeds $1 the machine releases a bottle of juice. The next coin deposited starts the 
process over again. The operation of the machine is represented by the diagram of Figure 12.2.1. 


25¢ 
deposited 


75¢ 
deposited 


half-dollar 


half-dollar 


half-dollar 


deposited 


half-dollar 


FIGURE 12.2.1 A Simple Vending Machine 
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Each circle represents a state of the machine: the state in which O¢ has been deposited, 
25¢, 50¢, 75¢, and $1 or more. The unlabeled arrow pointing to “O¢ deposited” indicates 
that this is the initial state of the machine. The double circle around “$1 or more deposited” 
indicates that a bottle of juice is released when the machine has reached this state. (It is 
called an accepting state of the machine because when the machine is in this state, it has 
accepted the input sequence of coins as payment for juice.) The arrows that link the states 
indicate what happens when a particular input is made to the machine in each of its vari- 
ous states. For instance, the arrow labeled “quarter” that goes from “O¢ deposited” to “25¢ 
deposited” indicates that when the machine is in the state “O¢ deposited” and a quarter is 
inserted, the machine goes to the state “25¢ deposited.” The arrow labeled “half-dollar” 
that goes from “75¢ deposited” to “$1 or more deposited” indicates that when the machine 
is in the state “75¢ deposited” and a half-dollar is inserted, the machine goes to the state 
“$1 or more deposited” and juice is dispensed. (In this case the purchaser would pay $1.25 
for the juice because the machine does not return change.) The arrow labeled “quarter” that 
goes from “$1 or more deposited” to “25¢ deposited” indicates that when the machine is 
in the state “$1 or more deposited” and a quarter is inserted, the machine goes back to the 
state “25¢ deposited.” (This corresponds to the fact that after the machine has dispensed a 
bottle of juice, it starts operation all over again.) 

Equivalently, the operation of the vending machine can be represented by a next-state 
table as shown in Table 12.2.1. 


TABLE 12.2.1 Next-State Table 


State 


50¢ deposited 
75¢ deposited 
© | $1 or more deposited 


75¢ deposited 
$1 or more deposited 
25¢ deposited 


Input 
Quarter Half-Dollar 
=> O¢ deposited 25¢ deposited 50¢ deposited 
25¢ deposited 50¢ deposited 75¢ deposited 


$1 or more deposited 
$1 or more deposited 
50¢ deposited 


The entries in the left-most column are all the possible states of the machine, with 
the arrow pointing to “O¢ deposited” indicating that the machine begins operation in this 
state and the double circle next to “$1 or more deposited” showing that a bottle of juice is 
released when the machine has reached this state. The inputs to the machine are shown 
in at the tops of the columns labeled “Quarter” and “Half-Dollar.” Entries in the body of 
the table show the states to which the machine goes when it has been in one state and a 
given input is applied to it. For instance, the entry in the third row of the column labeled 
“Half-Dollar” shows that when the machine is in state “50¢ deposited” and a half-dollar is 
deposited, it goes to state “$1 or more deposited.” 

Note that Table 12.2.1 conveys exactly the same information as the diagram of Fig- 
ure 12.2.1. If the diagram is given, the table can be constructed, and if the table is given, 
the diagram can be drawn. E 


Observe that the vending machine described in Example 12.2.1 can be thought of as 
having a primitive memory: It “remembers” how much money has been deposited (within 
limits) by referring to the state it is in. This capability for storing information and acting 
upon it is what gives finite-state automata their tremendous power. 

The most important finite-state automata are digital computers. Each computer consists 
of several subsystems: input devices, a processor, and output devices. A processor typically 
consists of a central processing unit and a finite number of memory locations. At any given 
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time, the state of the processor is determined by the locations and values of all the bits 
stored within its memory. A computer that has n different locations for storing a single bit 
can therefore exist in 2” different states. For a modern computer, n is many billions or even 
trillions, so the total number of states is enormous. But it is finite. Therefore, despite the 
complexity of a computer, just as for a vending machine, it is possible to predict the next 
state given knowledge of the current state and the input. Indeed, this is essentially what 
programmers try to do every time they write a program. Fortunately, modern, high-level 
computer languages provide a lot of help. 

a, aan The basic theory of automata was developed to answer very theoretical questions about 
David Hilbert the foundations of mathematics posed by the great German mathematician David Hillbert 
(1862-1943) in 1900. The ground-breaking work on automata was done in the mid-1930s by the English 
mathematician and logician Alan Turing. In the 1940s and 1950s, Turing’s work played an 
important role in the development of real-world automatic computers. 


Paul Fearn/Alamy Stock Photo 


Definition of a Finite-State Automaton 


A general finite-state automaton is completely described by giving a set of states, together 
with an indication about which is the initial state and which are the accepting states (when 
something special happens), a list of all input elements, and specification for a next-state 
function that defines which state is produced by each input in each state. This is formalized 
in the following definition: 


Science History Images/Alamy Stock Photo 


Definition 


Alan M. Turing 
(1912-1954) A finite-state automaton A consists of five objects: 


1. A finite set J, called the input alphabet, of input symbols. 

. A finite set S of states the automaton can assume. 

. A designated state sọ called the initial state. 

. A designated set of states called the set of accepting states. 

. A next-state function N: S x I — S that associates a “next-state” to each or- 
dered pair consisting of a “current state” and a “current input.” For each state s 
in S and input symbol m in J, N(s, m) is the state to which A goes if m is input to 
A when A is in state s. 


nABwWh 


The operation of a finite-state automaton is commonly described by a diagram called 
a (state-) transition diagram, similar to that of Figure 12.2.1. It is called a transition 
diagram because it shows the transitions the machine makes from one state to another in 
response to various inputs. In a transition diagram, states are represented by circles and ac- 
cepting states by double circles. There is one arrow that points to the initial state and there 
are other arrows that are labeled with input symbols and point from each state to other 
states to indicate the action of the next-state function. Specifically, an arrow from state s to 
state t labeled m means that N(s, m) = t. 

The next-state table for an automaton shows the values of the next-state function N for 
all possible states s and input symbols i. In the annotated next-state table, the initial state 
is indicated by an arrow and the accepting states are marked by double circles. 


A Finite-State Automaton Given by a Transition Diagram 


Consider the finite-state automaton A defined by the transition diagram shown in 
Figure 12.2.2. 
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a. What are the states of A? 

b. What are the input symbols of A? 
c. What is the initial state of A? 

d. What are the accepting states of A? 


e. Determine Ms, 1). 


FIGURE 12.2.2 


f. Construct the annotated next-state table for A. 
Solution 


. The states of A are so, sı, and s [since these are the labels of the circles]. 


a 
b. The input symbols of A are 0 and 1 [since these are the labels of the arrows]. 


ig 


The initial state of A is sọ [since the unlabeled arrow points to sọ]. 


Q 


. The only accepting state of A is 5 [since this is the only state marked by a double 
circle]. 


e. N(s,, 1) = s3 [since there is an arrow from Ss to s labeled 1] 


Input 


State 


A Finite-State Automaton Given by an Annotated 
Next-State Table 
Consider the finite-state automaton A defined by the following annotated next-state table: 
a. What are the states of A? 
b. What are the input symbols of A? Input 
c. What is the initial state of A? 


d. What are the accepting states of A? 
e. Find N(U, c). State 


f. Draw the transition diagram for A. © 


G 
¥ 
V 
Y 
Z 


© 
NASS 
NNWN 
NASN 


Solution 


a. The states of A are U, V, Y, and Z. 
b. The input symbols of A are a, b, and c. 


ig 


The initial state of A is U [since the arrow points to U]. 


Q 


. The accepting states of A are V and Z [since these are marked with double circles]. 


e. N(U, c) = Y [since the entry in the row labeled U and the column labeled c of the next- 
state table is Y]. 


f. The transition diagram for A is shown in Figure 12.2.3. It can be drawn more com- 
pactly by labeling arrows with multiple-input symbols where appropriate. This is 
illustrated in Figure 12.2.4. 
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a,b,c 


FIGURE 12.2.3 FIGURE 12.2.4 a 


The Language Accepted by an Automaton 


Now suppose a string of input symbols is fed into a finite-state automaton in sequence. 
At the end of the process, after each successive input symbol has changed the state of the 
automaton, the automaton ends up in a certain state, which may be either an accepting state 
or a nonaccepting state. In this way, the action of a finite-state automaton separates the set 
of all strings of input symbols into two subsets: those that send the automaton to an accept- 
ing state and those that do not. Those strings that send the automaton to an accepting state 
are said to be accepted by the automaton. 


Definition 


Let A be a finite-state automaton with set of input symbols 7. Let J" be the set of all 
strings over /, and let w be a string in J”. Then w is accepted by A if, and only if, A 
goes to an accepting state when the symbols of w are input to A in sequence from left 
to right, starting when A is in its initial state. The language accepted by A, denoted 
L(A), is the set of all strings that are accepted by A. 


Finding the Language Accepted by an Automaton 


Consider the finite-state automaton A defined in Example 12.2.2 and shown again below. 


a. To what states does A go if the symbols of the following strings are input to A in se- 
quence, starting from the initial state? 


(i) 01 Gi) 0011 Gii) 0101100 (iv) 10101 


b. Which of the strings in part (a) send A to an accepting state? 
. What is the language accepted by A? 


a 0a 


. Is there a regular expression that defines the same language? 
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Solution 


a. (1) Sy (il) Sq (ili) Sy (iV) 59 
b. The strings 01 and 10101 send A to an accepting state. 


c. Observe that if w is any string that ends in 01, then w is accepted by A. For if w is any 
string of length n = 2, then after the first n — 2 symbols of w have been input, A is 
in one of its three states: so, $1, or s2. But from any of these three states, input of the 
symbols 01 in sequence sends A first to sı and then to the accepting state s2. Hence 
any string that ends in 01 is accepted by A. 

Also note that the only strings accepted by A are those that end in O1. (That is, 
no other strings besides those ending in 01 are accepted by A.) The reason is that the 
only accepting state of A is s), and the only arrow pointing to s) comes from sı and 
is labeled 1. Thus in order for an input string w of length n to send A to an accepting 
state, the last symbol of w must be a 1 and the first n— 1 symbols of w must send A 
to state s4. Now three arrows point to s;, one from each of the three states of A, and 
all are labeled 0. Thus the last of the first n — 1 symbols of w must be 0, or, in other 
words, the next-to-the-last symbol of w must be 0. Hence the last two symbols of w 
must be 01, and thus 


L(A) = the set of all strings of 0’s and 1’s that end in 01. 


d. Yes. One regular expression that defines L(A) is (0 | 1)*01. E 


A finite-state automaton with multiple accepting states can have output devices at- 
tached to each one so that the automaton can classify input strings into a variety of differ- 
ent categories, one for each accepting state. This is how finite-state automata are used in 
the lexical scanner component of a computer compiler to group the symbols from a stream 
of input characters into identifiers, keywords, and so forth. 


The Eventual-State Function 


Now suppose a finite-state automaton is in one of its states (not necessarily the initial state) 
and a string of input symbols is fed into it in sequence. To what state will the automaton 
eventually go? The function that gives the answer to this question for every possible com- 
bination of input strings and states of the automaton is called the eventual-state function. 


Definition 


Let A be a finite-state automaton with set of input symbols J, set of states S, and 
next-state function N: S X I — S. Let I“ be the set of all strings over 7, and define 
the eventual-state function N*: S x I > S as follows: 
For any state s and for any input string w, 
the state to which A goes if the 
N*(s, w) = | symbols of w are input to A in sequence, 
starting when A is in state s 


Computing Values of the Eventual-State Function 


Consider again the finite-state automaton of Example 12.2.2 shown below for convenience. 
Find N*(s;, 10110). 
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ORO 


WW a> 9 
Solution By definition of the eventual-state function, 


the state to which A goes if the 
N*(sı, 10110) = | symbols of 10110 are input to A in 
sequence, starting when A is in state sı 


By referring to the transition diagram for A, you can see that starting from sı, when a | is 
input, A goes to sz; then when a 0 is input, A goes back to s4; after that, when a 1 is input, A 
goes to sy; from there, when a | is input, A goes to sọ; and finally, when a 0 is input, A goes 
back to sı. This sequence of state transitions can be written as follows: 


s5 — y — s — > => fji 


So 
Thus, after all the symbols of 10110 have been input in sequence, the eventual state of A 


is S1, SO 


N*(s,, 10110) = 5}. m 


The definitions of string and language accepted by an automaton can be restated sym- 
bolically using the eventual-state function. Suppose A is a finite-state automaton with set 
of input symbols / and next-state function N, and suppose that 7* is the set of all strings over 
I and that w is a string in J”. 


w is accepted by A = N“(so, w) is an accepting state of A | 


L(A) = {w E P | N“(so, w) is an accepting state of A} | 


Designing a Finite-State Automaton 


Now consider the problem of starting with a description of a language and designing an 
automaton to accept exactly that language. 


A Finite-State Automaton That Accepts the Set of Strings of 0’s and 1’s 
for Which the Number of 1’s Is Divisible by 3 


a. Design a finite-state automaton A that accepts the set of all strings of 0’s and 1’s such 
that the number of 1’s in the string is divisible by 3. 

b. Is there a regular expression that defines this set? 

Solution 


a. Let sọ be the initial state of A, sı its state after one 1 has been input, and s; its state 
after two l’s have been input. Note that sọ is the state of A after zero 1’s have been 
input, and since zero is divisible by 3 (0 = 0-3), sọ must be an accepting state. 
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The states so, sı}, and sy must be different from one another because from state so 
three l’s are needed to reach a new total divisible by 3, whereas from state sı two 
additional 1’s are necessary, and from state sj just one more 1 is required. 

Now the state of A after three 1’s have been input can also be taken to be sọ because 
after three 1’s have been input, three more are needed to reach a new total divisible by 3. 
More generally, if 3k 1’s have been input to A, where k is any nonnegative integer, then 
three more are needed for the total again to be divisible by 3 (since 3k +3 = 3(k + 1)). 
Thus the state in which 3k ls have been input, for any nonnegative integer k, can be 
taken to be the initial state so. 

By similar reasoning, the states in which (3k + 1) I’s and (3k + 2) I’s have been in- 
put, where k is a nonnegative integer, can be taken to be sı and s2, respectively. 

Now every nonnegative integer can be written in one of the three forms 3k, 3k + 1, or 
3k + 2 (see Section 4.5), so the three states sọ, s1, and s are all that is needed to create 
A. Thus the states of A can be drawn and labeled as shown below. 


© © 
© 


Next consider the possible inputs to A in each of its states. No matter what state A 
is in, if a O is input the total number of 1’s in the input string remains unchanged. Thus 
there is a loop at each state labeled 0. 

Now suppose a | is input to A when it is in state sọ. Then A goes to state s4 (since the 
total number of 1’s in the input string has changed from 3k to 3k + 1). Similarly, if a 1 
is input to A when it is in state s,, then A goes to state s, (since the total number of 1’s 
in the input string has changed from 3k + 1 to 3k + 2). Finally, if a 1 is input to A when 
it is in state sy, then it goes to state sọ (since the total number of 1’s in the input string 
becomes (3k + 2) + 1 = 3k +3 = 3(k + 1), which is a multiple of 3). 

It follows that the transition diagram for A has the appearance shown below. 


This automaton accepts the set 
of strings of 0’s and 1’s for which 
the number of 1’s is divisible by 3. 


b. A regular expression that defines the given set is 0"|(0"10" 10°10°)”. 


A Finite-State Automaton That Accepts the Set of All Strings of 0’s and 1’s 
Containing Exactly One 1 


a. Design a finite-state automaton A to accept the set of all strings of 0’s and I’s that 
contain exactly one 1. 


b. Is there a regular expression that defines this set? 
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Solution 


a. The automaton A must have at least two distinct states: 
So: initial state; 
51: state to which A goes when the input string contains exactly one 1. 
If A is in state sọ and a 0 is input, A may as well stay in state sọ (since it still needs to 


wait for a 1 to move to state s,), but as soon as a | is input, A moves to state sı. Thus a 
partial drawing of the transition diagram is as shown below. 


= 
Now consider what happens when A is in state s. If a 0 is input, the input string still 
has a single 1, so A stays in state sı. But if a 1 is input, then the input string contains 
more than one 1, so A must leave sı (Since no string with more than one 1 is to be ac- 
cepted by A). It cannot go back to state sọ because there is a way to get from sọ to s4, and 
after input of the second 1, A can never return to state s;. Hence A must go to a third 
state, s2, from which there is no return to sı. Thus from s, every input may as well leave 


A in state s2. It follows that the completed transition diagram for A has the appearance 
shown below. 


This automaton accepts the set of 
strings 0’s and 1’s, with exactly one 1. 


b. A regular expression that defines the given set is 0°10". a 


Simulating a Finite-State Automaton Using Software 


Suppose items have been coded with strings of 0’s and 1’s. A program is to be written to 
govern the processing of items coded with strings that end in 011; items coded any other 
way are to be ignored. This situation can be modeled by the finite-state automaton shown 
in Figure 12.2.5. 


This automaton recognizes 
strings that end in 011. 


FIGURE 12.2.5 


The symbols of the code for the item are fed into this automaton in sequence, and every 
string of symbols in a given code sends the automaton to one of the four states so, 5), 52, Or 
53. If state s3 is reached, the item is processed; if not, the item is ignored. 

The action of this finite-state automaton can be simulated by a computer algorithm as 
given in Algorithm 12.2.1. 
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Algorithm 12.2.1 A Finite-State Automaton 


[This algorithm simulates the action of the finite-state automaton of Figure 12.2.5 by mimick- 
ing the functioning of the transition diagram. The states are denoted 0, 1, 2, and 3.] 


Input: string /a string of 0’s and l’s plus an end marker e] 


Algorithm Body: 
state := 0 
symbol := first symbol in the input string 
while (symbol # e) 
if state=0 then if symbol = 0 
then state := 1 
else state = 0 
else if state = 1 then if symbol = 0 
then state := 1 
else state := 2 
else if state = 2 then if symbol = 0 
then state := 1 
else state := 3 
else if state = 3 then if symbol = 0 
then state := 1 
else state := 0 
symbol := next symbol in the input string 
end while 
[After execution of the while loop, the value of state is 3 if, and only if, the input 
string ends in Olle.] 


Output: state 


Note how use of the finite-state automaton allows the creator of the algorithm to focus 
on each step of the analysis of the input string independently of the other steps. 

An alternative way to program this automaton is to enter the values of the next-state 
function directly as a two-dimensional array. This is done in Algorithm 12.2.2. 


Algorithm 12.2.2 A Finite-State Automaton 


[This algorithm simulates the action of the finite-state automaton of Figure 12.2.5 by 
repeated application of the next-state function. The states are denoted 0, 1, 2, and 3.] 


Input: string [a string of 0’s and \’s plus an end marker e] 


Algorithm Body: 
N(O, 0) := 1, N(O, 1) := 0, NC, 0) := 1, NC, 1) := 2, 
N(2, 0) := 1, N(2, 1) := 3, NG, 0) := 1, NG, 1) := 0 
state := 0 


symbol := first symbol in the input string 
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while (symbol # e) 
state := N (state, symbol) 
symbol := next symbol in the input string 
end while 
[After execution of the while loop, the value of state is 3 if, and only if, the input 
string ends in Ol le.] 


Output: state 


Finite-State Automata and Regular Expressions 


In the previous sections, each time we considered a language accepted by a finite-state 
automaton, we found a regular expression that defined the same language. Stephen 
Kleene showed that our ability to do this is not sheer coincidence. He proved that any 
language accepted by a finite-state automaton can be defined by a regular expression 
and that, conversely, any language defined by a regular expression is accepted by a 
finite-state automaton. Thus for the many applications of regular expressions discussed 
in Section 12.1, it is theoretically possible to find a corresponding finite-state automaton, 
which can then be simulated using the kinds of computer algorithms described in the 
previous subsection. 

In practice, it is often of interest to retain only pieces of the patterns sought. For instance, 
to obtain a reference in an HTML document, one would specify a regular expression de- 
fining the full HTML tag, <ahref= “the desired URL’>, but one would be interested in 
retrieving only the string between the quotation marks. Because of these kinds of consid- 
erations, actual implementations of finite-state automata include additional features.” 

We break the statement of Kleene’s theorem into two parts. 


Kleene’s Theorem, Part 1 


Given any language that is accepted by a finite-state automaton, there is a regular 
expression that defines the same language. 


Proof: Suppose A is a finite-state automaton with a set J of input symbols, a set S of 
n states, and a next-state function N: S X I — S. Let I* denote the set of all strings 
over I. Number the states s1, 59, 53,..., Sn, using sı to denote the initial state, and for 
each integer k = 1, 2,3,..., 2, let 


when the symbols of x are input to A in sequence, 


i j¿=4x ET | A goes from state s; to state s; without traveling 


through an intermediate state s, for which h > k 


Note that either index i or index j in L j could be greater than k; the only restric- 
tion is that the symbols of a string in is j cannot make A both enter and exit an inter- 
mediate state with index greater than k. 


(continued on page 852) 


“For more information, see Mastering Regular Expressions, 3rd ed., by Jeffrey E. F. Friedl (Sebastopol, CA: 
O’Reilly & Associates, 2006). Tutorials are also available on the Internet. 
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If s; is an accepting state and if k = n and i = 1, then L}; is the set of all strings 
that send A to s; when the symbols of the string are input to A in sequence starting 
from sı. Thus 


Li C L(A). 


Moreover, because the sequence of symbols in every string in L(A) sends A to some 
accepting state s;, 


L(A)is the union of all the sets Lī, ;, where s; is an accepting state. 


We use a version of mathematical induction to build up a set of regular expres- 
sions over 7. Let the property P(m) be the sentence 


For any pair of integers i andj with 1 =i, j S n, 
there is a regular expression 77’; that defines L;”,. <— P(m) 


Show that P(0) is true: For each pair of integers i and j with 1 S i, j =n, L? jis the 
set of all strings that send A from s; to s; without sending it through any intermedi- 
ate state s, for which h > 0. Because the subscript of every state in A is greater than 
zero, the strings in 1 j do not send A through any intermediate states at all, and so 
each is a single input symbol from Z. In other words, for all integers i and j with 
Tsi] =m 


L;i = {a E I | Ms; a) = s}. 


Hence L; jis a subset of Z, and so (because 7 is finite), there is an integer M so that we 
may denote the elements of Ib j as follows: 


Li; = {dp dy, a3,..., ay} CL. 


Now, by definition of regular expression, each single input symbol of 7 is a regular ex- 
pression over J; thus every element of L? jis a regular expression over /. The result is that 
for all integers i and j with | = i, j S n, the following regular expression defines Ee F 


a, |a | a| -++ | am 


Show that for every integer k with 0 Sk Sn, if P(k) is true then P(k + 1) is true: 
Let k be any integer with 1 = k = n, and suppose that 


For each pair of integers p and q with 1 S p,q Sn, <P% 
there is a regular expression T, q that defines D q- inductive hypothesis 


We will show that 


For each pair of integers i andj with 1 S i, j S n, 


there is a regular expression aa that defines nea 


<—P(k+1) 

So suppose that į and j are any pair of integers with | < i, j S n, and observe that 
any string in Ge sends A from s; to s;, either by a route that makes A pass through 
5,41 Or by a route that does not make A pass through s;,,. Now each string that 
sends A from s; to s; and makes A pass through s+; one or more times can be bro- 
ken into segments. The symbols in the first segment send A from s; to 5,4, without 
making A pass through s;,,,; those in each of the intermediate segments send 5,4; 
to itself without making A pass through s;+,; and those in the final segment send 
A from sg+ı to s; without making A pass through s;,,,. (The intermediate segment 
could be the null string.) A typical path showing two intermediate segments is il- 
lustrated on the next page. 
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Note that each intermediate segment of the string is in LEi, kķ+1; and by assumption 
the regular expression i «+1 defines this set. By the =e reasoning, r; ,+, defines 
the set of all possible first segments of the string, and Éa „j defines the set of all pos- 
sible final segments of the string. In addition, r% j defines the set of all strings that 
send A from s; to s; without making it pass chrouch a state s,, with m > k. Thus we 


may define the regular Den i a j as follows: 
RL Ele 
fij T Till, eee, iti) rij 


Then a ' defines the set of all strings that sonda from s; to 3 oe making it pass 
inoue any states s,, with m > k + 1, and so rj. a defines i if ! [as was to be shown]. 

To complete the proof, let s}, 5;,,..., 5;, be the accepting state of A. Because L(A) 
is the union of all the Lj, ; where s; is an accepting state, we have 


L(A) = L(ri 1 a) U L(ri, a) Ur U L(ri 3) by the recursive definition 
A 1 for the language defined 
= AIAG, |) 


by a regular expression 
Thus if we let r = ri j| ri pl tt [7i j, we have that L(A) = L(r). In other words, we 
have constructed a regular expression r that defines the language accepted by A. 


Kleene’s Theorem, Part 2 


Given any language defined by a regular expression, there is a finite-state automaton 
that accepts the same language. 


The most common way to prove part 2 of Kleene’s theorem is to introduce a new cat- 
egory of automata called nondeterministic finite-state automata. These are similar to the 
(deterministic) finite-state automata we have been discussing, except that for any given 
state and input symbol, the next state is a subset of the set of states of the automaton, pos- 
sibly even the empty set. Thus the next state of the automaton is not uniquely determined 
by the combination of a current state and an input symbol. A string is accepted by a nonde- 
terministic finite-state automaton if, and only if, when the symbols in the string are input 
to the automaton in sequence, starting from an initial state, there is some sequence of next 
states through which the automaton could travel that would send it to an accepting state. 
For instance, the transition diagram at the left is an example of a very simple nondetermin- 
istic finite-state automaton that accepts the set of all strings beginning with a 1. Observe 
that N(so, 1) = {5 1, 55} and N(so, 0) = Ø. 

Given a language defined by any regular expression, there is a straightforward re- 
cursive algorithm for finding a nondeterministic finite-state automaton that defines 
the same language. The proof of Kleene’s theorem is completed by showing that for 
any such non-deterministic finite-state automaton, there is a (deterministic) finite-state 
automaton that defines the same language. We leave the details of the proof to a course 
in automata theory. 
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Regular Languages 


According to Kleene’s theorem, the set of languages defined by regular expressions is 
identical to the set of languages accepted by finite-state automata. Any such language is 
called a regular language. The brief allusions we made earlier to context-free languages 
and Chomsky’s classification of languages suggest that not every language is regular. We 
will prove this by giving an example of a nonregular language. 

To construct the example, note that because a finite-state automaton can assume only 
a finite number of states and because there are infinitely many input sequences, by the 
pigeonhole principle there must be at least one state to which the automaton returns over 
and over again. This is the essential feature of an automaton that makes it possible to find 
a nonregular language. 


Showing That a Language Is Not Regular 


Let the language L consist of all strings of the form atb*, where k is a positive integer. 
Symbolically, L is the language over the alphabet 2 = {a, b} defined by 


L= {wE ¥®*|w= ab“, where k is a positive integer}. 


Use the pigeonhole principle to show that L is not regular. In other words, show that there 
is no finite-state automaton that accepts L. 


Solution [Use a proof by contradiction.] Suppose not. That is, suppose there is a finite- 
state automaton A that accepts L. [A contradiction will be derived.] Since A has only a finite 
number of states, these states can be denoted 5), 59, 53,...,5,, where n is a positive integer. 
Consider all input strings that consist entirely of a’s: a, a, a - a‘, .... Now there are infi- 
nitely many such strings and only finitely many states. Thus, by the pigeonhole principle, 
there must be a state s„ and two input strings a” and a’ with p # q such that when either a” 
or a‘ is input to A, A goes to state s„. (See Figure 12.2.6.) [The pigeons are the strings of a’s, 
the pigeonholes are the states, and the correspondence associates each string with the state to 
which A goes when the string is input.] 


Strings of a’s States of A 


F 
— 


F (at ) = the state to which A 
goes when a’ is input 
= N*(sy, a’) 


Since F is not one-to-one, J strings 
a? and af with p # q such that both 
a? and af send A to the same state s,- 


There are an There are only 
infinite number n States. 
of these strings. 


FIGURE 12.2.6 
Now, by supposition, A accepts L. Hence A accepts the string 


ab’. 
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This means that after p a’s have been input, at which point A is in state s,,, inputting p ad- 
ditional b’s sends A into an accepting state, say s,. But that implies that 


afb” 
also sends A to the accepting state s4, and so afb” is accepted by A. The reason is that after 


q a’s have been input, A is also in state s,,, and from that point, inputting p additional b’s 
sends A to state są, which is an accepting state. Pictorially, if p < q, then 


p a's are input 


p b’s are input 


q- p additional 
a’s are input 


Now, by supposition, L is the language accepted by A. Thus since afb” is accepted by 
A, ab? E L. But by definition of L, L consists only of strings with equal numbers of a’s 
and b’s. So since p + q, afb” € L. Hence afb” E L and afb” € L, which is a contradiction. 
It follows that the supposition is false, and so there is no finite-state automaton that 


accepts L. 


The five objects that make up a finite-state au- 
tomaton are ; n > > 
and 


. The next-state table for an automaton shows the 


values of 


In the annotated next-state table, the initial state is 
indicated with an and the accepting states 
are marked by 


A string w consisting of input symbols is ac- 
cepted by a finite-state automaton A if, and only 
if, ___. 


The language accepted by a finite-state automaton 
AIS— = — 


If N is the next-state function for a finite-state 
automaton A, the eventual-state function N* is 


10. 


defined as follows: For each state s of A and for 
each string w that consists of input symbols of A, 


N*(s, w) = ____. 


One part of Kleene’s theorem says that given any 
language that is accepted by a finite-state automa- 
ton, there is 


. The second part of Kleene’s theorem says that 


given any language defined by a regular expres- 
sion, there is 


A regular language is 


Given the language consisting of all strings of 
the form afb“, where k is a positive integer, the 
pigeonhole principle can be used to show that the 
language is 


Find the state of the vending machine in Example 
12.2.1 after each of the following sequences of 
coins have been input. 

a. Quarter, half-dollar, quarter 

b. Quarter, half-dollar, half-dollar 


c. Half-dollar, quarter, quarter, quarter, half- 
dollar 


In 2-7, a finite-state automaton is given by a transition 
diagram. For each automaton: 
a. Find its states. 
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b. Find its input symbols. In 8 and 9, a finite-state automaton is given by an anno- 
c. Find its initial state. tated next-state table. For each automaton: 
d. Find its accepting states. a. Find its states. 


e. Write its annotated next-state table. b. Find its input symbols. 
c. Find its initial state. 
a d. Find its accepting states. 
e. Draw its transition diagram. 
8. Next-State Table 
3. 
25 
State 
© 
9. Next-State Table 
4. 
2g 
© 
5. State 
10. A finite-state automaton A, given by the transi- 
tion diagram below, has next-state function N and 
eventual-state function N*. 
6. 
a. Find Ms, 1) and M50, 1). 
b. Find M(s>, 0) and Ns), 0). 
c. Find N*(so, 10011) and N*(s,, 01001). 
7 d. Find N*(sz, 11010) and N“(so, 01000). 


11. A finite-state automaton A, given by the transition 
diagram on the next page, has next-state function 
N and eventual-state function N“. 
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Find N(s3, 0) and M(s>, 1). 

Find N(so, 0) and N(s4, 1). 

Find N*(sọ, 010011) and N*(s3, 01101). 
Find N*(sọ, 1111) and N*(s5, 00111). 


ange 


12. Consider again the finite-state automaton of 

exercise 2. 

a. To what state does the automaton go when the 
symbols of the following strings are input to it 
in sequence, starting from the initial state? 

(i) 1110001 (ii) 0001000 (iii) 11110000 

b. Which of the strings in part (a) send the autom- 
aton to an accepting state? 

c. What is the language accepted by the 
automaton? 

d. Find a regular expression that defines the 
language. 


13. Consider again the finite-state automaton of 

exercise 3. 

a. To what state does the automaton go when the 
symbols of the following strings are input to it 
in sequence, starting from the initial state? 

(i) bb (ii) aabbbaba_ (iii) babbbbbabaa 
(iv) bbaaaabaa 

b. Which of the strings in part (a) send the autom- 
aton to an accepting state? 

c. What is the language accepted by the 
automaton? 

d. Find a regular expression that defines the 
language. 

In each of 14-19, (a) find the language accepted by the au- 
tomaton in the referenced exercise, and (b) find a regular 
expression that defines the same language. 


14. Exercise 4 15. Exercise 5 16. Exercise 6 


18. Exercise 8 19. Exercise 9 


In each of 20-28, (a) design an automaton with the given 
input alphabet that accepts the given set of strings, and 


17. Exercise 7 
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(b) find a regular expression that defines the language 
accepted by the automaton. 


20. Input alphabet = {0, 1}; Accepts the set of all 
strings for which the final three input symbols are 1. 


21. Input alphabet = {0, 1}; Accepts the set of all 
strings that start with 01. 


H 22. Input alphabet = {a, b}; Accepts the set of all 


strings of length at least 2 for which the final two 
input symbols are the same. 


23. Input alphabet = {0, 1}; Accepts the set of all 
strings that start with O1 or 10. 


24. Input alphabet = {0, 1}; Accepts the set of all 
strings that start with 101. 


25. Input alphabet = {0, 1}; Accepts the set of all 
strings that end in 10. 


26. Input alphabet = {a, b}; Accepts the set of all 
strings that contain exactly two b’s. 


27. Input alphabet = {0, 1}; Accepts the set of all 
strings that start with 0 and contain exactly one 1. 


28. Input alphabet = {0, 1}; Accepts the set of all 
strings that contain the pattern 010. 

In 29-47, design a finite-state automaton to accept the 

language defined by the regular expression in the refer- 

enced exercise from Section 12.1. 


29. Exercise 16 30. Exercise 17 31. Exercise 18 


32. Exercise 19 33. Exercise 20 34. Exercise 21 


35. Exercise 24 36. Exercise25 37. Exercise 26 


38. Exercise 27 39. Exercise 31 40. Exercise 32 


41. Exercise 33 42. Exercise 34 43. Exercise 35 


44. Exercise 36 45. Exercise37 46. Exercise 38 


47. Exercise 39 


48. A simplified telephone switching system allows 

the following strings as legal telephone numbers: 

a. A string of seven digits in which neither of the 
first two digits is a 0 or 1 (a local call string). 

b. A 1 followed by a three-digit area code string 
(any digit except 0 or 1 followed by a 0 or 1 
followed by any digit) followed by a seven- 
digit local call string. 

c. A 0 alone or followed by a three-digit area 
code string plus a seven-digit local call string. 
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Design a finite-state automaton to recognize H 53. Let L be the language consisting of all strings of 
all the legal telephone numbers in (a), (b), and the form 
(c). Include an “error state” for invalid tele- 7 2 eee 
a”, where n = m‘, for some positive integer m. 
phone numbers. 

Show that there is no finite-state automaton that 


49. Write a computer algorithm that simulates the ac- 
accepts L. 


tion of the finite-state automaton of exercise 2 by 
mimicking the action of the transition diagram. 54. a. Let A be a finite-state automaton with input al- 
phabet £, and suppose L(A) is the language ac- 
cepted by A. The complement of L(A) is the set 
of all strings over È that are not in L(A). Show 
that the complement of a regular language is 
H 51. Let L be the language consisting of all strings of regular by proving the following: If L(A) is the 
the form language accepted by a finite-state automaton 
A, then there is a finite-state automaton A’ that 
accepts the complement of L(A). 
Show that there is no finite-state automaton that b. Show that the intersection of any two regular 
accepts L. languages is regular as follows: First prove that 
if L(A,) and L(A>) are languages accepted by au- 
tomata A, and A, respectively, then there is an 
automaton A that accepts (L(A))° U (L(A)))°. 
ab", where m and n are positive integers and m S n. Then use one of De Morgan’s laws for sets, the 
double complement law for sets, and the result 
of part (a) to prove that there is an automaton 
that accepts L(A,) L(A). 


50. Write a computer algorithm that simulates the ac- 
tion of the finite-state automaton of exercise 8 by 
repeated application of the next-state function. 


ab", where m and n are positive integers and m = n. 


52. Let L be the language consisting of all strings of 
the form 


Show that there is no finite-state automaton that 
accepts L. 


ANSWERS FOR TEST YOURSELF 


1. a finite set of input symbols; a finite set of states; a in an accepting state 5. the set of strings that are accepted 
designated initial state; a designated set of accepting by A_ 6. the state to which A goes if it is in state s and the 
states; a next-state function that associates a “next-state” characters of w are input to it in sequence 7. a regular 
with each state and input symbol of the automaton 2. the expression that defines the same language 8. a finite-state 
next-state function for each state and input symbol of the automaton that accepts the same language 9. a language 
automaton 3. arrow; double circles 4. when the symbols defined by a regular expression (Or: a language accepted by 
in the string are input to the automaton in sequence from left a finite-state automaton) 10. not regular 


to right, starting from the initial state, the automaton ends up 


ÆJ Simplifying Finite-State Automata 


Our life is frittered away by detail. . . . Simplify, simplify. 
—Henry David Thoreau, Walden, 1854 


Any string input to a finite-state automaton either sends the automaton to an accepting 
state or not, and the set of all strings accepted by an automaton is the language accepted 
by the automaton. It often happens that when an automaton is created to do a certain job 
(as in compiler construction, for example), the automaton that emerges “naturally” from 
the development process is unnecessarily complicated; that is, there may be an automa- 
ton with fewer states that accepts exactly the same language. It is desirable to find such 
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an automaton because the memory space required to store an automaton with n states is 
approximately proportional to n’. Thus approximately 10,000 memory locations are re- 
quired to store an automaton with 100 states, whereas only about 100 memory locations 
are needed to store an automaton with 10 states. In addition, the fewer states an automaton 
has, the easier it is to write a computer algorithm based on it; and to see that two automata 
both accept the same language, it is easiest to simplify each to a minimal number of states 
and compare the simplified automata. In this section we show how to take a given automa- 
ton and simplify it in the sense of finding an automaton with fewer states that accepts the 
same language. 


Seu avn An Overview 


Consider the finite-state automata A and A’ in Figure 12.3.1. A moment’s thought should 
convince you that A’ accepts all those strings, and only those strings, that contain an even 
number of 1’s. But A, although it appears more complicated, accepts exactly those strings 
also. Thus the two automata are “equivalent” in the sense that they accept the same lan- 
guage, even though A’ has fewer states than A. 


0 


FIGURE 12.3.1 Two Equivalent Automata 


Roughly speaking, the reason for the equivalence of these automata is that some of the 
states of A can be combined without affecting the acceptance or nonacceptance of any 
input string. It turns out that s, can be combined with state sg and that s; can be combined 
with state sı}. (How to figure out which states can be combined is explained later in this 
section.) The automaton with the two combined states {s9, s2} and {s), 53} is called the quo- 
tient automaton of A and is denoted A. Its transition diagram is obtained by combining the 
circles for sọ and s, and for sı and s3 and by replacing any arrow from a state s to a state t by 
an arrow from the combined state containing s to the combined state containing ¢. For in- 
stance, since there is an arrow labeled 1 from s, to s, in A, there is an arrow labeled 1 from 
{5,53} to {so, $2} in A. The complete transition diagram for A is shown in Figure 12.3.2. As 
you can see, except for labeling the names of the states, it is identical to the diagram for A’. 


FIGURE 12.3.2 Transition Diagram for A | 


In general, simplification of a finite-state automaton involves identifying “equivalent 
states” that can be combined without affecting the action of the automaton on input strings. 
Mathematically speaking, this means defining an equivalence relation on the set of states 
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of the automaton and forming a new automaton whose states are the equivalence classes of 
the relation. The rest of this section is devoted to developing an algorithm to carry out this 
process in a practical way. 


*-Equivalence of States 


Two states of a finite-state automaton are said to be *-equivalent (this is read “star equiv- 
alent”) if, and only if, any string accepted by the automaton when it starts from one of 
the states is accepted by the automaton when it starts from the other state. Recall that 
the value of the eventual-state function, N*, for a state s and input string w is the state to 
which the automaton goes if the characters of w are input in sequence when the automa- 
ton is in state s. 


Definition 


Let A be a finite-state automaton with next-state function N and eventual-state func- 
tion N”. Define a binary relation on the set of states of A as follows: Given any states 
s and t of A, we say that s and ¢ are *-equivalent and write s R+ t if, and only if, for 
each input string w, 


either both N"(s, w) and N*(t, w) are accepting states or both 
are nonaccepting states. 


In other words, states s and f are *-equivalent if, and only if, for each input string w, 
N*(s, w) is an accepting state <> N’(t, w) is an accepting state. 
Or, more simply, for each input string w, 


f goes to an accepting state i L f goes to an accepting state "| 


w is input when A is in state s w is input when A is in state t 


It follows immediately, by substitution into the definition, that 


R» is an equivalence relation on S, the set of states of A. 12.3.1 | 


You are asked to prove this formally in the exercises at the end of this section. 


k-Equivalence of States 


From a procedural point of view, it is difficult to determine the -equivalence of two states 
using the definition directly. According to the definition, you must know the action of the 
automaton starting in states s and f on all input strings in order to tell whether s and t are 
equivalent. But since most languages have infinitely many input strings, you cannot check 
individually the effect of every string that is input to an automaton. As a practical matter, 
you can tell whether or not two states s and ¢ are *-equivalent by using an iterative proce- 
dure based on a simpler kind of equivalence of states called k-equivalence. Two states are 
k-equivalent if, and only if, any string of length less than or equal to k that is accepted by 
the automaton when it starts from one of the states is accepted by the automaton when it 
starts from the other state. 
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Definition 


Let A be a finite-state automaton with next-state function N and eventual-state func- 
tion N*. Define a relation on the set of states of A as follows: Given any states s and 
t of A and an integer k = 0, we say that s is k-equivalent to ¢ and write s Ryt if, and 
only if, for every input string w of length less than or equal to k, either N”(s, w) and 
N”(t, w) are both accepting states or they are both nonaccepting states. 


Certain useful facts follow quickly from the definition of k-equivalence: 


For each integer k = 0, k-equivalence is an equivalence relation. 123.2 
For each integer k = 0, the k-equivalence classes partition the set of all 

states of the automaton into a union of mutually disjoint subsets. 123.3 
For each integer k = 1, if two states are k-equivalent, then they are 

also (k — 1) equivalent. 12.3.4 
For each integer k = 1, each k-equivalence class is a subset of a 

(k — 1)-equivalence class. 235 


Any two states that are k-equivalent for every integer k = 0 are *-equivalent. 12.3.6 


Proofs of these facts are left for the exercises. 

The following theorem gives a recursive description of k-equivalence of states. It says, 
first, that any two states are 0-equivalent if, and only if, either both are accepting states or 
both are nonaccepting states and, second, that any two states are k-equivalent (for k = 1) 
if, and only if, (1) they are (k — 1)-equivalent and (2) for any input symbols their next-states 
are also (k — 1)-equivalent. 


Theorem 12.3.1 


Let A be a finite-state automaton with next-state function N. Given any states s and 
tin A, 


o either s and ż are both accepting states 
1. s is O-equivalent tot <> | pang | 


or they are both nonaccepting states 


s and t are (k — 1)-equivalent, and 
2. for every integer k= 1, < for any input symbol m, N(s, m) and 
s is k-equivalent to t N(t, m) are also (k — 1)-equivalent 


The truth of Theorem 12.3.1 follows from the fact that inputting a string w of length k 
has the same effect as inputting the first symbol of w and then the remaining k — | symbols 
of w. A detailed proof is somewhat technical. 

Theorem 12.3.1 implies that if you know which states are (k — 1)-equivalent (where k is 
a positive integer) and if you know the action of the next-state function, then you can figure 
out which states are k-equivalent. Specifically, if s and t are (k — 1)-equivalent states whose 
next-states are (k — 1)-equivalent for every input symbol m, then s and f are k-equivalent. 
Thus the k-equivalence classes are obtained by subdividing the (k — 1)-equivalence classes 
according to the action of the next-state function on the members of the classes. An ex- 
ample should make this procedure clear. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


862 CHAPTER12 REGULAR EXPRESSIONS AND FINITE-STATE AUTOMATA 


Cuvee Finding k-Equivalence Classes 


Find the 0-equivalence classes, the 1-equivalence classes, and the 2-equivalence classes for 
the states of the automaton shown below. 


= 


Solution 


1. 0-equivalence classes: By Theorem 12.3.1 two states are 0-equivalent if, and only if, 
both are accepting states or both are nonaccepting states. Thus there are two sets of 
0-equivalent states: 


{5o, 51, 4} (the nonaccepting states) and {s, 53} (the accepting states), 
and so 


the 0-equivalence classes are {sq, S1, 54} and {s2, 53}. 


2. 1-equivalence classes: By Theorem 12.3.1, two states are 1-equivalent if, and only 
if, they are 0-equivalent and, after input of any input symbol, their next-states are 
0-equivalent. Thus s; is not 1-equivalent to sọ because when a 0 is input to the au- 
tomaton in state sı it goes to state s2, whereas when a 0 is input to the automaton 
in state sọ it goes to state so, and s and sọ are not 0-equivalent. On the other hand, 
sı is 1-equivalent to są because when a 0 is input to the automaton in state s; or s4 
the next-states are sy and s3, which are 0-equivalent; and when a 1 is input to the 
automaton in state s} or s4 the next-states are sy and sı, which are 0-equivalent. 
By a similar argument, s2 is 1-equivalent to s3. Since 1-equivalent states must also 
be 0-equivalent [by property (12.3.4)], no other pairs of states can be 1-equivalent. 
Hence 


the 1-equivalence classes are {so}, {s1, 54}, and {5,53}. 


3. 2-equivalence classes: By Theorem 12.3.1, two states are 2-equivalent if, and only 
if, they are 1-equivalent and, after input of any input symbol, their next-states are 
1-equivalent. Now sı is 2-equivalent to s4 because they are l-equivalent; and when 
a l is input to the automaton in state sı or s4 the next-states are są and sı, which are 
1-equivalent; and when a 0 is input to the automaton in state sı or s4 the next-states are 
Sj and s3, which are |-equivalent. Similarly, s is 2-equivalent to s3. Since 2-equivalent 
states must also be 1-equivalent /by property (12.3.4)j, no other pairs of states can be 
2-equivalent. Hence 


the 2-equivalence classes are {so}, {5,, s4}, and {59, 53}. 


Note that the set of 2-equivalence classes equals the set of 1-equivalence classes. E 
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Finding the «-Equivalence Classes 


Example 12.3.2 illustrates the relative ease with which the sets of k-equivalence classes 
of states can be found. But to simplify a finite-state automaton, you need to find the set of 
*-equivalence classes of states. The next theorem says that for some integer K, the set 
of «-equivalence classes equals the set of K-equivalence classes. 


Theorem 12.3.2 


If A is a finite-state automaton, then for some integer, K = 0, the set of K-equivalence 
classes of states of A equals the set of (K + 1)-equivalence classes of states of A, and 
for all such K these are both equal to the set of *-equivalence classes of states of A. 


The detailed proof of Theorem 12.3.2 is somewhat technical, but the idea of the proof 
is not hard to understand. Theorem 12.3.2 follows from the fact that for each positive 
integer k, the k-equivalence classes are obtained by subdividing the (k — 1)-equivalence 
classes according to a certain rule that is the same for each k. Since the number of states 
of the automaton is finite, this subdivision process cannot continue forever, and so for 
some integer K = 0, the set of K-equivalence classes equals the set of (K + 1)-equivalence 
classes. Moreover, the set of m-equivalence classes equals the set of K-equivalence classes 
for every integer m = K, which implies that the set of +-equivalence classes equals the set 
of K-equivalence classes. 


el veer Finding «-Equivalence Classes of R 


Let A be the finite-state automaton defined in Example 12.3.2. Find the *-equivalence 
classes of states of A. 


Solution According to Example 12.3.2, the set of 1-equivalence classes for A equals the 
set of 2-equivalence classes. By Theorem 12.3.2, then, the set of *-equivalence classes also 
equals the set of 1-equivalence classes. Hence 


the *-equivalence classes are {sọ}, {s1, 54}, and {so, 53}. 


In the notation of Section 8.3, the equivalence classes are denoted 


[so] = {so} [si] = (51, s4} = [s4] [92] = (50, 53} = [s3]. m 


The Quotient Automaton 


We next define the quotient automaton A of an automaton A. However, in order for all 
parts of the definition to make sense, we must make two observations. 


No *-equivalence class of states of A can contain both accepting 


and nonaccepting states. 12.3.7 


The reason this is true is that the 0-equivalence classes divide the set of states of A 
into accepting and nonaccepting states, and the *-equivalence classes are subsets of 
0-equivalence classes. 


If two states are *-equivalent, then their next-states are also 


*-equivalent for each input symbol m. 12.3.8 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


864 CHAPTER12 REGULAR EXPRESSIONS AND FINITE-STATE AUTOMATA 


This is true for the following reason. Suppose states s and f are *-equivalent. Then any 
input string that sends A to an accepting state when A is in state s sends A to an accepting 
state when A is in state t. Now suppose m is any input symbol, and consider the next-states 
Ns, m) and N(t, m). Inputting a string of length k to A when A is in state N(s, m) or N(t, m) 
produces the same effect as inputting a certain string of length k + 1 to A when A is in state 
s or t (namely, the concatenation of m with the string of length k). Hence any string that 
sends A to an accepting state when A is in state N(s, m) also sends A to an accepting state 
when A is in state M(t, m). It follows that M(s, m) and N(t, m) are *-equivalent. Complete 
proofs of properties (12.3.7) and (12.3.8) are left to the exercises. 

Now we can define the quotient automaton A of A. It is the finite-state automaton whose 
states are the +*-equivalence classes of states of A, whose initial state is the *-equivalence 
class containing the initial state of A, whose accepting states are of the form [s] where s is 
an accepting state of A, whose input symbols are the same as the input symbols of A, and 
whose next-state function is derived from the next-state function for A in the following 
way: To find the next-state of A for a state s and an input symbol m, pick any state t in [s] 
and look to see what next-state A goes to if m is input when A is in state t, the equivalence 
class of this state is the next-state of A. 


Definition 


Let A be a finite-state automaton with set of states S, set of input symbols /, and next- 

state function N. The quotient automaton A is defined as follows: 

1. The set of states, S, of A is the set of +-equivalence classes of states of A. 

2. The set of input symbols, 7, of A equals 7. 

3. The initial state of A is [so], where sọ is the initial state of A. 

4. The accepting states of A are the states of the form [s], where s is an accepting 
state of A. 

5. The next-state function N: § X I — S is defined as follows: 


For all states [s] in S and input symbols min J, N({s], m) = [N(s, m)]. 


(That is, if m is input to A when A is in state [s], then A goes to the state that is the 
*-equivalence class of N(s, m).) 


Note that since the states of A are sets of states of A, A generally has fewer states than A. 
(A and A have the same number of states only in the case where each *-equivalence class 
of states contains just one element.) Also, by property (12.3.7), each accepting state of A 
consists entirely of accepting states of A. Furthermore, property (12.3.8) guarantees that 
the next-state function N is well defined. 

By construction, a quotient automaton A accepts exactly the same strings as A. We state 
this formally as Theorem 12.3.3 and leave the details of a proof to a more advanced course 
in automata theory. 


Theorem 12.3.3 


If A is a finite-state automaton, then the quotient automaton A accepts exactly the 
same languages as A. In other words, if L(A) denotes the language accepted by A and 
L(A) denotes the language accepted by A, then 


EA LA): 
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Constructing the Quotient Automaton 


Let A be a finite-state automaton with set of states S, next-state function N, relation R, of 
*-equivalence of states, and relation R, of k-equivalence of states. It follows from Theo- 
rems 12.3.2 and 12.3.3 and from the definition of quotient automaton that to find the quo- 
tient automaton A of A, you can proceed as follows: 


1. Find the set of 0-equivalence classes of S. 


2. For each integer k = 1, subdivide the (k— 1)-equivalence classes of S (as described 
earlier) to find the k-equivalence classes of S. Stop subdividing when you observe that 
for some integer K the set of (K + 1)-equivalence classes equals the set of K-equivalence 
classes. At this point, conclude that the set of K-equivalence classes equals the set of 
-equivalence classes. 


3. Construct the quotient automaton A whose states are the «equivalence classes of states 
of A and whose next-state function N is given by 


N({s], m) = LNs, m)] for each state s of A and each input symbol m. 


[That is, to see where A goes if m is input to A when it is in state [s], look to see where A goes 
ifm is input to A when it is in state s. The «-equivalence class of that state is the answer.] 


Clever Constructing a Quotient Automaton 


Consider the automaton A of Examples 12.3.2 and 12.3.3. This automaton is shown again 
below for reference. Find the quotient automaton of A. 


Solution According to Example 12.3.3, the *-equivalence classes of the states of A are 
{So}, {S1 54}, and {s2, 53}. 
Hence the states of the quotient automaton A are 
[so] = {so}, [51] = {51, 84} = [s4], [52] = {52, 53} = [s3]. 


The accepting states of A are s, and 53, so the accepting state of A is [sy] = [s3]. The next- 
state function N of A is defined as follows: for all states [s] and input symbols m of A, 


N([s], m) = [Ms, m)] = the +-equivalence class of N(s, m). 
Thus 

N([so], 0) = [N(so, 0)] = the *-equivalence class of N(sọ, 0). 
Now M59, 0) = So, and so 


N([so], 0) = the +-equivalence class of sọ = [sọ]. 
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Similarly, 
M[sol. 1) = [N(s9, D] = [51] 
N([s,], 0) = [N(1, 0)] = [s2] 
Msi]. 1) = [N(s1, D] = [sa] = [s1] 
N([s2], 0) = [N(S2, 0)] = [s3] = [s2] 
N([s2], 1) = [M(s2, 1)] = [s4] = [s1]. 


The transition diagram for A is, therefore, as shown below. 


[so] 


By Theorem 12.3.3, this automaton accepts the same language as the original automaton. 
a 


Equivalent Automata 


When a finite-state automaton is implemented by a circuit, output indicators may be at- 
tached to its states to indicate whether they are accepting or nonaccepting. For example, 
accepting states might produce an output of | and nonaccepting states an output of 0. Then 
a finite-state automaton can be thought of as an input/output device whose input consists 
of strings and whose output consists of 0’s and 1’s. Recall that a circuit can be thought of 
as a black box that transforms combinations of input signals into output signals. Two cir- 
cuits that produce identical output signals for each combination of input signals are called 
equivalent. Similarly, a finite-state automaton can be regarded as a black box that pro- 
cesses input strings and produces output signals (indicating whether or not the strings are 
accepted). Two finite-state automata are called equivalent if they produce identical output 
signals for each input string. This implies that two finite-state automata are equivalent if, 
and only if, they accept the same language. 


Definition 


Let A and A’ be finite-state automata with the same set of input symbols Z. Let L(A) 
denote the language accepted by A and L(A’) the language accepted by A’. Then A is 
said to be equivalent to A’ if, and only if, L(A) = L(A’). 


Showing That Two Automata Are Equivalent 


Show that the automata A and A’ that follow are equivalent. 


The label 0, 1 on an arrow of a transition 
diagram means that for either input 0 or 
1, the next-state of the automaton is the 


state to which the arrow points. 
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Solution 
For the automaton A: The 0-equivalence classes are 


since sọ and sı are accepting states and 


{so s1} and {59, 53} z 
S and s3 are nonaccepting states. 


The l-equivalence classes are since sọ and s; are not l-equivalent (be- 
cause N(so, 1) = sı, and M(s,, 1) = s3, and 
{so}, {sı} and {s2, s3} sı is not 0-equivalent to s3) but s and s3 


. are l-equivalent. 
The 2-equivalence classes are q 


{so}, {sı} and {s2, s3} since s and s3 are l-equivalent. 


This discussion shows that the set of 1-equivalence classes equals the set of 2-equiva- 
lence classes, so by Theorem 12.3.2 this is equal to the set of *-equivalence classes. Hence 
the «-equivalence classes are 

{So}, {sy}, and {5o, S3}. 


For the automaton A': By reasoning similar to that done previously, the 0-equivalence 
classes are 


{s0 5253}, and {s1}. 
The l-equivalence classes are 
{dso s3} {s2}, and {si}. 


The 2-equivalence classes are the same as the l-equivalence classes, which are therefore 
equal to the *-equivalence classes. Thus the *-equivalence classes are 


{tso s3) {so}, and {si}. 


To calculate the next-state functions for A and A’, you repeatedly use the fact that in 
the quotient automaton, the next-state of [s] and m is the class of the next-state of s and m. 
For instance, 


Msi], 1) = [M(s1, D] = [s3] = [s2] 
and N'(Eso], 0) = [N'(s0, 0)] = [s3] = [so], 


where N is the next-state function for A and N’ is the next-state function for A’. 
The complete transition diagrams for the quotient automata A and A’ are shown below. 


As you can see, except for the labeling of the names of the states, A and A’ are identical 
and hence accept the same language. Now by Theorem 12.3.3, each original automaton 
accepts the same language as its quotient automaton. Thus A and A’ accept the same lan- 
guage, and so they are equivalent. E 


In mathematics an object such as a finite-state automaton is called a structure. In gen- 
eral, when two mathematical structures are the same in all respects except for the labeling 
given to their elements, they are called isomorphic, which comes from the Greek words 
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isos, Meaning “same” or “equal,” and morphe, meaning “from.” It can be shown that two 
automata are equivalent if, and only if, their quotient automata are isomorphic, provided 
that “inaccessible states” have first been removed. (Inaccessible states are those that cannot 
be reached by inputting any string of symbols to the automaton when it is in its initial state.) 


TEST YOURSELF 


1. Given a finite-state automaton A with eventual- are_______ or both are . Moreover, for ev- 
state function N* and given any states s and fin A, ery integer k = 1, s is k-equivalent to t if, and only 
we say that s and ¢ are *-equivalent if, and only if, if, (1) s and t are (k — 1)-equivalent and (2) ——— 

. If A is a finite-state automaton, then for some 
2. Given a finite-state automaton A with eventual- integer K = 0, the set of K-equivalence classes of 


EXERCISE SET 12.3 


state function N* and given any states s and fin A, 
we say that s and ¢ are k-equivalent if, and only if, 


Given states s and ¢ in a finite-state automaton A, s 
is 0-equivalent to rif, and only if, either both s and t 


states of A equals the set of -equivalence 
classes of A, and for all such K these are both 
equal to the set of 


. Given a finite-state automaton A, the set of states 


of the quotient automaton A is 


1. Consider the finite-state automaton A given by the a. Find the 0-, 1-, and 2-equivalence classes of 
following transition diagram: states of A. 
b. Draw the transition diagram for A, the quotient 
automaton of A. 
. Consider the finite-state automaton A discussed in 
Example 12.3.1: 
a. Find the 0-, 1-, and 2-equivalence classes of 
states of A. 
b. Draw the transition diagram for A, the quotient 
automaton of A. 
2. Consider the finite-state automaton A given by the a. Find the 0- and 1-equivalence classes of states 


following transition diagram: 
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of A. 
b. Draw the transition diagram of A, the quotient 
automaton of A. 


12.3 SIMPLIFYING FINITE-STATE AUTOMATA 869 


4. Consider the finite-state automaton given by the 5. Consider the finite-state automaton given by the 
following transition diagram: following transition diagram: 


a. Find the 0-, 1-, 2-, and 3-equivalence classes of 
a. Find the 0-, 1-, 2-, and 3-equivalence classes of states of A. a . = . 
states of A: b. Draw the transition diagram for A, the quotient 


b. Draw the transition diagram for A, the quotient automaton of A. 


automaton of A. 


6. Consider the finite-state automaton given by the following transition diagram: 


Ha. Find the 0-, 1-, 2-, and 3-equivalence classes of states of A. 
b. Draw the transition diagram for A, the quotient automaton of A. 


7. Are the automata A and A’ shown below equivalent? 
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8. Are the automata A and A’ shown below equivalent? 


H 15. 


CHAPTER12 REGULAR EXPRESSIONS AND FINITE-STATE AUTOMATA 


. Prove property (12.3.1). 
. How should the proof of property (12.3.1) be 


modified to prove property (12.3.2)? 


. Prove property (12.3.3). 
. Prove property (12.3.4). 


Prove property (12.3.5). 


16. 


18. 


Prove property (12.3.6). 


. Prove that if two states of a finite-state automaton 


are k-equivalent for some integer k, then those 
states are m-equivalent for every nonnegative 
integer m < k. 


Write a complete proof of property (12.3.7). 


. Write a complete proof of property (12.3.8). 
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ANSWERS FOR TEST YOURSELF 
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1. for all input strings w, either N*(s, w) and N*(t, w) are both 
accepting states or both are nonaccepting states 

2. for all input strings w of length less than or equal to k, 
either N*(s, w) and N*(t, w) are both accepting states or both 


are nonaccepting states 3. accepting states; nonaccepting 
states; for any input symbol m, N(s, m) and N(t, m) are also 
(k—1)-equivalent 4. (K + 1); «-equivalence classes of 
states of A 5. the set of *«-equivalence classes of states of A 
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PROPERTIES OF THE 
REAL NUMBERS* 


In this text we take the real numbers and their basic properties as our starting point. We 
give a core set of properties, called axioms, which the real numbers are assumed to satisfy, 
and we state some useful properties that can be deduced from these axioms. 

We assume that there are two binary operations defined on the set of real numbers, 
called addition and multiplication, such that if a and b are any two real numbers, the sum 
of a and b, denoted a +b, and the product of a and b, denoted a: b or ab, are also real 
numbers. These operations satisfy properties FI-F6, which are called the field axioms. 


Fl. Commutative Laws For all real numbers a and b, 
at+b=b+a and ab=ba. 
F2. Associative Laws For all real numbers a, b, and c, 
(at+b)+c=at(bt+c) and (ab)c = a(be). 
F3. Distributive Laws For all real numbers a, b, and c, 
ab+c)=ab+ac and (b+c)a= ba+ca. 


F4. Existence of Identity Elements There exist two distinct real numbers, denoted 0 and 1, 
such that for every real number a, 


O+a=at+O0O=a and l-a=a:l=a. 


F5. Existence of Additive Inverses For every real number a, there is a real number, 
denoted —a and called the additive inverse of a, such that 


a+(—a) =(-a)+a=0. 


F6. Existence of Reciprocals For every real number a # 0, there is a real number, 
denoted 1/a or a™', called the reciprocal of a, such that 


All the usual algebraic properties of the real numbers that do not involve order can be 
derived from the field axioms. The most important are collected as theorems T1-T16 as 
follows. In all these theorems the symbols a, b, c, and d represent arbitrary real numbers. 


Tl. Cancellation Law for Addition If a+b = a+c, then b = c. (In particular, this 
shows that the number 0 of Axiom F4 is unique.) 


T2. Possibility of Subtraction Given a and b, there is exactly one x such that a+ x = b. 
This x is denoted by b — a. In particular, 0 — a is the additive inverse of a, —a. 


“Adapted from Tom M. Apostol, Calculus, Volume I (New York: Blaisdell, 1961), pp. 13-19. 


A-1 
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T3. b—-a=b+(—-a). 
T4. —(—a) =a. 

T5. a(b—c) = ab- ac. 
T6. 0-a=a:0=0. 


T7. Cancellation Law for Multiplication If ab = ac anda # 0, then b = c. (In particu- 
lar, this shows that the number 1 of Axiom F4 is unique.) 


T8. Possibility of Division Given a and b with a # 0, there is exactly one x such that 
ax = b. This x is denoted by b/a and is called the quotient of b and a. In particular, 
1/a is the reciprocal of a. 


T9. Ifa + 0, then b/a = bea. 
T10. Ifa + 0, then (a7 h~! = a. 
T11. Zero Product Property If ab = 0, then a = 0 or b = 0. 
T12. Rule for Multiplication with Negative Signs 
(—a)b = a(—b) = —-(ab),,  (—a)(—b) = ab, 


and 


T13. Equivalent Fractions Property 


ao ae ifb + Oandc +0. 


T14. Rule for Addition of Fractions 


ae ad + bc 
b d bd ’ 


ifb + Oandd # 0. 


T15. Rule for Multiplication of Fractions 


ac ac 
<= if b d f 
bd bd’ i Oandd #0 


T16. Rule for Division of Fractions 


a 

b ; 

= be if b #0,c #0, and d # 0. 
d 


The real numbers also satisfy the following axioms, called the order axioms. It is assumed 
that among all real numbers there are certain ones, called the positive real numbers, that 
satisfy properties Ord!—Ord3. 


Ordl. For any real numbers a and b, if a and b are positive, so are a + b and ab. 
Ord2. For every real number a # 0, either a is positive or —a is positive but not both. 


Ord3. The number 0 is not positive. 
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The symbols <, >, <, and =, and negative numbers are defined in terms of positive numbers. 


Definition 


Given real numbers a and b, 


a < b means b + (—a) is positive. b > a meansa < b. 
a Sb meansa < bora =b. b = a means a = b. 
Ifa < 0, we say that a is negative. Ifa = 0, we say that a is nonnegative. 


From the order axioms Ordl—Ord3 and the above definition, all the usual rules for cal- 
culating with inequalities can be derived. The most important are collected as theorems 
T17-T27 as follows. In all these theorems the symbols a, b, c, and d represent arbitrary 
real numbers. 


T17. Trichotomy Law For arbitrary real numbers a and b, exactly one of the three rela- 
tions a < b,b <a, ora = b holds. 


T18. Transitive Law Ifa<bandb<c,thena<c. 

T19. Ifa < b, thena+c < b+c. 

T20. Ifa < b and c > 0, then ac < bc. 

T21. If a # 0, then a’ > 0. 

T22. 1 >0. 

T23. Ifa < b and c <0, then ac > bc. 

T24. Ifa < b, then —a > —b. In particular, if a < 0, then —a > 0. 
T25. If ab > 0, then both a and b are positive or both are negative. 
T26. Ifa < cand b < d, thena +b < c+d. 

T27. IfO0 <a < cand0 <b < d, then 0 < ab < cd. 


One final axiom distinguishes the set of real numbers from the set of rational numbers. It 
is called the least upper bound axiom. 


LUB. Any nonempty set S of real numbers that is bounded above has a least upper bound. 
That is, if B is the set of all real numbers x such that x = s for every s in S and if B 
has at least one element, then B has a smallest element. This element is called the 
least upper bound of S. 


The least upper bound axiom holds for the set of real numbers but not for the set of ratio- 
nal numbers. For example, the set of all rational numbers that are less than V2 has upper 
bounds but not a least upper bound within the set of rational numbers. 
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SOLUTIONS AND HINTS 
TO SELECTED EXERCISES 


SECTION 1.1 


1. a. 
b. 


To 


os 


10. a. 


12. a. 


x? = —1 (Or: the square of x is —1) 
a real number x 


between a and b 
distinct real numbers a and b; there is a real number c 


r is positive 

positive; the reciprocal of r is positive (Or: positive; 
1/r is positive) 

is positive; 1/r is positive (Or: is positive; the recip- 
rocal of r is positive) 


. There are real numbers whose sum is less than their 


difference. 

True. For example, 1+(—1)=0,1-(-1)= 
1+1=2, and0 <2. 

The square of each positive integer is greater than 
or equal to the integer. 


True. If n is any positive integer, then n = 1. Mul- 
tiplying both sides by the positive number n does 
not change the direction of the inequality (see 
Appendix A, T20), and so nn. 


have four sides 

has four sides 

has four sides 

is a square; has four sides 
J has four sides 


have reciprocals 
a reciprocal 
s is a reciprocal for r 


real number; product with every number leaves the 
number unchanged 


b. with every number leaves the number unchanged 


c. 


SSS 


SECTION 1.2 


1. A=CandB=D 


2. a. 


C. 


The set of all positive real numbers x such that 0 is 
less than x and x is less than 1 
The set of all integers n such that n is a factor of 6 


10. 


Tl. 


a. No, {4} is a set with one element, namely 4, where- 
as 4 is just a symbol that represents the number 4 

b. Three: the elements of the set are 3, 4, and 5. 

c. Three: the elements are the symbol 1, the set {1}, 
and the set {1, {1}} 


Hint: R is the set of all real numbers, Z is the set of all 
integers, and Z* is the set of all positive integers. 


. Hint: Ty and T, do not have the same number of elements 


as T, and T_3. 


a. {1, —1} 
c. the set has no elements 
d. Z (every integer is in the set) 


. a. No, B ZA because j E B and j €A 


d. Yes, C is a proper subset of A. Both elements of C 
are in A, but A contains elements (namely c and f ) 
that are not in C. 


a. Yes 

b. No, the number 1 is not a set and so it cannot be 
a subset. 

f. No, the only element in {2} is the number 2 and 
the number 2 is not one of the three elements in 
{L {2}, {3}}. 

i. Yes, the only element in {1} is the number 1, which 
is an element in {1, {2}}. 


a. No. Observe that (—2)? = (—2)(—2) = 4, 
whereas —2? = —(27) = —4. So ((—2)°, —2°) = 
(4, —4), whereas (—2”, (—2)*) = (—4, 4). And 
(4, —4) # (—4, 4) because —4 # 4. 
c. Yes. Note that 8—9 = —1 and W/—1 = 1, and so 
(8-9, Y-I) =(-1, —1). 


a. {(w, a), (w, b), (x, a), (x, b), O, a), Y, b), @, a), (z, bY} 
A X B has 4:2 = 8 elements. 

b. {(a, w), (b, w), (a, x), (b, x ), (a, y), (b, y), (a, z), (b, D} 
B X A has 4:2 = 8 elements. 

c. {(w, w), (w, x), (w, y), w, 2), œ, w), Œ, x), OY), Œ, 2), 
Q, w), O, 2, O, Y), O, 2) E, w), (ZX), E Y), E d} 
A X A has 4:4 = 16 elements. 

d. {(a, a), (a, b), (b, a), (b, b)} B X B has 2:2 = 4 
elements. 


A-4 
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13. a. A X (B X C) = {(1, u, m)), (1, (u, n), A s B j T B 
(2, (u, m), (2, (u, n)), (3, (u, m)), (3, (u, n} TE E STE 
b. (A X B) X C= {(, u), m), (A, u), n), 
((2, u), m), (2, u), n), (3, u), m), (3, 1), n)} a —# Pa 


c. AXBX C= {(1, u, m), (1, u, n), (2, u, m), 
(2, u, n), (3, u, m), (3, u, n)} 


15. 0000, 0001, 0010, 0100, 1000 


b. R is not a function because it satisfies neither 
property (1) nor property (2) of the definition. It 
fails property (1) because (4, y) É R, for any y in 


SECTION 1.3 B. It fails property (2) because (6, 5) E R and 
1. a. No. Yes. No. Yes. (6, 6) S R and 37 6. 
_ S is not a function because (5, 5) E S and (5, 7) 
b. R= {(2, 6), (2, 8), (2, 10), G, 6), (4, 8)} © Sand5#7.S0Sd cap ck ei) oT 
c. Domain of R = A = {2, 3, 4}, co-domain of an, N 220 OSS not satisfy property (2) o 
R =B = (6,8, 10} the definition of function. 
d iiai T is not a function both because (5, x) É T for 


any x in B and because (6, 5) E T and (6, 7) E T and 
5 # 7. So T does not satisfy either property (1) or 
property (2) of the definition of function. 


9. a. There is only one: {(0, 1), (1, D} 
b. {(0, D} (A, D} 


11. L(0201) = 4, L(12) = 2 


3. a. 370 because at = 3 = 1, which is an integer. 


1-(-1)_2 4... 
17 1) because —3 = 3, which is not an 13. a. Domain = A = {—1, 0, 1}, 
integer. et Se ee 
2-(-) 3 eS co-domain = B = {t, u, v, w} 
2, —1) E T because —3 =3=1, which is an b. F(-1) = u, FO) = w, F) =u 
integer. 
G, -2) & T because PCI S aah Ge pat aa 15. a. This diagram does not determine a function be- 
inten $ a cause 2 is related to both 2 and 6. 
b. T= {(1, 2), 2, —1), 6, 0} b. This diagram does not determine a function be- 
a Domain of T aa k E Ri 5 3}, co-domain of cause 5 is in the domain but it is not related to any 
T= F = {-2, -1,0} an element in the co-domain. 
; 2 1) _ (ly _1 
d am 16. f=) = ClY = 1, f0) = 0 = 0, fH = (5) = 3. 
B+ 2x? +1 
a | 19. For each x E R, g(x) = ae 2x ak +) = 
KFA P aia a | 
} AN 2x = f(x). Therefore, by definition of equality of func- 


tions, f = g. 
5. a. (2,1) E S because 2 = 1. (2, 2) E S because 2 = 2. 


283 because 2 ¥ 3. (—1) S (2) because —1 = —2. SECTION 1.4 


b. A 
1. ViG) = {v;, v2, v3, va}, E(G) = {e1, e2, e3} 
x = y in shaded region <— graph of S Edge-endpoint function: 
Edge Endpoints 
> x ey {v1, v2} 
e2 {v,, U3 } 
e3 {v3} 
3. e e4 
7. a. R T E ` 
6 U2 U4 Us 
e2 
Z rd 
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5. 


10. 


12. 


Imagine that the edges are strings and the vertices are 
knots. You can pick up the left-hand figure and lay it 
down again to form the right-hand figure as shown below. 


(i) e4, e2, and e; are incident on vj. 
Gi) v, v, and v3 are adjacent to v3. 
(ili) ez, eg, @9, and e3 are adjacent to e}. 
(iv) Loops are é¢ and e7. 
(v) eg and eg are parallel; e4 and es are parallel. 
(vi) vę is an isolated vertex. 
(vii) degree of v; = 5 


a. Yes. According to the graph, Sports Illustrated is an 
instance of a sports magazine, a sports magazine is a 
periodical, and a periodical contains printed writing. 


To solve this puzzle using a graph, introduce a notation 
in which, for example, wc / fg means that the wolf and 
the cabbage are on the left bank of the river and the 
ferryman and the goat are on the right bank. Then draw 
those arrangements of wolf, cabbage, goat, and ferry- 
man that can be reached from the initial arrangement 
(wgcf/) and that are not arrangements to be avoided 
(such as (wg / fc)). At each stage ask yourself, “Where 
can I go from here?” and draw lines or arrows pointing 
to those arrangements. This method gives the graph 
shown below. 

Start 


( wgcfl ) 


1.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-6 


Examining the diagram reveals the solutions 


(wgcf h) — (we | gf) > (wef! 8) > w! gcf) > 
(wef! c) > (g/ wef) > (gf / we) > (/ wecf) 


and 


(wgcf!) — (we! gf) > wef g) > (c/ wef) > 
(gcf/ w) — (8 / wef) — (gf/ we) > (I wecf) 


14. Hint: The answer is yes. Represent possible amounts 


of water in jugs A and B by ordered pairs. For instance, 
the ordered pair (1, 3) would indicate that there is 
one liter of water in jug A and three liters in jug B. 
Starting with (0, 0), draw arrows from one ordered 
pair to another if it is possible to go from the situ- 
ation represented by one pair to that represented 

by the other by either filling a jug, emptying a jug, 
or transferring water from one jug to another. You 
need only draw arrows from states that have arrows 
pointing to them; the other states cannot be reached. 
Then find a directed path (sequence of directed 
edges) from the initial state (0, 0) to a final state 

(1, 0) or (0, 1). 


3 


Vertex e has maximal degree, so color it with color #1. 
Vertex a does not share an edge with e, and so color #1 
may also be used for it. From the remaining uncolored 
vertices, all of d, g, and fhave maximal degree. Choose 
any one of them—say, d—and use color #2 for it. Ob- 
serve that vertices c and f do not share an edge with d, 
but they do share an edge with each other, which means 
that color #2 may be used for one but not the other. 
Choose to color f with color #2 because the degree of f 
is greater than the degree of c. The remaining uncolored 
vertices, b, c, and g, are unconnected, and so color #3 
may be used for all three. 


16. Hint: There are two solutions: 


(1) Time 1: hiring, library 
Time 2: personnel, undergraduate education, 
colloquium 
Time 3: graduate education 

(2) Time 1: hiring, library 
Time 2: graduate education, colloquium 
Time 3: personnel, undergraduate education 
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A-7 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


SECTION 2.1 


1. 


12. 


14. 


Common form: If p then q. 

p. 

Therefore, q. 
(a+ 2b)\(a@ —b) can be written in prefix notation. All 
algebraic expressions can be written in prefix notation. 


Common form: p V q. 
~p. 
Therefore, q. 


My mind is shot. Logic is confusing. 


a. Itis a statement because it is a true sentence. 1,024 
is a perfect square because 1,024 = 32°, and the 
next smaller perfect square is 317 = 961, which has 


fewer than four digits. 
a. sA\i b. ~sA~i 
a. (hAw)A~s d. (~wA~s)Ah 


a. pvq 


a pAqgAr & pA(~qV~r) 


- Inclusive or. For instance, a team could win the playoff 


by winning games 1, 3, and 4 and losing game 2. Such 
an outcome would satisfy both conditions. 


P 4 |P P^ 

T T| F F 

T F| F F 

F T| T T 

F F| T F 

P q r | qAr | pA@Ar) 
T T T| T T 
T T F| F F 
T F T| F F 
T F F| F F 
F T T| T F 
F T F| F F 
F F T| F F 
F F F| F F 
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16. 


18. 


21. 


23. 


P q \PAq |PV@Ad |P 
T T T T T 
T F F T T 
F T F F F 
F F F F F 
Pp V (p ^ q) and p always have the 
same truth values, so they are 
logically equivalent. (This proves 
one of the absorption laws.) 
P t | pvt 
T T T 
F T T 


p V t and t always have the same truth 
values, so they are logically equivalent. 
(This proves one of the universal bound laws.) 


` 


p^q (pAg)Ar | pPA@GAr) 
T T T T 


q/r 


mila) ml/m}a]/Ha}a|/4a}s 
mlm} slalnm} ml Ha}lHa}s 
“lalalalalalala 
“lalalalalala 
“lalalalala 
“lalala om] a] on 
“lalala mala 


Ci 


(p^q)^r and p ^ (q ^ r) always have the same truth values, 
so they are logically equivalent. (This proves the associative law 
for /.) 


PAGVAIVT|(PADVT|PAGvn 


“lalalalalalala 


a|ulalalm| malaje 
“m|a|ml|alml|almjals 
adlalm}m}a) a} a}al> 
MialAalalm) alsa 
mialm}ala)ala 
dioalm}aia) ala 


as | 


(p ^q) vrand p ^ (q V r) have different truth values in the fifth 
and seventh rows, so they are not logically equivalent. (This 
proves that parentheses are needed with ^ and v.) 


25. Hal is not a math major or Hal’s sister is not a computer 


science major. 


27. The connector is not loose and the machine is not un- 
plugged. 


31. a. 01, 02, 11, 12 


32. —2=xorx=7 


2.1 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-8 


34.2<x=5 
36. | =xorx< —3 


38. This statement’s logical form is (p ^ q) Vr, 
so its negation has the form ~((p ^q) v r) = 
~(p \q)\ ~r = (~p V ~q) ^ ~r. Thus a negation for 
the statement is (num_orders = 100 
or num_instock > 500) and num_instock = 200. 


oe p q4 | ~p | ~q | PAI | parma | ~PVPA~D | PADV EPV PA) 
T TĪÎFĪ|F T F F T 
T F| FT F T T T 
F TÍTĪF F F T T 
F PTT F F T T 
Its truth values are all T’s, so 
(p\q)V (~PV (DA ~@)) 
is a tautology. 
mel 4 | ~P |) ~d | PA~d | ~va PA~DNOVD 
T T F F F T F 
T F F T T F F 
F T/T F F T F 
F F TÍT F T F 


Its truth values are all F’s, so (p A ~q) ^ (~p V q) is a contradiction. 


44.a. Noreal numbers satisfy this inequality 


46. a. Solution 1: Construct a truth table for p ® p using 
the truth values for exclusive or. 


P | p®p 

because an exclusive or statement 
T F ; 

is false when both components are 
F F true and when both components 

are false. 


Since all its truth values are false, p ® p = c, a con- 


tradiction. 


Solution 2: Replace q by p in the logical equiva- 
lence p Bg = (p v q) ^ ~(p ^ q), and simplify the 


result. 
p®p=(pvp^n~p^p) by definition of ® 
=p\~p by the identity laws 
=c by the negation law 
for ^ 


47. There is a famous story about a philosopher who once gave 
a talk in which he observed that whereas in English and 
many other languages a double negative is equivalent to a 
positive, there is no language in which a double positive is 
equivalent to a negative. To this, another philosopher, Sid- 
ney Morgenbesser, responded sarcastically, “Yeah, yeah.” 

[Strictly speaking, sarcasm functions like negation. 
When spoken sarcastically, the words “Yeah, yeah” are 
not a true double positive; they just mean “no.”] 


48. a. The distributive law 
b. The commutative law for V 
c. The negation law for V 
d. The identity law for ^ 


50. (pA ~q) Vp=PpV(pA~@q) by the commutative 
law for V 
=p by the absorption law 
(with ~q in place of q) 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A-9 APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


53. ~(~pAQV(~pA~Q)V(PAQ 13. a. p al = | pal ~pve 
= ~[~p ^ (q4 V ~@]V (PQ) bythe distributive law 
= e A ~~ ni q) = N the negation law for v 1 l E 1 2 
=~(~p)V (pq) by the identity law for ^ T F F F F 
=pv(p^q) by the double negative law F T T T T 
=p by the absorption law F F T T T 
t t 


SECTION 2.2 p — q and ~p V q always have the same 


truth values, so they are logically equivalent. 
1. If this loop does not contain a stop or a go to, then it . f f 
will repeat exactly N times. 14. a. Hint: p —> q V r is true in all cases except when p is 
true and both q and r are false. 
3. If you do not freeze, then Pll shoot. 
16. Let p represent “You paid full price” and q represent 


ion h hesi i : , 
5. ee “You didn’t buy it at Crown Books.” Thus, “If you paid 
p q ~p ~q ~pv4 | ~PvVqa—>~4 full price, you didn’t buy k at Crown Books” has the 
form p —> q. And “You didn’t buy it at Crown Books or 
T T F F T F you paid full price” has the form q v p. 
T F F T F T 
P q = qVP 
F T J F T F a4 
F F T T T T I I T T These two statements 
T F F T are not logically 
7. conclusion hypothesis equivalent because their 
F T T T forms have different 
P q 7 ~q PATI PATI Sr truth values in rows 2 
T T T F F T F F T F and 4. 
T T F F F T (An alternative representation for the forms of the two 
T F T T T T statements is pP>~q and ~q v p. In this case, the truth 
values differ in rows 1 and 3.) 
T F F T T F 
19. False. The negation of an if-then statement is not an if- 
F T T F F T then statement. It is an and statement. 
P T P Š F T 20. a. P is a square and P is not a rectangle. 
F F T T F T d. nis prime and both n is not odd and n is not 2. 
F F F T F T Or: n is prime and n is neither odd nor 2. 
f. Tom is Ann’s father and either Jim is not her uncle 
or Sue is not her aunt. 
9. 
P 4 VF \o~r| pAwr | avr | pAw~r<eagvr 21. a. Because p — q is false, p is true and q is false. 
Pep Tl sk F T F Hence ~p is false, and so ~p — q is true. 
T T EIT T T T 22. a. IfP is not a rectangle, then P is not a square. 
T F TIE F T F d. Ifnis not odd and n is not 2, then n is not prime. 
f. If either Jim is not Ann’s uncle or Sue is not her 
T F F|T T F F aunt, then Tom is not her father. 
F T T|F F T F 23. a. Converse: If P is a rectangle, then P is a square. 
F T EIT F T F Inverse: If P is not a square, then P is not a 
rectangle. 
F FT F F T F d. Converse: Ifnis odd or n is 2, then n is prime. 
F F FT F F T Inverse: If n is not prime, then n is not odd and 
nis not 2. 


12. If x > 2 then xX? > 4, and if x < —2 then xX > 4. 
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2.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-10 


f. Converse: If Jim is Ann’s uncle and Sue is her 26. p q 7 E ORES = 
aunt, then Tom is her father. q P q P\|P 4d 
Inverse: If Tom is not Ann’s father, then Jim is not T OF F F T T 
her uncle or Sue is not her aunt. 
T F T F F F 
24. 
P q P>4 | q4>P F T F T T T 
T T T T F F T T Si T 
T F| F T e 
~q —> ~p and p — q always have the same 
F T T F truth values, so they are logically equivalent. 
F F T T 28. Hint: A person who says “I mean what I say” claims to 
7 7 speak sincerely. A person who says “I say what I mean” 


— claims to speak with precision. 
p— q and q — p have different truth values 


in the second and third rows, so they are not 
logically equivalent. 


29. (p> (AVN) Oe (pa~) >r) 


PpP 4 r | ~| avr | pA~q) p>Qv | pA~qmr | Pe avne(A~g)—n 
T T 7 | F T F T T T 
T T F|F T F T T T 
T F PTT T T T T T 
T F F|T F Ti F F T 
F T T|F T F T T T 
F T F|F T F T T T 
F F Tf | T F T T T 
F F F|T F F T T T 


t 


(p > (q vV r)) & ((p ^A ~q) > r) is a tautology because 
all of its truth values are T. 


32. If this quadratic equation has two distinct real roots, 
then its discriminant is greater than zero, and if the 
discriminant of this quadratic equation is greater than 
zero, then the equation has two real roots. 


34 


If the Cubs do not win tomorrow’s game, then they will 
not win the pennant. 

If the Cubs win the pennant, then they will have won 
tomorrow’s game. 


37. Ifa new hearing is not granted, payment will be made 
on the fifth. 


40. If I catch the 8:05 bus, then I am on time for work. 


42. If this number is not divisible by 3, then it is not 
divisible by 9. 
If this number is divisible by 9, then it is divisible by 3. 


44. If Jon’s team wins the rest of its games, then it will win 
the championship. 


46. a. This statement is the converse of the given state- 
ment, and so it is not necessarily true. For instance, 
if the actual boiling point of compound X were 
200°C, then the given statement would be true but 
this statement would be false. 

b. This statement must be true. It is the contrapositive 
of the given statement. 
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A-11 APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


47. a. pA~q—>r=~(pA~gvr 8 premises conclusion 
. . = . maaa 
b. Result of (a) = ~[~(~(p A~q)) A ~r] p q r'\~q pva p>~q p>r F 

an acceptable answer 
=~[(pA~g) ^ ~r] T T T|F T F T 
by the double negative law T T FIF T F F 
(another acceptable answer) 
49.a. (p—>r)e (q4 >r)=(~pvr e (~qvr) EER 1 1 i 1 
=[~(~pvyr vca v NI niaar vop) TE F\T| T T F 
an acceptable answer 
= [pa~ v ay DIAG A~DV (pv OI E 1 i 1 
by De Morgan’s law (another E t plr T T T F pan 
acceptable answer) 
b. Result of (a) = ~[~(p A~) A~(~qv D] A F ETIT F T T 
~[~@G@A~n A~(~pvn)] 
by De Morgan’s law F F F| T F T T 
= ~i~ A nr) A Ann] A This row shows that it is possible for 
~[~qGA~NADpA~n] an argument of this form to have true 
by De Morgan’s law premises and a false conclusion. Thus 
this argument form is invalid. 
S E CTI O N 2.3 12. a. premises conclusion 
nvKkea—ooCOoOoooooooasr—vcvc oreo) 
1. V2isnotrational. 3. Logic is not easy. p q p>q q p 
6 Hanee gouclusion Ea T T T 
. 
P q P>q4 | PpP>q PV T F F F 
T T T T T E T T 1 E = 
T F F T F F T F 
F T T F This row shows that it is possible for an argument 
of this form to have true premises and a false 
F ap T F = conclusion. Thus this argument form is invalid. 
This row shows that it is possible for an argument | 14. premise conclusion 
of this form to have true premises and a false 
conclusion. Thus this argument form is invalid. P q P PV 
i . IÈ I T IN = 
7. premises conclusion 
T F T I = 
P 4 r \|~q|P|p->4qd| ~qvr r 
F T F 
TERETE ET I aw T i 
F F F 
T T F|ăF|T F 
These two rows show that in all 
T E TTT F T situations where the premise is true, 
T F FITT F T the conclusion is also true. Thus the 
argument form is valid. 
F T T|FJ|F T T : ; 
18. premises conclusion 
F T F|FJ|F T F 
P q PVG ~q Pp 
F F TITJF T T T T T F 
F F F/T I/F T T T F T T T = 
This row describes the only situation in which all the premises — F T T F 
are true. Because the conclusion is also true here, the argument 
form is valid. F F F T 


This row represents the only situation in which 
both premises are true. Because the conclusion 
is also true here the argument form is valid. 
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22. Let p represent “Tom is on team A” and q represent 
“Hua is on team B.” Then the argument has the form 


24. 


25. 


26. 


27. 


=P >q 
~4 >P 
SPY ~| 
premises conclusion 

P 4 |~P|\~q |\~P—>4|~4—>P|~PV~4 
T TF F T au F aii 
I le || 18 aly T w I 
ETT F T T T 
F F/T T F F 


This row shows that it is possible for an argument 
of this form to have true premises and a false conclusion. 
Thus this argument form is invalid. 


pq 


p invalid: converse error 


Sg valid: elimination 


a) 


valid: transitivity 


invalid: inverse error 


36. The program contains an undeclared variable. 
One explanation: 


1. 


There is not a missing semicolon and there is not a 
misspelled variable name. (by (c) and (d) and defini- 
tion of ^) 

It is not the case that there is a missing semicolon 
or a misspelled variable name. (by (1) and De 
Morgan’s laws) 

There is not a syntax error in the first five lines. (by 
(b) and (2) and modus tollens) 

There is an undeclared variable. (by (a) and (3) and 
elimination) 


37. The treasure is buried under the flagpole. 
One explanation: 
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i; 


2: 


The treasure is not in the kitchen. (by (c) and (a) and 
modus ponens) 

The tree in the front yard is not an elm. (by (b) and 
(1) and modus tollens) 

The treasure is buried under the flagpole. 

(by (d) and (2) and elimination) 


2.3 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


A-12 


38. a. Ais a knave and B is a knight. 
One explanation: 


1 
2 
3: 
4. 
5 
6 


10. 


Suppose A is a knight. 

.. What A says is true. (by definition of knight) 

.. B is a knight also. (That’s what A said.) 

.. What B says is true. (by definition of knight) 

<. A is a knave. (That’s what B said.) 

.. We have a contradiction: A is a knight and a 
knave. (by (1) and (5)) 

.. The supposition that A is a knight is false. (by 
the contradiction rule) 

z. A is a knave. (negation of supposition) 

.. What B says is true. (B said A was a knave, 
which we now know to be true.) 

z. B is a knight. (by definition of knight) 


d. Hint: W and Y are knights; the rest are knaves. 


39. The chauffeur killed Lord Hazelton. 
One explanation: 
1. Suppose the cook was in the kitchen at the time of 


41. 


the murder. 
2. v. The butler killed Lord Hazelton with strychnine. 
(by (c) and (1) and modus ponens) 
3... We have a contradiction: Lord Hazelton was 
killed by strychnine and a blow on the head. 
(by (2) and (a)) 
4... The supposition that the cook was in the kitchen 
is false. (by the contradiction rule) 
5... The cook was not in the kitchen at the time of the 
murder. (negation of supposition) 
6... Sara was not in the dining room when the murder 
was committed. (by (e) and (5) and modus ponens) 
7... Lady Hazelton was in the dining room when 
the murder was committed. (by (b) and (6) and 
elimination) 
8. ~. The chauffeur killed Lord Hazelton. (by (d) and 
(7) and modus ponens) 
dd) p-t by premise (d) 
cep by premise (c) 
op by modus tollens 
D ~p by (1) 
sepya by generalization 
3) ~pvq>r by premise (a) 
~PV4 by (2) 
seal” by modus ponens 
(4) >p by (1) 
r by (3) 
PAF by conjunction 
6) ~pAr->~s by premise (e) 
~pAr by (4) 
os by modus ponens 
(6) sv~q by premise (b) 
ad) by (5) 
z~a by elimination 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


43. (1) ~w by premise (d) 16. P 
u/w by premise (e) Q AND 
U by elimination 
(2) u—>~p by premise (c) 
u by (1) R O 
op by modus ponens 
3) = i oe si — @) 18. a. (PAQA~R)V(~PAQAR) 
vehi NOS by modus ponens b. P - 
O rA~s by (3) Q Ps 
Sua, by specialization 
6) ~t>s by premise (b) R > O 
5s by (4) 
so~t by modus tollens > 
SECTION 2.4 
AND 
1 R=1 3. S=1 
5 20. a (PAQAR)V(PA~QAR)V(~PA ~QA ~R) 
Input Output b. 
P Q R ? 
1 1 1 
— ~ 
of ma DA 
0 0 1 it 
7. I o 
nput Output 
pees S 


22. The input/output table is 


(0) 1 1 1 
0 1 0 0 Input Output 
0 0 1 1 P Q R S 
0 0 0 0 1 1 1 0 
1 1 0 1 


paš 


= 
Pa 
D 
Q 
= 
=| ojo 
= | © 
o| S OOS 


ak 
=| oO 


ED Se 
o;|o 

j=) 

i=) 
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2.5 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-14 


One circuit (among many) having this input/output table (PAQ)VQ 
is shown below. =Qv(PAQ) by the commutative law for V 
P $ =(QVP)A(QvVQ) by the distributive law 
Q . AND =(QVP)AQ by the idempotent law 
=(PVQ)AQ by the commutative law for /\ 
E n> . Alternatively, by the absorption laws, both statement 


forms are logically equivalent to Q. 
30. (PAQ)V(~PAQ)V(~PA~Q) 


so: = (P ^ Q) v (~P ^ Q) v (~P A ~Q)) 
by inserting parentheses 


AND (which is legal by the 
associative law) 
o = (P ^ Q) v (~P ^ (Q v ~Q)) 
by the distributive law 
=(PAQ)v(~P^t) by the negation law for v 
=(PAQ)V ~P by the identity law for ^ 
24. Let P and Q represent the positions of the switches in =~PV(PAQ) a 


the classroom, with 0 being “down” and 1 being “up.” (~PVP)A(~PVO) bythe distributive law 
Let R represent the condition of the light, with 0 being = (Pv ~P) \(~P\v Q) 


“off” and 1 being “on.” Initially, P = Q = 0 and R = 0. 
If either P or Q (but not both) is changed to 1, the light 


by the commutative law for v 


turns on. So when P = 1 and Q = 0, then R = 1, and SEAS) O tay ay 
when P = 0 and Q = 1, then R = 1. Thus when one =(~PvQ)At by the commutative law for ^ 
switch is up and the other is down the light is on, and =~PvQ by the identity law for ^ 
hence moving the switch that is down to the up position 
turns the light off. So when P = 1 and Q = 1, then The following is, therefore, a circuit with at most two 
R = 0. It follows that the input/output table has the fol- logic gates that has the same input/output table as the 
lowing appearance: circuit corresponding to the given expression. 
Input Output P “> o 
P Q R ) on 
Q ——_] 
34. b. (P| QO) (P} O) 
=~(P | Q) by part (a) 
= ~[~(P v Q)] by definition of | 
0 1 1 =PvQ by the double negative law 


d. Hint: Use the results of exercise 13 of Section 2.2 and 
part (a) and (c) of this exercise. 


One circuit (among many) having this input/output table 


is the following: S ECTI ON 2.5 


1. 1949 = 16+2+1 = 10011, 


4. 45810 = 256+ 128 +64 + 8 +2 = 111001010, 


7. 1110, =8+4+2 = 144 


10. 1100101, = 64+32+4+ 1 = 101o 


P, i g 
i. 1 0 1 41, 
. ; + 1 0 hL 
26. The Boolean expression for (a) is (P ^A Q) v Q, and for = 
1 0 0 0 0 


(b) it is (P v Q) A Q. We must show that if these expres- 
sions are regarded as statement forms, then they are 
logically equivalent. Now 
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A-15 


19. 


21. 
23. 


25. 


27. 


29. 


31. 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 

TE 1 32. 6219 = (32+ 16+8+4+42)i9 

EO p E = 111110, — 00111110 
t - ; ; . > |-18| 19 = (16 + 2)19 = 00010010 

z l 
i flip the bits 11101101 

1 10 %1 ade, 11101110 
ee Thus the 8-bit two’s complements of 62 and —18 are 
=- 110 hL 00111110 and 10110111. Adding the 8-bit two’s comple- 

i 1 bk ments in binary notation gives 
0 w 

101 7 0 1, oWO TETP E 1450 
— a “aaa 

1 1 O 1 Q 
M tore Toh lols 

23| 49 = 2310 = (16+4+24 1)io = 00010111, F 
flip the bits addi : Truncating the 1 in the 2°th position gives 00101100. 
> 11101000 ——>11101001. So the answer is Since the leading bit of this number is a 0, the answer is 

11101001. positive. Converting back to decimal form gives 
|-4]| 10 = 410 = 00000100, MPTP" 11111011 00101100 —> 101100, = (32 +8 + 4)jo = 4440. 
add 1 


=> 11111100. So the answer is 11111100. 


Because the leading bit is 1, this is the 8-bit two’s 


complement of a negative integer. 11010011 


flip the bits 00101100 244 4. 00101 101, = 


(32 +8+4+1),9 =|—45| 10. So the answer is —45. 


Because the leading bit is 1, this is the 8-bit two’s 


complement of a negative integer. 11110010 


flip the bits 00001101 444 00001110, = 


(8 +4 + 2) =|—14] 0. So the answer is —14. 


5719 = (32+16+841)49 = 111001, 00111001 


|-118| 19 = (64+32+ 16 +4 + 2)iọ = 01110110, 


Hip the Bits. 10001001 “44S 10001010. So the 8-bit 


two’s complements of 57 and —118 are 00111001 and 
10001010. Adding the 8-bit two’s complements in 
binary notation gives 


0o;o0;1)]1}1);0;];0) 1 


1;0;0;0;1],0;1/]0 


1; 1;0);0;,0);0; 141 


Since the leading bit of this number is a 1, the answer is 
negative. Converting back to decimal form gives 


11000011 MP the bits. 00111100 2444S 00111101, 


= (32+ 1648444 Dig = |61 hio. 


So the answer is —61. 
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33. 


37. 


So the answer is 44. 


|-6|10 = (4+ 210 
flip the bits 


= 110, > 00000110 > 
dd 1 
11111001 > 11111010 
|-73|19 = (644+8+ 1)io 


= (1001001 flip the bits 


10110110 TOUN 10110111 


Thus the 8-bit two’s complements of —6 and -73 are 
11111010 and 10110111. Adding the 8-bit two’s comple- 
ments in binary notation gives 


trj EELO 140 


PO E L OT EL 


E E E 1] 1 | O70 ) 0 fd 


Truncating the 1 in the 28th position gives 10110001. 
Since the leading bit of this number is a 1, the answer is 
negative. Converting back to decimal form gives 


10110001 HP the bits < 01001110 2444s 010011113 


= (6448444241) = 799 = |-79| 10 


So the answer is —79. 


a. The 8-bit two’s complement of — 128 is computed as 
follows: 


|—128] 19 = 12819 = (210 


= 10000000, FHP the bits | 01111111 2444S 10000000. 


So the 8-bit two’s complement of — 128 is 10000000. 
If the two’s complement procedure is applied to this 


result, the following is obtained 


10000000 LP the bits. 01111111 $44 10000000. 


So the 8-bit two’s complement of the 8-bit two’s 
complement of — 128 is 10000000, which is the 8-bit 
two’s complement of — 128. 


38. A2BCi¢ = 10°16 +2-167 + 11-16 +12 = 41,660 49 
41. 0001110000001010101111105 
44. 2E x6 


47. a. 6-8'+1-89+5-87+0-84+2-1 = 25,4109 


SECTION 3.1 


1. a. False b. True 


2. a. The statement is true. The integers correspond 

to certain of the points on a number line, and the 
real numbers correspond to all the points on the 
number line. 

b. The statement is false; 0 is neither positive nor 
negative. 

c. The statement is false. For instance, let r = —2. 
Then —r = —(—2) = 2, which is positive. 

d. The statement is false. For instance, the number 4 is 
a real number, but it is not an integer. 


3. a. Whenm = 25 andn = 10, the statement “m is a fac- 
tor of n?” is true because n? = 100 and 100 = 4:25. 
But the statement “m is a factor of n” is false because 
10 is not a product of 25 times any integer. Thus the 
hypothesis of R(m, n) is true and the conclusion is 
false, so the statement as a whole is false. 

c. When m = 5 andn = 10, both statements “m is a 
factor of n?” and “m is a factor of n” are true because 
n = 10 = 5-20 = m-20. Thus both the hypothesis 
and conclusion of R(m, n) are true, and so the state- 
ment as a whole is true. 


4. a. Q(—2, 1) is the statement “If —2 < 1 then 
(2)? < 1°” The hypothesis of this statement 
is —2 < 1, which is true. The conclusion is 
(—2)? < 1’, which is false because (—2)* = 4 and 
1° = 1 and4 £ 1. Thus Q(—2, 1) is a conditional 
statement with a true hypothesis and a false conclu- 
sion. So Q(—2, 1) is false. 

c. QG, 8) is the statement “If 3 < 8 then 3 < 8°.” 
The hypothesis of this statement is 3 < 8, which is 
true. The conclusion is 37 < 87, which is also true 
because 37 = 9 and 8” = 64 and 9 < 64. Thus 


3.1 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-16 


16. 


17. 
18. 


19. 
20. 


21. 


22. 


23. 


Q(3, 8) is a conditional statement with a true hy- 
pothesis and a true conclusion. So Q(3, 8) is true. 


a. The truth set is the set of all integers d such 
that 6/d is an integer, so the truth set is 
{=6; =3;) =.= 1,1,2;:3;,6}. 

c. The truth set is the set of all real numbers x with 
the property that 1 = x? = 4, so the truth set is 
{x © R|-2 =x <—lorl <x = 2}. In other 
words, the truth set is the set of all real numbers 
between —2 and —1 inclusive together with those 
between 1 and 2 inclusive. 


a as {—9, —8,.—7, —6,,—5, —4, —3; —2, —1,,0,.1,.2, 3, 


4, 5, 6, 7, 8, 9} 
baa, bab, bac, bba, bbb, bbc, bca, bcb 


Counterexample: Let x = 1/2. Then + = Ta = 2, and 
1/2 + 2. (This is one counterexample among many.) 


Counterexample: Let m = 1 and n = 1. Then 
mn=1-1=landmt+n=1+4+1=2. But l 42, 
and so m:n > m+n. (This is one counterexample 
among many.) 


- @©,f) 14. b, ©, 6), €) 


a. Partial answer: Every rectangle is a quadrilateral. 
b. Partial answer: At least one set has 16 subsets. 


a. V dinosaur x, x is extinct. 
c. V irrational number x, x is not an integer. 
e. V integer x, x? does not equal 2, 147, 581, 953. 


a. Jan exercise x such that x has an answer. 


a. ds E D such that E(s) and M(s). (Or: 3s E D such 
that E(s) \ M(s).) 

b. Vs E D, if C(s) then E(s). (Or: Vs E D, C(s) > E (s).) 

e. (ds E D such that C(s) A E(s)) A (As E D such that 
C(s) A ~E(s)) 


(b), (d), (©) 


Partial answer: The square root of a positive real num- 


ber is positive. 


a. The total degree of G is even, for any graph G. 
c. pis even, for some prime number p. 


a. Vx, if xis a Java program, then x has at least 5 lines. 


a. Vx if x is an equilateral triangle, then x is isosceles. 
V equilateral triangles x, x is isosceles. 
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24. 


25. 


26. 


27. 


28. 


30. 


31. 


32. 


33. 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


a. J a hatter x such that x is mad. 
4x such that x is a hatter and x is mad. 
a. V nonzero fraction x, the reciprocal of x is a fraction. 


Vx, if x is a nonzero fraction then the reciprocal of x 
is a fraction. 
c. V triangle x, the sum of the angles of x is 180°. Vx, if 
x is a triangle then the sum of the angles of x is 180°. 
e. V even integers x and y, the sum of x and y is even. 
Vx and y, if x and y are even integers then the sum of 
x and y is even. 


b. Vx(Int(x) = Ratl(x)) A dx(Ratl(x) A ~Int(x)) 


a. False. Figure b is a circle that is not gray. 
b. True. All the gray figures are circles. 


b. One answer among many. If a real number is nega- 
tive, then when its opposite is computed, the result 
is a positive real number. 

This statement is true because for each real 
number x, —(—|x|) = |x| (and any negative real 
number can be represented as — |x|, for some real 
number x). 

d. One answer among many: There is a real number 
that is not an integer. This statement is true. For 
instance, 5 is areal number that is not an integer. 


b. One answer among many. If an integer is prime, 
then it is not a perfect square. 

This statement is true because a prime number 
is an integer greater than | that is not a product of 
two smaller positive integers. So a prime number 
cannot be a perfect square because if it were, it 
would be a product of two smaller positive integers. 


Hint: Your answer should have the appearance shown in 
the following made-up example: 

Statement: “If a function is differentiable, then it is 
continuous.” 

Formal version: V function f, if f is differentiable, then f 
is continuous. 


Citation: Calculus by D. R. Mathematician, Best Pub- 
lishing Company, 2019, page 263. 


a. True: Any real number that is greater than 2 is 
greater than 1. 
c. False: (—3) > 4 but —3 +2. 


a. True. Whenever both a and b are positive, so is their 
product. 

b. False. Let a = —2 and b = —3. Then ab = 6, which 
is not less than zero. 
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SECTION 3.2 


1. 
3. 


(a) and (e) are negations. 


a. J a string s such that s does not have any characters. 
(Or: 3a string s such that s has no characters.) 

c. V movie m, m is less than or equal to 6 hours long. 
(Or: Y movie m, m is no more than 6 hours long.) 


In 4-6 there are other correct answers in addition to 
those shown. 


4. 


Tl. 


B. 


a. Some dogs are unfriendly. (Or: There is at least one 
unfriendly dog.) 

c. All suspicions were unsubstantiated. (Or: No suspi- 
cions were substantiated.) 


a. There is a valid argument that does not have a true 
conclusion. (Or: There is at least one valid argument 
that does not have a true conclusion.) 


a. Sets A and B have at least one point in common. 


a. This vertex is connected to at least one other vertex 
in the graph. (Or: There is at least one other vertex 
in the graph to which this vertex is connected.) (Or: 
This vertex is connected to some other vertex in the 


graph.) 


Ja real number x such that x > 3 and xX? < 9. 


The proposed negation is not correct. The given state- 
ment makes a claim about any two irrational numbers 
and means that no matter what two irrational numbers 
you might choose, the sum of those numbers will be 
irrational. For this to be false means that there is at least 
one pair of irrational numbers whose sum is rational. 
On the other hand, the negation proposed in the exercise 
(“The sum of any two irrational numbers is rational”) 
means that given any two irrational numbers, their sum 
is rational. This is a much stronger statement than the 
actual negation: The truth of this statement implies the 
truth of the negation (assuming that there are at least 
two irrational numbers), but the negation can be true 
without having this statement be true. 


Correct negation: There are at least two irrational num- 
bers whose sum is rational. 


Or: The sum of some two irrational numbers is rational. 


The proposed negation is not correct. There are two 
mistakes: The negation of a “for every” statement is not 
a “for every” statement; and the negation of an if-then 
statement is not an if-then statement. 


. . . be 
Correct negation: There exists an integer n such that n° 
is even and n is not even. 


15. 


16. 


18. 


20. 


22 


24. 


25. 


26 


28 


30 


a. True: All the odd numbers in D are positive. 
c. False: x = 16, x = 26, x = 32, and x = 36 are all 
counterexamples. 


Ja real number x such that x? = 1 and x + 0. In other 
words, J a real number x such that xX =landx <0. 


da real number x such that x(x + 1) > 0 and both x = 0 
and x = —1. 


Jintegers a, b, and c such that a — b is even and b — c 
is even and a — c is not even. 


There is an integer with the property that the square 
of the integer is odd but the integer itself is not odd. 
(Or: At least one integer has an odd square but is not 
itself odd.) 


a. Ifa person is a child in Tom’s family, then the per- 
son is female. 
If a person is a female in Tom’s family, then the 
person is a child. 
The second statement is the converse of the first. 

a. Converse: If n+ 1 is an even integer, then n is a 
prime number that is greater than 2. 
Counterexample: Let n = 15. Then n + 1 = 16, 
which is even but n is not a prime number that is 
greater than 2. 

Statement: V real number x, if xX = 1 then x > 0. 


Contrapositive: Y real number x, if x = 0 then x* < 1. 
Converse: V real number x, if x > 0 then Val 
Inverse: V real number x, if xX? < 1 then x <0. 


The statement and its contrapositive are false. As a 

counterexample, let x = —2. Then = (-2)° = 4, and 

so x” = 1. However x + 0. 

The converse and the inverse are also false. As a coun- 

terexample, let x = 1/2. Then = 1/4, and so x > 0 but 
2 

xe, 


Statement: Yx E R, if x(x + 1) > 0 then x > 0 or 
HS = 1, 


Contrapositive: Vx E R, if x = 0 and x = —1, then 
x(xt+1) 50. 


Converse: Yx E R, if x > Oor x < —1 then x(x + 1) > 0. 
Inverse: Yx E R, if x(x + 1) < 0 then x = 0 and x = —1. 


The statement, its contrapositive, its converse, and its 
inverse are all true. 


Statement: V integers a, b, and c, if a — b is even and 
b —c is even, then a — c is even. 


3:2 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


32 


34. 


36 


37. 


39 


41 


43 


45 


. 
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Contrapositive: V integers a, b, and c, if a — c is not even, 
then a — b is not even or b — c is not even. 


Converse: V integers a, b, and c, if a — c is even then 
a— bis even and b — c is even. 


Inverse: V integers a, b, and c, if a — b is not even or 
b—c is not even, then a — c is not even. 


The statement is true, but its converse and inverse 

are false. As a counterexample, let a = 3, b = 2, and 

c = 1. Then a—c = 2, which is even, but a— b = 1 and 
b—c = 1, so it is not the case that both a — b and b — c 
are even. 


Statement: If the square of an integer is odd, then the 
integer is odd. 


Contrapositive: If an integer is not odd, then the square 
of the integer is not odd. 


Converse: If an integer is odd, then the square of the 
integer is odd. 


Inverse: If the square of an integer is not odd, then the 
integer is not odd. 


The statement, its contrapositive, its converse, and its 
inverse are all true. 


a. Ifnis divisible by some prime number between 1 


and Vn inclusive, then n is not prime. 


One possible answer: Let P(x) be “2x # 1.” The 
statement “Vx E Z, 2x # 1” is true because there is 
no integer which, when doubled, equals 1. But the 
statements “Vx E Q, 2x # 1” and “Vx E R, 2x # 1” 
are both false because x = 1/2 satisfies the equation 
2x = 1 and 1/2 is in both R and Q. 


The claim is “Vx, if x = 1 and x is in the sequence 
0204, then x is to the left of all the 0’s in the sequence.” 


The negation is “gx such that x = 1 and x is in the se- 
quence 0204, and x is not to the left of all the 0’s in the 
sequence.” The negation is false because the sequence 
does not contain the character 1. So the claim is vacu- 
ously true (or true by default). 


If a person earns a grade of C— in this course, then the 
course counts toward graduation. 


If a person is not on time each day, then the person will 
not keep this job. 


If a number is prime, then it is greater than 1. 


To say that “Being divisible by 8 is a necessary condition 
for being divisible by 4” means that, “If a number is not 
divisible by 8 then that number is not divisible by 4. The 
negation is, “There is a number that is not divisible by 8 
and is divisible by 4.” 
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47. 


50. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


To say that “having a large income is a sufficient condi- 
tion for being happy” means that “If a person has a 
large income then that person is happy.” The negation 
is “There is a person who has a large income and is not 


happy.” 
No. Interpreted formally, the statement says, “If carriers 


do not offer the same lowest fare, then you may not 
select among them.” 


SECTION 3.3 


1. 


2. 


3. 


7. 


9. 


a. True: Tokyo is the capital of Japan. 
b. False: Athens is not the capital of Egypt. 


a. True: 27 >3 
b. False: 1741 


ages 
b. y=—1 


a. Letn = 16. Thenn > x because 16 > 15.83. 


The statement says that no matter what circle anyone 
might give you, you can find a square of the same color. 


Solution 1: The statement is true because the only circles 
in the Tarski world are a, b, and c, and given a or c, which 
are blue, square j is also blue, and given b, which is gray, 
squares g and h are also gray. 

Solution 2: The statement is true. The Tarski world has 
exactly three circles: a, b, and c. 


Choose square | Is y the same 


Given circle 


x= y= color as x? 
a j yes 7 
b gorh yes V 
c j yes V 


Solution 1: The statement is true because the Tarski 
world has exactly four squares: e, g, h, and j and triangle 
d is above all of them. 


Solution 2: The statement is true. The Tarski world has 
exactly four squares: e, g, h, and j. 


Choose Choose square 
triangle x = d y= Is x above y? 
e yes ¥ 
f yes ¥ 
h yes 7 
j yes ¥ 


a. There are five elements in D. For each, an element 
in E must be found so that the sum of the two equals 
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10. a. 


11. a. 


12. a. 


0. So: for x = —2, take y = 2; for x = —1, take 
y = 1; for x = 0, take y = 0;for x = 1, take y = —1; 
and for x = 2, take y = —2. 

Alternatively, note that for each integer x in 
D, the integer —x is also in D, including 0 (because 
—0 = 0), and for every integer x, x +(—x) = 0. 


True. Every student chose at least one dessert: Uta 
chose pie, Tim chose both pie and cake, and Yuen 
chose pie. 

This statement says that some particular dessert was 
chosen by every student. This is true: Every student 
chose pie. 


There is a student who has seen Casablanca. 

Every student has seen at least one movie. 

There is a movie that has been seen by every stu- 
dent. (There are many other acceptable ways to state 
these answers.) 


Negation: 3x in D such that Vy in E, x+y # 1. 
The negation is true. When x = —2, the only num- 
ber y with the property that x+y = 1 is y = 3, and 
3 is not in E. 


Negation: Vx in D, 3y in E such that x + y # —y. 


The negation is true because the original statement 
is false. To see that the original statement is false, 
take any x in D and choose y to be any number in 
E with y # — 2 Then 2y # —x, and adding x and 
subtracting y from both sides gives x+ y # —y. 


In 13-19 there are other correct answers in addition to 
those shown. 


13. a. 


15. a. 


Statement: For every color, there is an animal of 
that color. 

There are animals of every color. 

Negation: 4a color C such that V animal A, A is not 
colored C. 

For some color, there is no animal of that color. 


Statement: There is a book that every person has 
read. 

Negation: There is no book that every person 
has read. 

(Or: V book b, 3 a person p such that p has not 
read b.) 


Statement: For every odd integer n, there is an inte- 
ger k such that n = 2k + 1. 


Given any odd integer, there is another integer 

for which the given integer equals twice the other 
integer plus 1. Given any odd integer n, we can find 
another integer k so that n = 2k + 1. 


18. a. 


20. a. 


21. a. 


An odd integer is equal to twice some other integer 
plus 1. 


Every odd integer has the form 2k + 1 for some 
integer k. 


Negation: Jan odd integer n such that V integer k, 
n#2k+1. 

There is an odd integer that is not equal to 2k + 1 for 
any integer k. 

Some odd integer does not have the form 2k + 1 for 
any integer k. 


Statement: For every real number x, there is a real 
number y such that x + y = 0. 

Given any real number x, there exists a real number 
y such that x+ y = 0. 

Given any real number, we can find another real 
number (possibly the same) such that the sum of the 
given number plus the other number equals 0. 
Every real number can be added to some other real 
number (possibly itself) to obtain 0. 


Negation: Ja real number x such that V real number 
y,xty #0. 


There is a real number x for which there is no real 
number y with x+y = 0. 


There is a real number x with the property that 
x+y # 0 for any real number y. 


Some real number has the property that its sum with 
any other real number is nonzero. 


Statement (1) says that no matter what square anyone 
might give you, you can find a triangle of a different 
color. This is true because the only squares are e, g, h, 
and j, and given squares g and h, which are gray, you 
could take triangle d, which is black; given square 

e, which is black, you could take either triangle f or 

i, which are gray; and given square j, which is blue, 
you could take either triangle f or h, which are gray, 
or triangle d, which is black. In each case the chosen 
triangle has a different color from the given square. 


(1) The statement “Y real number x, 3 a real number 
y such that 2x + y = 7” is true. Given any real 
number x, take y to be 7 — 2x. 


(2) The statement “J a real number x such that Y 
real number y, 2x + y = 7” is false. If it were 
true, the single number x would equal = for 
every real number y, and that is impossible. 

Both statements (1) “V real number x, 3 a real 

number y such that x + y = y+ x” and (2) “Ja real 

number x such that V real number y, x+y = y+ x” 

are true. 


3.3. SOLUTIONS AND HINTS TO SELECTED EXERCISES 
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22. a. Given any real number, you can find a real number 


24. 


25. 


27 


29. 


31. 
32. 
33. 


34. 


so that the sum of the two is zero. In other words, 
every real number has an additive inverse. This 
statement is true. 

There is a real number with the following prop- 
erty: No matter what real number is added to it, 
the sum of the two will be zero. In other words, 
there is one particular real number whose sum 
with any real number is zero. This statement is 
false; no one number will work for all numbers. For 
instance, if x + 0 = 0, then x = 0, but in that case 
x+1=1#0. 


a. ~(Vx E D(Vy E E(P(x, y)))) 
= Jx E D(~(Vy E E(P(, y)))) 


= dx E DGy E E~ PQ, y))) 


This statement says that all of the circles are above all of 
the squares. This statement is true because the circles are 
a, b, and c, and the squares are e, g, h, and j, and all of a, 
b, and c lie above all of e, g, h, and j. 


Negation: There is a circle x and a square y such that x 
is not above y. In other words, at least one of the circles 
is not above at least one of the squares. 


The statement says that there are a circle and a square 
with the property that the circle is above the square and 
has a different color from the square. This statement is 
true. For example, circle a lies above square e and is dif- 
ferently colored from e. (Several other examples could 
also be given.) 


a. Version with interchanged quantifiers: 3x E R such 
that Vy E R, x < y. 

The given statement says that for any real number x, 
there is a real number y that is greater than x. This 

is true: For any real number x, let y = x + 1. Then 

x < y. The version with interchanged quantifiers 
says that there is a real number that is less than every 
other real number (including the negative ones). This 
is false. 


b. 


Y person x, 3 a person y such that x is older than y. 


Ja person x such that V person y, x is older than y. 


a. Formal version: V person x, 3 a person y such that x 
loves y. 

b. Negation: 3a person x such that V person y, x does 
not love y. In other words, there is someone who 
does not love anyone. 


a. Formal version: 3 a person x such that V person y, x 

loves y. 

b. Negation: V person x, 3 a person y such that x does 
not love y. In other words, everyone has someone 


whom they do not love. 
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37. a. Statement: V even integer n, J an integer k such that 
n = 2k. 
b. Negation: 3an even integer n such that V integer k, 
n # 2k. 


There is some even integer that is not equal to twice 
any other integer. 


39. a. Statement: Ja program P such that V question Q 
posed to P, P gives the correct answer to Q. 
b. Negation: Y program P, there is a question Q that 
can be posed to P such that P does not give the cor- 
rect answer to Q. 


40. a. V minutes m, Ja sucker s such that s was born in 
minute m. 


41. a. This statement says that given any positive integer, 
there is a positive integer such that the first integer is 
1 more than the second integer. This is false. Given 
the positive integer x = 1, the only integer with the 
property that x = y+1 is y = 0, and 0 is not a posi- 
tive integer. 

b. This statement says that given any integer, there 
is an integer such that the first integer is 1 more 
than the second integer. This is true. Given any 
integer x, take y = x— 1. Then y is an integer, and 
yt1l=Q@-)D+1=x. 

e. This statement says that given any real number, 
there is a real number such that the product of the 
two is equal to 1. This is false because 0-y = 0 # 1 
for every number y. So when x = 0, there is no real 
number y with the property that xy = 1. 

f. This statement is true because the real number 0 has 
the property that Vy E R,O+y=y. 


42. Je > 0 such that V integer N, 3 an integer n such that 
n> N and either L—¢ =a, or a, = L+ e. In other 
words, there is a positive number e such that for ev- 
ery integer N, it is possible to find an integer n that is 
greater than N and has the property that a, does not 
lie between L—eandL+e. 


44. a. This statement is true. The unique real number with 
the given property is 1. Note that 
1-y = y for all real numbers y, 


and if x is any real number such that for instance, 
x:2 = 2, then dividing both sides by 2 gives 
x= 2/2=1. 


46. a. True. Both triangles a and c lie above all the 
squares. 
b. Formal version: 


x(Triangle(x) A (Vy(Square(y) — Above(x, y)))) 
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c. Formal negation: 
Vx(~(Triangle(x) A (Vy(Square(y) — Above(, y))))) 
= Vx(~Triangle(x) V ~(Vy (Square(y) — Above(x, y)))) 


= Vx(~Triangle(x) V Gy (Square(y) A ~Above(x, y)))) 


48. a. False. There is no square to the right of circle k. 
b. Formal version: 


Vx(Circle(x) — (4y(Square(y)  RightOf(y, x)))) 


c. Formal negation: 


4dx(Circle(x) ^A ~(4y(Square(y) A RightOf(y, x)))) 


= 4x(Circle(x) A Vy(~Square(y) V ~RightOf(y, x) 


49. a. False. For example circle d is gray and there is no 
square that is colored gray. 
b. Formal version: 


Vx(Circle(x) — Jy(Square(y) A SameColor(y, x))) 


c. Formal negation: 


dx(~(Circle(x) — 4y(Square(y) A SameColor(y, x)))) 


= 4x(Circle(x) ^A ~(4y(Square(y) A SameColor(y, x)))) 


= 4x(Circle(x) ^A Vy(~(Square(y) A SameColor(y, x)))) 


= dx(Circle(x) A Vy(~Square(y)) V ~(SameColor(y, x))) 


51. a. True. Square e has the property that every triangle 
above it has the same color e because e is colored 
blue and the only triangles above e, namely a and c, 
are also colored blue. 
b. Formal version: 


Ax(Square(x) ^ (Vy(Triangle (y) A Above(y, X) 
— SameColor(y, x))) 

c. Formal negation: 
Vx(~ (Square(x) A (Vy((Triangle(y) A Above(y, xX) 
— SameColor(y, x))))) 


= Vx(~Square(x) V (~(Vy(Triangle(y) A Above(y, x) 
— SameColor(y, x))))) 


= Vx(~Square(x) V 4y(~ (Triangle(y) A Above(y, x) 


— SameColor(y, x)))) 


= Vx(~Square(x) V 4y((Triangle(y) ^ Above(y, x)) 
/\(~SameColor(y, x)))) 
53. a. True. Circle b and squares h and j are all colored 


black. 
b. Formal version: 


x(Circle(x) A Sy(Square(y)  SameColor(x, y))) 


55. 


59. 


c. Formal negation: 


Vx(~ Circle(x) ^A dy(Square(y) A SameColor(y, x))) 


= Vx(~ Circle(x) V ~(Gy(Square(y) A SameColor(y, x)))) 
= Vx(~ Circle(x) V Vy(~Square(y) V ~SameColor(y, x))) 


a. No matter what the domain D or the predicates 
P(x) and Q(x) are, the given statements have the 
same truth value. If the statement “Vx in D, 

(P(x) A Q(x)” is true, then P(x) A Q(x) is true for 
every x in D, which implies that both P(x) and Q(x) 
are true for every x in D. But then P(x) is true for 
every x in D, and also Q(x) is true for every x in 

D. So the statement “(Vx in D, P(x)) A (Vx, in D, 
Q(x))” is true. Conversely, if the statement “(Vx in 
D, P(x)) A (Vx in D, QQ)” is true, then P(x) is true 
for every x in D, and also Q(x) is true for every x in 
D. This implies that both P(x) and Q(x) are true for 
every x in D, and so P(x) ^A Q(x) is true for every x 
in D. Hence the statement “Vx in D, (P(x) A Q(x)” 
is true. 


a. Yes b. X = m, X = u C. X = bọ, X = u 


SECTION 3.4 


1. 


21. 


b. (E+E =f? +2EE+ 

c (Bu + 5v? = (Bw + 23u)(5v) + (Sv) 

d. (g(r) +8)? = (80)? + 280)gl) + (86)? 
0 is even. 


= (2:5 43-4) (= 22) 


2,4 
3t5 G5) 


l; a: 
— is not an irrational number. 


Invalid; converse error 


Valid by universal modus ponens (or universal instantia- 
tion) 


Invalid; inverse error 
Valid by universal modus tollens 
Invalid; converse error 


Vx, if x is a good car, then x is not cheap. 

a. Valid, universal modus ponens (or universal 
instantiation) 

b. Invalid, converse error 


Valid. (A valid argument can have false premises and a 
true conclusion!) 


3.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


23 


25 
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mortals 


mice 


people 


The major premise says the set of people is included in 
the set of mice. The minor premise says the set of mice 
is included in the set of mortals. Assuming both of these 
premises are true, it must follow that the set of people is 
included in the set of mortals. Since it is impossible for 
the conclusion to be false if the premises are true, the 
argument is valid. 


Valid. The major and minor premises can be diagrammed 
as follows: 


beings who 
occasionally 
make mistakes 


gods 


According to the diagram, the set of teachers and the 
set of gods can have no common elements. Hence, if the 
premises are true, then the conclusion must also be true, 
and so the argument is valid. 


Invalid. Let C represent the set of all college cafeteria 
food, G the set of all good food, and W the set of all 
wasted food. Then any one of the following diagrams 
could represent the given premises. 


Q 
Q 


© 
co 
© 
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28 


31. 


33. 


Only in drawing (1) is the conclusion true. Hence it is 
possible for the premises to be true while the conclusion 
is false, and so the argument is invalid. 


(3) Contrapositive form: If an object is gray, then it is a 
circle. 


(2) If an object is a circle, then it is to the right of all the 
blue objects. 


(1) If an object is to the right of all the blue objects, then 
it is above all the triangles. 


.. If an object is gray, then it is above all the triangles. 


4. If an animal is in the yard, then it is mine. 

1. If an animal belongs to me, then I trust it. 

5. If I trust an animal, then I admit it into my study. 
3 


. If I admit an animal into my study, then it will beg 
when told to do so. 


6. If an animal begs when told to do so, then that animal 
is a dog. 

2. If an animal is a dog, then that animal gnaws bones. 

.. If an animal is in the yard, then that animal gnaws 
bones; that is, all the animals in the yard gnaw bones. 


If a bird is in this aviary, then it belongs to me. 


2. 

4. Ifa bird belongs to me, then it is at least 2.75 meters high. 
1. If a bird is at least 2.75 meters high, then it is an ostrich. 
3. 


If a bird lives on mince pies, then it is not an ostrich. 


Contrapositive: If a bird is an ostrich, then it does not 
live on mince pies. 


.. If a bird is in this aviary, then it does not live on 
mince pies; that is, no bird in this aviary lives on 
mince pies. 


SECTION 4.1 


1. 


a. Yes: —17 = 2(-9)+1 

b. No. 0 is even because 0 = 0-2. 

c. Yes: 2k—1 = 2(k—1)+1 and k-—1 is an integer 
because it is a difference of integers. 


a. Yes: 6m + 8n = 2(3m + 4n) and (3m + 4n) is an 
integer because 3, 4, m, and n are integers, and 
products and sums of integers are integers. 

b. Yes: 10mn+7 = 2(5mn + 3)+ 1 and 5mn+3 is an 
integer because 3, 5, m, and n are integers, and prod- 
ucts and sums of integers are integers. 

c. Not necessarily. For instance, if m = 3 and n = 2, 


then m-n? =9-4= 5, which is prime. (However, 
2: 
m — nr is composite for many values of m and n 


because of the identity m-n = (m—n)\(m +n), 
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10. 


12. 


14. 


17. 


19. 


21. 


23. 


25. 


27. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


For example, let m = n = 2. Then m and n are integers 


l1 1 1 1 
such that m > 1 and n > 1 and— +- =-+-=1, 
m n 2 2 


which is an integer. 


For example, let n = 7. Then n is an integer such that 
n > 5and2”— 1 = 127, which is prime. 


For example, 25, 9, and 16 are all perfect squares, 
because 25 = 5°, 9 = 3°, and 16 = 4°, and 25 = 9 +16. 
Thus 25 is a perfect square that can be written as a sum 
of two other perfect squares. 


a. Negation for the statement: There exist real num- 
bers a and b such that a < banda’ £ b°. 

b. Counterexample for the statement: Let a = —2 and 
b = —1. Then a < b because —2 < —1, but ath 
because (—2)? = 4 and (—1)? = 1 and 4 £ 1. [So the 
hypothesis of the statement is true and its conclusion 
is false.] 


Counterexample: Let m = 2 andn = 1. Then 
2m+n=2:24+1=5, 
which is odd. But m is not odd, and so it is false that 


both m and n are odd. [This is one counterexample among 
many. ] 


This property is true for some integers and false for 
other integers. For instance, if a = 0 and b = 1, the 
property is true because (0 + 1? = 0°+ 1, butifa = 1 
and b = 1, the property is false because (1 + 1? =4 
and 17+ 1? = 2 and 4 # 2. 


Hint: This property is true for some integers and false 
for other integers. To justify this answer you need to 
find examples of both. 


2=VP4HP4=27,6=V74P4V, 

8 = 2+2, 10 = 37+ 17, 12 = 27427427, 
14 = 37+2°+17, 16 = 4, 

18 = 3+3? = 4+ 1? + 1°, 20 = 4+2, 
22 = 3+3 +2, 24 = 8+2 +2? 


a. Ifan integer is greater than 1, then its reciprocal is 
between 0 and 1. 


b. Start of proof: Suppose m is any integer such that 
m > 1. Conclusion to be shown: 0 < 1/m < 1. 


a. Ifthe product of two integers is 1, then either both 
are 1 or both are —1. 

b. Start of proof: Suppose m and n are any integers 
with mn = 1. Conclusion to be shown: m = n= 1 
orm=n=-—1l. 


Hint: (b) 2k +1 


28. a. 


30. 


V integers m and n, if m and n are odd then m + n is 
odd. 


Y odd integers m and n, m + n is odd. 


If m and n are any odd integers, then m + n is odd. 
b. (a) definition of odd, (b) substitution, (c) any sum of 
integers is an integer, (d) definition of even 


a. V integers m and n, if m is even and n is odd, then 
m+n is odd. 
V even integers m and odd integers n, m + n is odd. 


If m is any even integer and n is any odd integer, 
then m +n is odd. 
b. (a) any odd integer 


(b) integer r 
(c) 2r+(2s+ 1) 
(d) m+n is odd 


SECTION 4.2 


1. 


Proof: Suppose n is any [particular but arbitrarily chosen] 
odd integer. 


[We must show that 3n + 5 is even. By definition of even, this 
means we must show that 3n + 5 = 2- (some integer).] 


By definition of odd, n = 2r + 1 for some integer r. 
Then 


3n+5 = 3(2r+1)+5 by substitution 


=6r+3+5 
=6r+8 
= (3r +4) by algebra. 


[Idea for the rest of the proof: We want to show that 

3n +5 = 2- (some integer). At this point we know that 
3n +5 = (3r + 4). So is 3r + 4 an integer? Yes, because 
products and sums of integers are integers.] 

Let k = 3r +4. 


Then 3n +5 = 2(3r + 4) = 2k, and k is an integer be- 
cause products and sums of integers are integers. Hence 
3n + 5 is even by definition of even. 


Hint: To show that an integer is odd, you need to show 
that it equals 2- (some integer) + 1. 


Two versions of a correct proof are given below to il- 
lustrate some of the variety that is possible. 


Proof 1: Suppose a is any even integer and b is any odd 
integer. [We must show that a — b is odd.] By definition 
of even and odd, a = 2r and b = 2s + 1 for some inte- 
gers r and s. By substitution and algebra, 


a—b=2r—(2s4+ 1) = 2r—2s—-1 =2(r—s—-1)+1. 


4.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


7. 


10. 


12. 


15. 


16. 
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Lett = r—s—1. Then ris an integer because differenc- 
es of integers are integers. Thus a— b = 2t + 1, where 

t is an integer, and so, by definition of odd, a — b is odd 
[as was to be shown]. 


Proof 2: Suppose a is any even integer and b is any 
odd integer. By definition of even and odd, a = 2r and 
b = 2s + 1 for some integers r and s. Then 


a—b=2r—-(Qs+1)=2(r—-s—1)+1. 


Now r—s-— l is an integer because differences of inte- 
gers are integers, and so a — b equals twice some integer 
plus 1. Thus a — b is odd. 


Proof: Suppose k is any odd integer and m is any even 
integer. [We must show that k + n? is odd.] By defini- 
tion of odd and even, k = 2a + 1 and m = 2b for some 
integers a and b. Then 
+m = (a+ 1)? + (2b) 
= 4a’ +4at+1+4b° 
=4@+a+b+1 
= 2(2@° + 2a +2b?) +1 


by substitution 


by algebra. 


But 2a? + 2a + 2b° is an integer because it is a sum of 
products of integers. Thus K + m? is twice an integer plus 
1, and so K? + m? is odd [as was to be shown]. 


Hint: It is convenient to represent two consecutive inte- 
gers as n and n + 1 or as n— 1 and n for some integer n. 


Proof: Suppose n is any even integer. Then n = 2k for 
some integer k. Hence 
GY = CD*= (CDs 1*= 1 


[by the laws of exponents from algebra]. This is what was 
to be shown. 


To prove the given statement is false, we prove that its 
negation is true. 


The negation of the statement is “For every integer 

m = 3, m° — 1 is not prime.” 

Proof of the negation: Suppose m is any integer with 

m = 3. By basic algebra, m—-l= (m— 1)(m + 1). Be- 
cause m = 3, both m — 1 and m + 1 are positive integers 
greater than 1, and each is smaller than m —1.Som-—1 
is a product of two smaller positive integers, each greater 
than 1, and hence m? — 1 is not prime. 


The incorrect proof just shows the theorem to be true in 
the one case where k = 2. A real proof must show that it 
is true for every integer k > 0. 


The mistake in the “proof” is that the same symbol, 
k, is used to represent two different quantities. By 
setting m = 2k and n = 2k + 1, the proof implies that 
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17. 


20. 


21. 


22. 


24. 


28. 
32. 


34. 
37. 
40. 


n = m+ 1, and thus it deduces the conclusion only 

for this one situation. When m = 4 and n = 17, for 
instance, the computations in the proof indicate that 
n—m = 1, but actually n — m = 13. In other words, the 
proof does not deduce the conclusion for an arbitrarily 
chosen even integer m and odd integer n, and hence it is 
invalid. 


This incorrect proof assumes what is to be proved. The 
word since in the third sentence is completely unjusti- 
fied. The second sentence tells only what happens if 

K +2k+1 is composite. But at that point in the proof, 
it has not been established that K? + 2k + 1 is composite. 
In fact, that is exactly what is to be proved. 


True. Proof: Suppose m and n are any odd integers. 
[We must show that mn is odd.] By definition of odd, 
n = 2r+1andm = 2s + 1 for some integers r and s. 
Then 


mn = (2r+1)(2s+ 1) 
= 4rs+2r+2s+1 
=2(2rstr+s)+1 


by substitution 


by algebra. 


Now 2rs +r + s is an integer because products and sums 
of integers are integers and 2, r, and s are all integers. 
Hence mn = 2-(some integer) + 1, and so, by definition 
of odd, mn is odd. 


Hint: You will need to express an integer of the form 
—(2k + 1) as 2(some integer) + 1. 


False. Counterexample: Let a = 1 and b = 0. Then 
4a+5b+3 = 4-1+5-:0+3 = 7, which is odd. [This is 
one counterexample among many. Can you find a way to 
characterize all counterexamples?] 


False. Counterexample: Let m = 1 and n = 3. Then 
m+n = 4is even, but neither summand m nor sum- 
mand n is even. 


Hint: The statement is true. 


Proof: Suppose n is any integer. Then 
A(n? +n+1)—3n? = 4n? + 4n+4—-3n7 
=n+4n+4=(n+2) 


(by algebra). Now (n + 2} isa perfect square because 
n+ 2 is an integer (being a sum of n and 2). Hence 
A(n? +n+1)—3n*isa perfect square, as was to be 
shown. 


Hint: This is true. 
Hint: The statement is true. 


Hint: The answer is no. 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


SECTION 4.3 


1. 


T1. 


12. 


13. 


-35 _ -35 
6 6 

4,2 _4:9+2-5 _ 46 
5195 45 ~ 45 


Let x = 0.3737373737.... 
Then 100x = 37.37373737...., and so 
100x — x = 37.37373737 ... — 0.3737373737... 


Thus 99x = 37, and hence x = a. 


e Let x = 320.5492492492.... Then 


10000x = 3205492.492492492...., and 
10x = 3205.492492492..., and so 
10000x — 10x = 3205492 — 3205. 


Thus 9990x = 3202287, and hence x = 3202287 


9990 ° 


b. V real numbers x and y, if x # 0 and y # 0 then 
xy # 0. 


Given that a and b are integers, both b — a and ab? 

are integers (since differences and products of inte- 
gers are integers). Also, by the zero product prop- 

erty, ab” + 0 because neither a nor b is zero. Hence 

(b —a)/ab” is a quotient of two integers with a nonzero 
denominator, and so it 

is rational. 


Proof: Suppose n is any [particular but arbitrarily chosen] 
integer. Then n = n: 1, and son = n/1 by dividing both 
sides by 1. Now n and 1 are both integers, and 1 # 0. 
Hence n can be written as a quotient of integers with a 
nonzero denominator, and so n is rational. 


(a) any [particular but arbitrarily chosen] rational 
number 

(b) integers a and b 

(©) (a/b? 

(d) b° 

(e) zero product property 

(f) r° is rational 


a. V real number r, if r is rational then —r is rational. 
Or: Yr, if r is a rational number then —r is rational. 
Or: V rational number r, —r is rational. 


b. The statement is true. Proof: Suppose r is a [particu- 
lar but arbitrarily chosen] rational number. [We must 
show that —r is rational.] By definition of rational, 

r = a/b for some integers a and b with b # 0. Then 


by substitution 


by algebra. 


Now since a is an integer, so is —a (being the prod- 
uct of —1 and a). Hence —r is a quotient of integers 
with a nonzero denominator, and so —r is rational 
[as was to be shown]. 


15. Proof: Suppose r and s are rational numbers. By defini- 


tion of rational, r = a/b and s = c/d for some integers 
a, b, c, and d with b # 0 and d # 0. Then 


by substitution 


ac 
a bd by the rules of algebra for multiplying fractions. 
Now ac and bd are both integers (being products of 
integers) and bd # 0 (by the zero product property). 
Hence rs is a quotient of integers with a nonzero denom- 
inator, and so, by definition of rational, rs is rational. 


16. Hint: Counterexample: Let r be any rational number and 
s = 0. Then r and s are both rational, but the quotient of 
r divided by s is not a real number and therefore is not a 
rational number. 


Revised statement to be proved: For all rational num- 
bers r and s, if s # 0 then r/s is rational. 


17. Hint: The conclusion to be shown is that a certain quan- 
tity (the difference of two rational numbers) is rational. 
To show this, you need to show that the quantity can 
be expressed as a ratio of two integers with a nonzero 
denominator. 


.  alb+c/d  (ad+be)/(bd) — adt+be 
18. Hint: 7 = 7 = od 


19. Hint: Ifa < b thena+a < a+b (by T19 of Appendix 
A), or equivalently, 2a < a + b. Thus a < Î = b (by T20 


of Appendix A). 


21. True. Proof: Suppose m is any even integer and n is 
any odd integer. [We must show that m + 3n is odd.] By 
properties 1 and 3 of Example 4.3.3, m is even (because 
m = m-m) and 3n is odd (because both 3 and n are odd). 
It follows from property 5 [and the commutative law for 
addition] that m? + 3nis odd [as was to be shown]. 


24. Proof: Suppose r and s are any rational numbers. By 
Theorem 4.3.1, both 2 and 3 are rational, and so, by 
exercise 15, both 27 and 3s are rational. Hence, by 
Theorem 4.3.2, 2r + 3s is rational. 


27. Let 
1 1 1 
grt grt gn grt grt =j 
x= = = ‘ = 
i 1 1 I grt gi 
2 2 2 


4.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-26 


31. 


33. 


35. 
37. 


Now 2”*! — 1 and 2” are both integers (since n is a 


nonnegative integer) and 2” # 0 by the zero product 
property. Therefore, x is rational. 


Proof: Suppose c is a real number such that 
Rc + rc +rictro = 0, 


where ro, F1, 72, and r3 are rational numbers. By defini- 
tion of rational, rọ = do/bo, rı = a/b), r2 = a/b», and 
r3 = az/b, for some integers, do, a), 42, 43, and nonzero 
integers bo, b4, b2, and b3. By substitution, 


eA 
nc tre trie tro 


bob b2a3 3 bob b3a2 2 bobrb3a, fi bi bxb3a9 


bobibabs bobibybs°bobybabsbyb bab 
= (), 


Multiplying both sides by bob, byb3 gives 
bob\b2a3 * e F bobıb3az' e Bg bobrb3a, F bıbab3a9 =0. 


Let n; = bob, b2a3, n = bob b3a, ny = bobrb3a), and 

ng = bybyb3d9. Then no, ny, m, and n; are all integers (be- 

ing products of integers). Hence c satisfies the equation 
nc + me +nıc +n = 0, 


where all of no, 1), 22, and n; are integers. This is what 
was to be shown. 


a. Hint: Note that (x — r(x — s) = x? — (r + s)x + rs. If 
both r and s are odd, then r + s is even and rs is odd. 
So the coefficient of x? is 1 (odd), the coefficient of x 
is even, and the constant coefficient, rs, is odd. 


This “proof” assumes what is to be proved. 


By setting both r and s equal to a/b, this incorrect 
proof violates the requirement that r and s be arbitrarily 
chosen rational numbers. If both r and s equal a/b, then 
r=s. 


SECTION 4.4 


1. 


7. 


Yes, 52 = 13-4 

Yes, 56 = 7-8 

Yes, (3k + 1)(3k + 2)(3k + 3) = 3[(3k + 1)(3k + 2) 
(k+ 1)], and (3k + 1)(3k + 2)(k + 1) is an integer be- 


cause k is an integer and sums and products of integers 
are integers. 


No, 29/3 = 9.67, which is not an integer. 
Yes, 66 = (—3)(—22). 
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10. 


12. 


14. 


15. 


16. 


18. 


19. 


20. 


22. 


Yes, 6a(a + b) = 3a[2(a + b)], and 2(a + b) is an integer 
because a and b are integers and sums and products of 
integers are integers. 


No, 34/7 = 4.86, which is not an integer. 


Yes, a —1=(4k +1" -1= (16K + 8k+1)-1= 

16k + 8k = 8(2k7 +k), and 2k +k is an integer because 
k is an integer and sums and products of integers are 
integers. 


(a) alb 
(b) b=a-r 
(c) =r 
(d) a |(—b) 


Proof: Suppose a, b, and c are any integers such that a |b 
and alc. [We must show that a|(b + c).] By definition of 


divides, b = ar and c = as for some integers r and s. 
Then 


b+c=ar+as=a(rts) by algebra. 


Let t = r+ s. Then ris an integer (being a sum of inte- 
gers), and thus b + c = at where t is an integer. By defi- 
nition of divides, then, a|(b + c) [as was to be shown]. 


Hint: The conclusion to be shown is that a certain quan- 
tity is divisible by a. To show this, you need to show that 
the quantity equals a times some integer. 


a. V integers n if n is a multiple of 3 then —n is a mul- 
tiple of 3. 

b. The statement is true. Proof: Suppose n is any inte- 

ger that is a multiple of 3. [We must show that —n is 
a multiple of 3.] By definition of multiple, n = 3k for 
some integer k. Then 
—n = —(k) 
= 3(-h) 


by substitution 


by algebra. 


Now —k is an integer because k is. Hence, by defini- 
tion of multiple, —7 is a multiple of 3 /as was to be 
shown]. 


Counterexample: Let a = 2 and b = 1. Then 
at+tb=2+1=3, and so 3|(a+b) because 3 = 3-1. 
On the other hand, a— b = 2—1 = 1, and 3/1 because 
1/3 is not an integer. Thus 3Ka — b). [So the hypothesis 
of the statement is true and its conclusion is false.] 


Hint: The consecutive integers can be conveniently 
represented as n — 1, n, and n + 1 orasn,n+1,n+2, 
where n is an integer. 


Hint: The given statement can be rewritten formally as 
“V integers n, if n is divisible by 6 then n is divisible by 
2.” This statement is true. 
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24. 


25. 


26. 
27. 
32. 


36. 


37. 
38. 


40. 


41. 


42. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


The statement is true. Proof: Suppose a, b, and c are 
any integers such that a|b and a|c. [We must show that 
a|(2b — 3c).] By definition of divisibility, we know that 
b = am and c = an for some integers m and n. It follows 
that 2b — 3c = 2(am) — 3(an) (by substitution) = 

a(2m — 3n) (by basic algebra). Let t = 2m — 3n. Then 

t is an integer because it is a difference of products of 
integers. Hence 2b — 3c = at, where t is an integer, and 
so a|(2b — 3c) by definition of divisibility fas was to be 
shown]. 


The statement is false. Counterexample: Let 
a=2,b=8,andc = 8. Then a is a factor of c because 
8 = 2-4 and b is a factor of c because 8 = 1-8, but 

ab = 16 and 16 is not a factor of 8 because 8 # 16-k for 
any integer k since 8/16 = 1/2. 


Hint: The statement is true. 
Hint: The statement is false. 


No. Each of these numbers is divisible by 3, and so their 
sum is also divisible by 3. But 100 is not divisible by 3. 
Thus the sum cannot equal $100. 


a. The sum of the digits is 54, which is divisible by 
9. Therefore, 637,425,403,705,125 is divisible by 
9 and hence also divisible by 3 (by transitivity of 
divisibility). Because the rightmost digit is 5, then 
637,425,403,705,125 is divisible by 5. And because 
the two rightmost digits are 25, which is not divisible 
by 4, then 637,425,403,705,125 is not divisible by 4. 


a. 1,176 = 2-3-7 


a. 8,424 = 2°-3*-13 

c. Hint: The answer is no. Note that each factor of 10 
is comprised of a factor of 2 and a factor of 5. 

d. Hint: The answer is 26. Note that in order for 
8,424-m to be a perfect square, each prime factor 
must be raised to an even power. 


a. Because 12a = 25b, the unique factorization theo- 
rem guarantees that the standard factored forms of 
12a and 25b must be the same. Thus 25b contains 
the factors 27-3 (=12). But since neither 2 nor 3 
divides 25, the factors 27-3 must all occur in b, 
and hence 12|b. Similarly, 12a contains the factors 
= 25, and since 5 is not a factor of 12, the factors 
5° must occur in a. So 25|a. 


Hint: 458-88 = (37-5)8- (27-11) = 3!6- 58-215- 115, How 
many factors of 10 does this number contain? 


a. 6! =6:5-4:3-2:1 = 2:3+5+2+2-3-2 = 24-37-5 


43. Hint: There are 108 mathematics students and 120 


computer science students at the university. 


44. 


47. 


Proof: Suppose n is a nonnegative integer whose decimal 
representation ends in 0. Then n = 10m +0 = 10m for 
some integer m. Factoring out a 5 yields n = 10m = 5(2m), 
and 2m is an integer since m is an integer. Hence 10m is 
divisible by 5, which is what was to be shown. 


Hint: You may take it as a fact that for any positive 
integer k, 
10° = 99 ... 9+1;  thatis, 
kof these 
10% = 9-10! +9-108 7 +-+- 


+9-10'+9-10°+1. 


SECTION 4.5 


1. 
5. 
7. 
Tl. 


13. 


14. 


16. 


18. 


20. 


q=7,r=7 3. q=0,r= 36 
q= -5,r= 10 
a 4 b.7 


a. When today is Saturday, 15 days from today is two 
weeks (which is Saturday) plus one day (which is 
Sunday). Hence DayN should be 0. According to the 
formula, when today is Saturday, DayT = 6, and so 
when N = 15, 

DayN = (DayT + N) mod 7 
= (6+ 15) mod7 
= 21 mod 7 = 0, which agrees. 


Solution 1: 30 = 4-7 + 2. Hence the answer is two days 
after Monday, which is Wednesday. 


Solution 2: By the formula, the answer is (1 + 30) mod 
7 = 31 mod7 = 3, which is Wednesday. 


Hint: There are two ways to solve this problem. One is 
to find that 1,000 = 7-142 + 6 and note that if today is 
Tuesday, then 1,000 days from today is 142 weeks plus 
6 days from today. The other way is to use the formula 
DayN = (DayT + N) mod 7, with DayT = 2 (Tuesday) 
and N = 1,000. 


Because d|n, n = dq + 0 for some integer q. Thus the 
remainder is 0. 


Proof: Suppose n is any odd integer. By defini- 

tion of odd, n = 2q + 1 for some integer q. Then 

n= (QqtlyP =4¢ +4q+1=4¢ +q+1= 
4q(q + 1) + 1. By the result of part (a), the product 
q(q + 1) is even, so g(q + 1) = 2m for some integer m. 
1 =8m+1. 


Then, by substitution, n? =4-2m+4 


Because a mod 7 = 4, the remainder obtained when 
ais divided by 7 is 4, and so a = 7q + 4 for some 


4.5 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


23. 


A-28 


integer q. Multiplying this equation through by 5 gives 
that 5a = 35q+ 20 = 35q + 14+ 6 = 7(5q+2)+6. 
Because q is an integer, 5q + 2 is also an integer, and so 
5a = 7-(an integer) + 6. Thus, because 0 = 6 < 7, the 
remainder obtained when 5a is divided by 7 is 6, and so 
5amod7 = 6. 


Proof: Suppose n is any [particular but arbitrarily chosen] 
integer such that n mod 5 = 3. Then the remainder ob- 
tained when n is divided by 5 is 3, and son = 5q +3 for 
some integer g. By substitution, 


n = (5q +3) = 25q° + 30g+9 
= 25g + 30g+5+4 = 5(5¢ + 6q + 


1)+4. 


Because products and sums of integers are in- 
tegers, 5q +6q+1 is an integer, and hence 
n=5 (an integer) + 4. 

Thus, since 0 < 4 < 5, the remainder obtained when n? 
is divided by 5 is 4, and so n? mod 5 = 4. 


26. Hint: You need to show that (1) for each integer n and 


27. 


28. 


29. 


positive integer d, if n is divisible by d then n mod d = 0; 
and (2) for each integer n and positive integer d, if 
n mod d = 0 then n is divisible by d. 


Hint: Given any integer n, by the quotient-remainder 
theorem with divisor equal to 2, n = 2q, orn = 2q + 1 
for some integer q. 


a. Hint: Start by supposing that n, n + 1, and n + 2 are 
any three consecutive integers. Then use the quotient- 
remainder theorem to divide into three cases: 


Case 1 (n = 3q for some integer q). In this case you 
will show that n is a multiple of 3. 

Case 2 (n = 3q + 1 for some integer q). In this case 
you will show that n + 2 is a multiple of 3. 

Case 3 (n = 3q + 2 for some integer q). In this case 
you will show that n + 1 is a multiple of 3. 


Conclude that in all possible cases one of the inte- 
gers is a multiple of 3. 


a. Hint: Given any integer n, begin by using the 
quotient-remainder theorem to say that n can 
be written in one of the three forms: n = 3q, or 


n = 3q+1, orn = 3q+2 for some integer q. 
Then divide into three cases according to these 
three possibilities. Show that in each case either 
n° = 3k for some integer k, or n? = 3k+1 for 
some integer k. For instance, when n = 3q + 2, 
then n° = (3q+ 2) = 9 + 12g+4 = 

33g +4q+1)+1, and 3q + 4q + 1 is an integer 
because it is a sum of products of integers. 
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31. 


32. 


34. 


36. 


37. 


39. 


41. 


43. 


44, 
46. 


b. If m-n? = 56, then 56 = (m+n)(m—n). Now 
56 = 23-7, and by the unique factorization theorem, 
this factorization is unique. Hence the only repre- 
sentation of 56 as a product of two positive integers 
are 56 = 7:8 = 14-4 = 28-2 = 56-1. By part (a), 
m and n must both be odd or both be even. Thus the 
only solutions are either m+n = 14andm—n=4 
or m+n = 28 and m—n = 2. It follows that the 

9andn = 5 orm = 15 


only solutions are either m 
andn = 13. 


Under the given conditions, 2a — (b + c) is even. 

Proof: Suppose a, b, and c are any integers such that a — b 
is even and b — c is even. [We must show that 2a — (b +c) 
is even.] Note first that 2a—(b+c) = (a—b)+(a-c). 
Also note that (a— b)+ (b— c) is a sum of two even 
integers and hence is even by Example 4.3.3 #1. But 
(a—b)+(b—c) =a-—c, and so a~c is even. Hence 
2a — (b + c) is a sum of two even integers, and thus it is 
even /as was to be shown]. 


Hint: Express n using the quotient-remainder theorem 
with d = 3. 


Hint: Use the quotient-remainder theorem (as in Ex- 
ample 4.5.6) to say that n = 4q, n = 4q + 1, n = 4q +2, 
or n = 4q +3 and divide into cases accordingly. 


Hint: Given any integer n, consider the two cases where 
n is even and where n is odd. 


Hint: Use the quotient-remainder theorem to say 

that p must have one of the forms 6q, 6q + 1, 6q +2, 
6q + 3, 6q + 4, or 6q + 5 for some integer q. Then use 
the fact that p is prime and not equal to either 2 or 3 to 
show that you only need to consider two cases. 


Hint: There are four cases: Either x and y are both posi- 
tive, or x is positive and y is negative, or x is negative 
and y is positive, or both x and y are negative. 


Hint: Apply the triangle inequality with x = a — b and 
y = b and with x = b — a and y = a. Then use the result 
of exercise 42. 


a. 7,609 +5 = 7,614 


Answer to first question: No. Counterexample: Let 
m = 1,n = 3, and d = 2. Then m mod d = 1 and n mod 
d=1butm#n. 


Answer to second question: Yes. Proof: Suppose m, n, and 
d are integers such that m mod d =n mod d. Let r = m 
mod d =n mod d. By definition of mod, m = dp +r 
and n=dq+r for some integers p and q. Then 
m—n = (dp +r)— (dq +r) = d(p-— q). But p—q is an 
integer (being a difference of integers), and so m — n is divis- 
ible by d by definition of divisible. 
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SECTION 4.6 


1. 


3. 


10. 


12. 


13. 


14. 


15. 


17. 


137.999] = 37, [37.999] = 38 
|-—14.00001} = —15,[— 14.00001] = —14 


. |n/7|. The floor notation is more appropriate. If the ceil- 


ing notation is used, two different formulas are needed, 
depending on whether n/7 is an integer or not. (What 
are they?) 


A , | 2049 2049 | , | 2049 
a. (2050 +3 ]- [oo] + L300 ]) moa 7 
= (2050 + 512 — 20 + 5) mod 7 = 2547 mod 7 
= 6, which corresponds to a Saturday. 


b. Hint: One day is added every four years, except that 
each century the day is not added unless the century 
is a multiple of 400. 


Hint: The mistake is assuming what is to be proved. 
Explain the way in which the mistake occurs in the 
“proof.” 


Proof: Suppose n is any even integer. By definition of 
even, n = 2k for some integer k. Then 


aje 


But k=- 


because k is an integer and 
k=k<k=1. 


because n = 2k. 
Thus, on the one hand, es = k, and on the other hand, 


k= 7: It follows that |z] = 5 [as was to be shown]. 


Counterexample: Let x = 2 and y = 1.9. 
Then |x — y] = |2 — 1.9] = [0.1] = 0, whereas 
Ixl-Lly] =12)-[1.9) = 1. 


Proof: Suppose x is any real number. Let m = |x]. By 


definition of floor, m = x < m + 1. Subtracting 1 from 
all parts of the inequality gives that 


m-l1sx-l<mn, 
and so, by definition of floor, |x — 1] = m — 1. It follows 
by substitution that |x — 1] = |x]— 1. 
Proof for the case where n mod 3 = 2: 


In the case where n mod 3 = 2, then n = 3q + 2 for 
some integer q by definition of mod. By substitution, 


= la + zl =q _ because q is an integer and 
q=qt2/3<qtl. 


4.7 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-30 


But 3. 


n 
—— by solving n = 3q +2 for q. 


Thus, on the one hand, [3 | = q, and on the other hand, 


q="5 2 It follows that l=" 2 


18. Hint: This is false. 
19. Hint: This is true. 


23. Proof: Suppose x is a real number that is not an integer. 
Let|x] = n. Then, by definition of floor and because 
x is not an integer, n < x < n + 1. Multiplying both 
sides by —1 gives —n > —x > —n — 1, or equivalently, 
n—1 < —x < —n. Since —n — | is an integer, it fol- 
lows by definition of floor that |—x| = —n— 1. Hence 


[xj +|-x]=n+(—-n-1)=n-n-1=—Il, 


as was to be shown. 


25 


Hint: Let n = [š] and consider the two cases: n is even 
and n is odd. 


26. Proof: Suppose x is any real number such that 


x—>|x]< 5. Multiplying both sides by 2 gives 8. 
2x — Ix < 1, or 2x < 2xit 1. 
Now by definition of floor, |x] = x. Hence, 2|x| = 2x. 
Putting the two inequalities involving 2x together gives 
Ax] S 2x < 2x] + 1. 
Thus, by definition of floor (and because 2|x] is an inte- 
ger), [2x] = 2|x]. This is what to be shown. 


28. Hint: After applying the hypothesis that n is odd, evalu- 
ate the two sides of the equation separately and show 


that the results are equal. 


30. Hint: Divided into two cases: n is even and n is odd. 
For each case evaluate the two sides of the equation 


separately and show that the results are equal. 


31. Hint: Start by dividing the proof into two cases: n is 
even and n is odd. In case n is odd, use the quotient- 
remainder theorem with divisor equal to 6 to divide into 
three cases: n = 6k + 1, n = 6k +3, and n = 6k +5 for 
some integer k. You will need to consider a total of four 
cases. 
10. 


SECTION 4.7 


1. (a) acontradiction 
(b) a positive real number 
(c) x 12. 
(d) both sides by 2 
(e) contradiction 


Proof: Suppose not. That is, suppose there is an integer 
n such that 37 + 2 is divisible by 3. [We must show that 
this supposition leads to a contradiction.] By definition of 
divisibility, 3n +2 = 3k for some integer k. Subtracting 
2 from both sides gives that 3n = 3k + 2, and subtract- 
ing 3k from both sides gives 3n — 3k = 2, which implies 
that 3(n — k) = 2 by factoring out 3. Dividing both 

sides by 3 gives n — k = 2/3. Now n — k is an integer 
(because it is a difference of integers) and 2/3 is not an 
integer. Since an integer cannot equal a non-integer, we 
have reached a contradiction. [Hence the supposition is 
false and the given statement is true.] 


Negation for the statement: There is a greatest even 
integer. 


Proof of the statement: Suppose not. That is, suppose 
there is a greatest even integer; call it N. Then N is an 
even integer, and N = n for every even integer n. [We 
must deduce a contradiction.] Let M = N+ 2. Then M is 
an even integer since it is a sum of even integers, and 
M >N since M = N+2. This contradicts the supposi- 
tion that N = n for every even integer n. [Hence the sup- 
position is false and the statement is true.] 


(a) a rational number 

(b) an irrational number 
(©) 5 

(d) 5 

©;-; 

(f) integers 

(g) integers 

(h) zero product property 
(i) rational 


a. The mistake in this proof occurs in the second 
sentence where the negation written by the student 
is incorrect: instead of being existential, it is uni- 
versal. The problem is that if the student proceeds 
in a logically correct manner, all that is needed to 
reach a contradiction is one example of a rational 
and an irrational number whose difference is ir- 
rational. To prove the given statement, however, 
it is necessary to show that there is no rational 
number and no irrational number whose difference 
is rational. 


The mistake is that the negation for S that was used in 
the “proof” is incorrect. Thus deducing a contradiction 
from it fails to prove that S is true. (The actual negation 
is “There exist positive real numbers r and s such that 


VrFs = Vr+ V5”) 


a. Negation for R: There exists an irrational number 
whose square root is rational. 

b. Proof of R by contradiction: Suppose not. That is, 
suppose there exists an irrational number x such that 
the square root of x is rational. [We must derive a 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A-31 


B. 


14. 


15. 


16. 


17. 


19 


20. 


contradiction.] By definition of rational, Vx = A for 
some integers a and b with b # 0. By substitution, 


av? 
(Va = (<) ; 


and so, by algebra, 
_a 
= p 
But a° and b° are both products of integers and thus 
are integers, and b? is nonzero by the zero product 


property. Thus © is rational. It follows that x is both 
irrational and rational, which is a contradiction. 
[This is what was to be shown.] 


a. Negation for S: There exist an irrational number 
and a nonzero rational number whose product is 
rational. 


b. Hint: Recall that to say a mod 6 = 3 means that 
there exists an integer r such that a = 6r + 3. 


Hint: You could argue directly from the definition of odd, 
or you could use a result from Example 4.3.2. 


Proof by contradiction: Suppose not. That is, suppose that 
there exist odd integers a and b such that b? — a? = 4. [We 
must show that this supposition leads logically to a contradic- 
tion.] Factoring gives that 


b-ga = (b+ a\(b—a) =4. 


Now b>a because b’—a>=4>0, and the only 
way to factor 4 is either 4 = 2-2 or 4 = 4-1. Hence ei- 
ther b+a=b-a=2, or b+a=4 and b—a=1 or 
b+a=1landb-a=4. 
In case b +a = b—a = 2, then —a = aand soa = 0, 
which is not an odd integer. 


In case b +a = 4 and b—a = 1, then 2b = 5 and so 
b = 5/2, which is not an odd integer. 
In case b+ a = 1 and b—a = 4, then 2b = 5 and so 


b = 5/2, which is not an odd integer. 


Thus there are no odd integers a and b such that 
b? — a = 4, which contradicts the supposition. [Hence 
the supposition is false and the given statement is true.] 


Hint: Use the fact that a” = c? — b? = (c — b)(c + b) and 
apply the unique factorization of integers theorem. 


Hint: Suppose n” — 2 is divisible by 4, and consider the 
two cases where n is even and n is odd. (An alternative 
solution uses Proposition 4.7.4.) 


a. 5|n 
b. 5|n° 
c. 5k 

d. (5k) 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


e. 5|n* 


a. Proof by contradiction: Suppose not. That is, sup- 
pose there is an integer n such that n’? is odd and n is 
even. Show that this supposition leads logically to a 
contradiction. 

b. Proof by contraposition: Suppose n is any integer such 
that n is not odd. Show that n? is not odd. 


Formal version of the statement to be proved: For every 
real number x, if x is irrational then —x is irrational. 
Proof by contraposition: Suppose x is any real number 


such that — x is not irrational. By definition of irrational 
this means that — x is rational. [We must show that x is not 


irrational, or, equivalently, that x is rational.] By defini- 
tion of rational, —x = a/b for some integers a and b with 
b # 0. Then, by algebra, 


(a/b) = (—a)/b. 

Now ~a is an integer because a is an integer and because 
—a = (— l)a. Also b is a nonzero integer. Thus x is a ratio 
of integers with a nonzero denominator, and hence x is 
rational /as was to be shown]. 

Proof by contradiction: Suppose there exists a real num- 
ber x such that x is irrational and — x is not irrational. [We 


x (=x) 


must show that this supposition leads logically to a con- 
tradiction.] By definition of rational, —x = a/b for some 
integers a and b with b # 0. Then, by algebra, 

(a/b) = (—a)/b. 

Now -a is an integer because a is an integer and because 
—a = (— l)a. Also bis a nonzero integer. Thus x is a ratio 
of integers with a nonzero denominator, and hence x is 
rational. Hence x is both irrational and rational, which is 
a contradiction [which shows that the negation is false and 


x (=X) 


therefore that the statement to be proved is true]. 


Hint: See the answer to exercise 21 and look carefully at 
the two proofs for Proposition 4.7.4. 


Proof by contraposition: Suppose a, b, and c are any 
[particular but arbitrarily chosen] integers such that 

a|b. [We must show that a|bc.] By definition of divides, 

b = ak for some integer k. Then bc = (ak)c = a(kc). But 
kc is an integer (because it is a product of the integers k 


and c). Hence a|bc by definition of divisibility /as was to 
be shown]. 


Proof by contradiction: Suppose not. [We take the nega- 
tion and suppose it to be true.] Suppose J integers a, b, 
and c such that afbc and a|b. Since a|b, there exists an 
integer k such that b = ak by definition of divides. Then 
bc = (ak)c = a(kc) [by the associative law of algebra]. But 
kc is an integer (being a product of integers), and so a|bc 
by definition of divides. Thus afbc and a|bc, which is a 
contradiction. [This contradiction shows that the supposi- 


tion is false, and hence the given statement is true.] 


28. 


29. 


31. 


32. 


34. 


35. 


36. 


Hint: To prove p — q V r, it suffices to prove either 
DA\~q-—rorp ^~r— q. See exercise 14 in Section 2.2. 


Hints: (1) The contrapositive is “For all integers m and 
n, if m and n are not both even and m and n are not both 
odd, then m +n is not even.” Equivalently: “For all 
integers m and n, if one of m and n is even and the other 
is odd, then m + n is odd.” 


(2) The negation of the given statement is the following: 
4 integers m and n such that m + n is even, and either m is 
even and n is odd, or m is odd and n is even. 


a. Proof by contraposition: Suppose n, r, and s are 
positive integers and r > Vn and s > Vn. [We 
must show that rs > n.] By Theorem T27 in Ap- 
pendix A (witha = r, b = s, and c = d = Vn), 

rs > Vn- Vn = n. Thus the contrapositive of the 
given statement is true, and so the given statement is 
also true. 


667 = 25.8, and so the possible prime factors to 
be checked are 2, 3, 5, 7, 11, 13, 17, 19, and 23. Test- 
ing each in turn shows that 667 is not prime because 
667 = 23-29. 

V557 = 23.6, and so the possible prime factors 

to be checked are 2, 3, 5, 7, 11, 13, 17, 19, and 23. 
Testing each in turn shows that none divides 557. 
Therefore, 557 is prime. 


a. V 9269 = 96.3, and so the possible prime factors 

to be checked are all among those you found for 
exercise 33. Testing each in turn shows that 9,269 is 
not prime because 9,269 = 13-713. 

V'9103 = 95.4, and so the possible prime factors to 
be checked are all among those you found for exer- 
cise 33. Testing each in turn shows that none divides 


9,103. Therefore, 9,103 is prime. 


Hint: Assuming that n is not composite, show that 
n—4,n—6, andn- 8 are all prime. Next show that 
n—7 is divisible by 3 by considering n—6,n—7, 
and n — 8. Finally, write n— 4 = (n—7)+3 and show 
that 3 divides n—4. 


Hint: Use proof by contradiction. Suppose a, b, and c are 
odd integers, z is a solution to ax’ +bxt+ c, and z is ratio- 
nal. Then z = m'/n', for some integers m’ and n' with 

n' # 0. Divide out the greatest common factor of m' and 
n' (possibly 1) to obtain two integers m and n with no 
common factor that satisfy the equation m'/n' = m/n. 
Substitute m/n into ax? + bx + c, and multiply through 
by n’. Show that (1) the assumption that m is even leads 
to a contradiction, and (2) the assumption that n is even 
leads to a contradiction. 
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SECTION 4.8 


1. 


12. 


The value of V2 given by a calculator is an approxima- 
tion. Calculators can give exact values only for numbers 
that can be represented using at most the number of 
decimal digits in the calculator display. In particular, 
every number in a calculator display is rational, but even 
many rational numbers cannot be represented exactly. 
For instance, consider the number formed by writing 

a decimal point and following it with the first million 
digits of V2. By the discussion in Section 4.2, this 
number is rational, but you could not infer this from the 
calculator display. 


Yes. In fact there are infinitely many rational numbers 
with the same first trillion digits as V2. For instance, 
if you end the number after the first trillion digits, the 
result is a finite decimal, which is rational. Repeating 
the first trillion digits of V2 forever would create a re- 
peating decimal, which is rational. Or you could follow 
the first trillion digits of V2 by 012343434. .., where 
the digits 34 repeat forever. This is also rational. Try 
creating other examples. 


Proof by contradiction: Suppose not. That is, suppose 


6—7V2 is rational. [We must prove a contradiction.] By 
definition of rational, there exist integers a and b with 
b + 0 with 


a 
6-712 =- 
b 
Then 
1 fa by subtracting 6 from 
V2 = 4(¢- 6) both sides and dividing 
—7\b both sides by —7, 
a— 6b 
andso V2= 7b by the rules of algebra. 


But a — 6b and —7b are both integers (since a and b are 
integers and products and difference of integers are inte- 
gers), and —7b # 0 by the zero product property. Hence 
V2 is a ratio of the two integers a— 6b and —7b with 
—7b # 0, so V2 is a rational number (by definition of 
rational). This contradicts the fact that \V2 is irrational, 
and so the supposition is false and 6 — 7V2 is irrational. 


This is false. V4 = 2 = 2/1, which is rational. 


Counterexample: Let x = V2 and y = — V2. Then x and 
y are irrational, but x + y = 0 = 0/1, which is rational. 
True. 


Formal version of the statement: WY positive real number 
r, if r is irrational, then Vr is irrational. 
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16. 


19. 


Proof by contraposition: Suppose r is any positive real 
number such that Vr is rational. [We must show that r is 


rational.] By definition of rational, Vr = F for some inte- 
gers a and b with b # 0. Then r = (Wry = (e? = oo But 


both a? and b? are integers because they are products of 


integers, and P+O by the zero product property. Thus r 
is rational /as was to be shown]. 


(The statement may also be proved by contradiction.) 


Hint: Can you think of any “nice” integers x and y that 
are greater than 1 and have the property that y= y? 


a. Proof by contradiction: Suppose not. That 
is, suppose there is an integer n such that 
n = 3q, +r, = 3q + r, where q1, qo, Fi, and 
r, are integers, 0 =r, < 3,057 < 3, and 
rı # m. By interchanging the labels for rı and r3 
if necessary, we may assume that r > rı. Then 
3(q1ı — q2) = r2 — rı > 0, and because both r; and 
r are less than 3, either 7) — r; = 1 or nr = 2. 
So either 3(q1 — q2) = 1 or 3(q, — q2) = 2. The 
first case implies that 3|1, and hence, by Theorem 
4.4.1, that 3 = 1, and the second case implies that 

3|2, and hence, by Theorem 4.4.1, that 3 = 2. 
These results contradict the fact that 3 is greater 
than both 1 and 2. Thus in either case we have 
reached a contradiction, which shows that the sup- 
position is false and the given statement is true. 

b. Proof by contradiction: Suppose not. That is, sup- 
pose there is an integer n such that n? is divisible 
by 3 and n is not divisible by 3. [We must deduce 
a contradiction.] By definition of divisible, 


n= 3q for some integer q, and by the quotient- 
remainder theorem and part (a), n = 3k + 1 or 
n = 3k+2 for some integer k. 


Case 1 (n = 3k +1 for some integer k): In this case 
n? = Bk+1) = 9K +6k+1 = 33K +2k)+ 1. 


Let s = 3K? + 2k. Then n? = 3s + 1, and s is an inte- 
ger because it is a sum of products of integers. Thus 
n? = 3q = 3s + 1 for some integers q and s, which 
contradicts the result of part (a). 


Case 2 (n = 3k + 2 for some integer k): In this case 
n? = (3k+2) = 9K +12k+4 = 33K +4k4 1) 41. 


Lett = 3k +4k+ 1. Then n? = 3t + 1, and ż is an in- 
teger because it is a sum of products of integers. Thus 
there are integers q and ż so that n? = 3q = 3t+ 1, 
which contradicts the result of part (a). 


Thus in either case, a contradiction is reached, 
which shows that the supposition is false and the 
given statement is true. 
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c. Proof by contradiction: Suppose not. That is, 
suppose V3 is rational. By definition of rational, 
V3= E for some integers a and b with b # 0. 
Without loss of generality, assume that a and b have 
no common factor. (If not, divide both a and b by 
their greatest common factor to obtain integers a’ 
and b’ with the property that a’ and b’ have no com- 
mon factor and V3 = A . Then redefine a = a’ and 
b = b.) Squaring both sides of V3 = F gives V3 = 


me and multiplying both sides by b? gives 

3D? = a). 
Thus a’ is divisible by 3, and so, by part (b), a is 
also divisible by 3. By definition of divisibility, then, 
a = 3k for some integer k, and so 

a’ = 9P). 
Substituting equation (**) into equation (*) gives 
3b? = 9k, and dividing both sides by 3 yields 

b = 3k. 

Hence b? is divisible by 3, and so, by part (b), b is 
also divisible by 3. Consequently, both a and b are 
divisible by 3, which contradicts the assumption that 


a and b have no common factor. Thus the supposi- 
tion is false, and so \V/3 is irrational. 


Hint: The proof is a generalization of the one given in 
the solution for exercise 19(a). 


Hint: First prove that for all integers a, if 5 divides a 
squared then 5 divides a. The rest of the proof is similar 
to the solution for exercise 19(c). 


Hint: This statement is true. If a? — 3 = 9b, then 

a =9b+3= 3(3b + 1), and so a is divisible by 3. 
Hence, by exercise 19(b), a is divisible by 3. Thus 
a = (3c)’ for some integer c. 


Proof by contradiction: Suppose not. That is, suppose 


V2 is rational. [We will show that this supposition leads 
to a contradiction.] By definition of rational, we may 
write V2 = a/b for some integers a and b with b # 0. 
Then 2 = a’/b’, and so a” = 2b”. Consider the prime 
factorizations for a” and for 2b”. By the unique fac- 
torization of integers theorem, these factorizations are 
unique except for the order in which the factors are writ- 
ten. Now because every prime factor of a occurs twice 
in the prime factorization of a’, the prime factorization 
of a’ contains an even number of 2’s. (If 2 is a factor of 
a, then this even number is positive, and if 2 is not a fac- 
tor of a, then this even number is 0.) On the other hand, 
because every prime factor of b occurs twice in the 
prime factorization of b’, the prime factorization of 2b" 
contains an odd number of 2’s. Therefore, the equation 


26. 


28 


29. 
30. 


32. 


33. 


34. 


35. 


37. 


a’ = 2b’ cannot be true. So the supposition is false, and 
hence V2 is irrational. 


Hint: One solution uses only Theorem 4.8.1. Another 
uses the result of exercise 25 that V6 is irrational. 


Hint: Divide 2-3-5-7 + 1 by each of 2, 3, 5, and 7, using 
the quotient-remainder theorem. 


Hint: You can deduce that p = 3. 
a. Hint: For example, N4 = 2:3-5-7+1= 211. 


Hint: By Theorem 4.2.4 (divisibility by a prime) there 
is a prime number p such that p|(n! — 1). Show that the 
supposition that p = n leads to a contradiction. It will 
then follow thatn < p < n!. 


Hint: Every odd integer can be written as 

4k+ 1 or as 4k +3 for some integer k. (Why?) If 
Pi pP2---Pa+ 1 = 4k+ 1, then 4|p, py... pp. Is this 
possible? 


a. Hint: Prove the contrapositive: If for some integer 
n > 2 that is not a power of 2, x” + y” = z” has 
a positive integer solution, then for some prime 
number p > 2, x? +y? = z” has a positive integer 
solution. Note that if n = kp, then x” = =a, 


Existence proof: When n = 2, then n—=1= 3, which is 
prime. Hence there exists a prime number of the form 
n? — 1, where n is an integer and n = 2. 


Uniqueness proof (by contradiction): Suppose to the 
contrary that m is another integer satisfying the given 
conditions. That is, m >2 and m° —1 is prime. /We 
must derive a contradiction.] Factor m?—1 to obtain 
m? —1 = (m—1)(m+ 1). But m > 2, and so m—1>1 
and m+ 1 > 1. Hence m*— 1 is not prime, which is a 
contradiction. /This contradiction shows that the supposi- 
tion is false, and so there is no other integer m > 2 such that 
n -lis prime.] 

Uniqueness proof (direct): Suppose m is any integer 
such that m = 2 and m° — 1 is prime. [We must show that 
m = 2.] By factoring, m?—1 = (m—1)(m+ 1). Since 
m—1 is prime, either m—1=1 or m+1=1. But 
m+1=2+1 = 3. Hence, by elimination, m—1 = 1, 
and so m = 2. 


Proof (by contradiction): Suppose not. That is, suppose 
there are two distinct real numbers a, and a, such that 
for all real numbers r, 


()a;+r=r and (Q)a+r=r. 
Then 
ayta =a, by(l)with r=a 
and 
a,+a,=a, by (2)with r=a). 
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It follows that 


d äi Ta? aà T a, ai, 

which implies that a, = a,. But this contradicts the sup- 
position that a, and a, are distinct. [Thus the supposition 
is false and there is at most one real number a such that 


a+r = r for all real numbers r.] 


Proof (direct): Suppose a, and a, are real numbers such 
that for all real numbers r, 


()ajtr=r and Q)a+t+r=r. 
Then 

a +a, = a by (1) with r=a 
and 

ata =a _ by(2)with r= a. 


It follows that 


d? ai Ta? dı Tı a. 


Hence a, = a. [Thus there is at most one real number a 
such thata +r = r for all real numbers r.] 


SECTION 4.9 


1. 


vertex vy v2 V3 
degree 3 2 4 2 1 0 


Total degree = 3+2+4+2+1= 12 


1 
# of edges = 6 = (3) 12 = one-half of the total degree 


3. The total degree of the graph is0+2+2+3+9= 16, 
, So, by the handshake theorem (Theorem 4.9.1), the 
number of edges is 16/2 = 8. 


5. One such graph is 


£ d 


6. If there were a graph with four vertices of degree 1, 2, 
3, and 3, then its total degree would be 9, which is odd. 
But by Corollary 4.9.2, the total degree of the graph 
must be even. [This is a contradiction.] Hence there is no 
such graph. (Alternatively, if there were such a graph, 
it would have an odd number of vertices of odd degree. 
But by Proposition 4.9.3 this is impossible.) 


9. Suppose there were a simple graph with four vertices 
of degrees 1, 2, 3, and 4. Then the vertex of degree 4 
would have to be connected by edges to four distinct 
vertices other than itself because of the assumption that 
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the graph is simple (and hence has no loops or parallel 21. a. 
edges). This contradicts the assumption that the graph 


Yes. Let G be a simple graph with n vertices and let 


12. 


has four vertices in total. Hence there is no simple graph 
with four vertices of degrees 1, 2, 3, and 4. 


vi 


V4 


v 


U3 


14. a. Define a graph G by letting each vertex represent a 


person at the party and drawing an edge between each 
pair of people who knew each other before the party. 
Let x be the number of people who knew three other 
people before the party. 


Then the total degree of the graph 
=2-1+5-2+x-3 = 12+3x 


v be a vertex of G. Since G has no parallel edges, v 
can be joined by at most a single edge to each of the 
n— 1 other vertices of G, and since G has no loops, 
v cannot be joined to itself. Therefore, the maximum 
degree of visn—1. 


e No. Suppose there is a simple graph with four 


vertices, all of which have different degrees. By part 
(a), no vertex can have degree greater than three, 
and of course, no vertex can have degree less than 0. 
Therefore, the only possible degrees of the vertices 
are 0, 1, 2, and 3. Since all four vertices have differ- 
ent degrees, there is one vertex with each degree. But 
then the vertex of degree 3 is connected to all the 
other vertices, which contradicts the fact that one of 
the vertices has degree 0. Hence the supposition is 
false, and there is no simple graph with four vertices 
each of which has a different degree. 


22. Hint: Use the result of exercise 21, part (c). 


because 2 people knew 1 other person before the 23. a. Ky»: 
party, 5 people knew 2 other people before the party, vi 
and x people knew 3 other people before the party. 
In addition, since a total of 15 pairs of people knew 
each other before the party, the graph has 15 edges. v Us 
By the handshake theorem (Theorem 4.9.1), the total 
degree is twice the number of edges. Hence the total 
degree of the graph = 2-15 = 30. U3 V6 
It follows that 12 + 3x = 30. Thus 
3x=30-12=18, andso x=Ë=6. 7 
In other words, 6 people at the party knew 3 other . ae . 
23. d. Ifn + m, the vertices of K, n» are divided into 


people before the party. 


two groups: one of size m and the other of size 


b. Now the total number of people at the party is the : ; 

sum of the number who knew 1 other person before n. Every vertex in the group of size m has degree 

the party, plus the number who knew 2 other people ae os "3 En tO eVETY vertex oe 

before the party, plus the number who knew 3 other the group Eara So Km,» has m vertices of 

people before the party. Therefore, the number of degree n. Similarly, every vertex in the group of 

people at the party = 2+5 +6 = 13 size n has degree m because each is connected to 

` every vertex in the group of size m. So K,,,, has 

16. a. Suppose that, in a group of 15 people, each person n vertices of degree m. Note that if n = m, then 


had exactly three friends. Then you could draw a 
graph representing each person by a vertex and con- 
necting two vertices by an edge if the corresponding 
people were friends. But such a graph would have 15 
vertices, each of degree 3, for a total degree of 45. 


24. 


all n +m = 2n vertices have the same degree, 
namely, n. 


- This graph is bipartite 


7 Si eo "y 
This would contradict the fact that the total degree n 4 \ wf í ` ` 
of any graph is even. Hence the supposition must be ) UI ua \ 
false, and in a group of 15 people it is not possible ! | i 
for each to have exactly three friends. ‘ l ‘ 
\ U3 r U4 / 
19. Hint: Let t be the total degree of the graph, let dimin be ae a ÈA 7 


the minimum degree of any vertex in G, and let dmax be 
the maximum degree of any vertex in G. 
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. Suppose this graph is a bipartite. Then the vertex 


set can be partitioned into two mutually disjoint 
subsets such that vertices in each subset are 


4.10 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


connected by edges only to vertices in the other 
subset and not to vertices in the same subset. 
Now v; is in one subset of the partition, say, Vj. 
Since v, is connected by edges to v, and v3 both 
v and v3 must be in the other subset, V}. But vy 
and v3 are connected by an edge to each other. 
This contradicts the fact that no vertices in Vy are 
connected by edges to other vertices in V3. Hence 
the supposition is false, and so the graph is not 
bipartite. 


SECTION 4.10 


A-36 


8. a. 
A 69 19 9 
q 2 
d il 
n 1 
p 4 
9. gcd(27, 72) = 9 
10. gcd(5, 9) = 1 
13. Divide the larger number, 1,188, by the smaller, 385, to 


obtain a quotient of 3 and a remainder of 33. Next divide 


1 z=0 385 by 33 to obtain a quotient of 11 and a remainder of 
3. a. z=18 22. Then divide 33 by 22 to obtain a quotient of 1 and a 
ii remainder of 11. Finally, divide 22 by 11 to obtain a 
4. Trace table: quotient of 2 and a remainder of 0. Thus, by Lemma 
- 4.10.2, ged(1188, 385) = ged(385, 33) = gcd(33, 22) = 
i 0 1 2 3 ged(22, 11) = ged(11, 0), and by Lemma 4.10.1, 
a 2 7 22 67 gcd(11, 0) = 11. So ged(1188, 385) = 11. 
After execution: a = 67 14. Divide the larger number, 1,177, by the smaller, 509, to 
6 obtain a quotient of 2 and a remainder of 159. Next di- 
i Iteration Number vide 509 by 159 to obtain a quotient of 3 and a remainder 
0 1 2 3 of 32. Next divide 159 by 32 to obtain a quotient of 4 
and a remainder of 31. Then divide 32 by 31 to obtain a 
a 26 quotient of 1 and a remainder of 1. Finally, divide 31 by 1 
d 7 to obtain a quotient of 31 and a remainder of 0. Thus, by 
Lemma 4.10.2, ged(1177, 509) = ged(509, 159) = 
q 0 | 2 3 ged(159, 32) = gcd(32, 31) = ged(31, 1) = ged(1, 0), and 
r 26 19 12 5 by Lemma 4.10.1, gcd(1, 0) = 1. So ged(1177, 509) = 1. 
17. 
A 1,001 
B 871 
r 130 91 39 13 
b 871 130 91 39 13 
a 1,001 871 130 91 39 13 
gcd 13 
20. 
A 4,617 
B 2,563 
a 4,617 2,563 2,054 509 18 5 3 2 1 
2,563 2,054 509 18 5 3 2 1 0 
r 2,563 2,054 509 18 5 3 2 1 0 
gcd 1 


The table shows that the greatest common divisor of 4,617 and 2,563 is 1, and so these integers are relatively prime. 
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A-37 


22. 


25. 


26. 


Hint: Divide the proof into two parts. In part 1 suppose 


derive the conclusion that gcd(a, b) = a. To do this, note 
that because it is also the case that ala, a is a common 
divisor of a and b. Thus, by definition of greatest com- 
mon divisor, a is less than or equal to the greatest com- 
mon divisor of a and b. In symbols, a = gcd(a, b). Then 
show that a = gcd(a, b) by using Theorem 4.4.1. In 

part 2 of the proof, suppose a and b are positive integers 
such that gcd(a, b) = a and deduce that a|b. 


a. Hint 1: Ifa = dq —r, then ~a = —dqt+r= 
dq—d+d—r=d—q-1)+(d-r). 
Hint 2: £0 =r < d, then 0 = —r> —d. Add d to 
all parts of this inequality and see what results. 


a. Proof: Suppose a, d, q, and r are integers such that 
a=dq+rand0 =r < d. [We must show that 
q= [z] andr =a— d| SJ Solving a = dq +r for 
r gives r = a — dq, and substituting into 0 S r < d 
gives 0 = a — dq < d. Add dq to both sides to obtain 
dq Sa < d+ dq = d(q + 1). Then divide through by 
d to obtain q = 3 < q + 1. Therefore, by definition of 
floor, H = q. Finally, substitution into a = dq +r 
givesa = d [S| +r, and subtracting d|5| from both 


sides yields r = a — d|5| [as was to be shown]. 


27. b. 
Iteration Number 
0 1 2 3 4 
a 630 294 294 252 210 
b 336 336 42 42 42 
gcd 
Iteration Number 
5 6 7 8 9 
a 168 126 84 42 0 
b 42 42 42 42 42 
gcd 42 
28. a. Icm(12, 18) = 36 


. Proof: Part 1: Let a and b be positive integers, and sup- 
pose d = gcd(a, b) = Icm a, b). By definition of greatest 
common divisor and least common multiple, d > 0, 

=dm 

and d = an for some integers m and n. By substitution, 

a = dm = (an)m = anm. Dividing both sides by a gives 

1 = nm. But the only divisors of 1 are l and — 1 

(Theorem 4.4.2), and som = n = +1. Since both a and 

d are positive, m = n = 1, and hence a = d. Similar 

reasoning shows that b = d also, and soa = b. 
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32. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Part 2: Given any positive integers a and b such that 
a = b, we have gcd(a, b) = gcd(a, a) = a and lcm(a, b) = 
Icm(a, a) = a, and hence gcd(a, b) = Icm(a, b). 


Hint: Divide the proof into two parts. In part 1, suppose 
a and b are any positive integers, and deduce that 


gcd(a, b)-lcm(a, b) = ab. 


Derive this result by showiig that Icm(a, b) = ED D: 
To do this, show that ged a5), is a multiple of both a and 
b. For instance, to see that zda D is a multiple of b, note 
that because gcd(a, b) divides a, a = gcd(a, b)-k for 
some integer k, and thus ab = gedla, b): kb. Divide both 
sides by gcd(a, b) to obtain =, a7 ab) = kb. Des since k is 
an integer, this equation implies: that gcd(a, b) is a mul- 
tiple of b. The argument that ICD a,b) is a multiple of a 
is almost identical. In part 2 of the proof, use thig defini- 
tioni of least common miltiple í to show that ae |a and 
inte man Conclude that =o b= < gcd(a,b) and hence 


that ab = gcd(a, b)-Icm(a, b). 


SECTION 5.1 


1 2 3 4 1 1 1 
See Asaa S 050; 252 
11 12 13 14 39 27 
gı = [logs 1] = 0 
& = llog22] = 1, g3 = llog23]|= 1 
84 = llog24] = 2, gs =|log,5) = 2 
& = llog26] = 2, g7 = llog27] = 2 
gs = llog28] = 3, go = |log29] = 3 


811 = [logs 11] = 3 
813 = [logs 13) = 3 
815 = [logs 15] = 3 


810 = Log, 10] = 3, 
812 = [log 12] = 3, 
814 = [logs 14] = 3, 


When n is an integral power of 2, g,, is the exponent of that 
power. For instance, 8 = 2? and gg = 3. More generally, 
ifn = 2*, where kis an integer, then g, = k. All terms of 
the sequence from gx up to, but not including, g>+ have 
the same value, namely k. For instance, all terms of the 
sequence from gg through g;5 have the value 3. 


Exercises 10-16 have more than one correct answer. 


10. 
Tl. 


a, = (—1)", where n is an integer and n = 1 
a, = (n— 1)(—1)", where n is an integer and n = 1 
n . . 

a, = ——— > where n is an integer and n = 1 
(n+ 1) 
n 

dy = sp Where n is an integer and n = 1 
3 


18. a. 2+34+(-2)+140+(-1)+(-2) = 1 
b. a =2 
c a+a,+as=-2+0+(-2) = —4 


d. 2:3-(—2):1-0-(—1):(-2) =0 


19. 2+34+4+5+6=20 20. 27-3?-4" = 576 
23. 1(1+1)=2 


a (iaa a) ed) 


29. (-2)! + (-2)? + (-2)> +--+ + (-2)" 
= —24+727?-234 +(—1)"2" 

my 1 1l 1 1 

ai Zuo "aiat Ga dy 

33, 1-1 
12 

do cay a 
1+1/\2+1/\3+1) \olas\a}~ 4 


k+1 


37. > +(k+1% = = ai 
39. Hint: n+2 = (n+1)+1. 
k+1 k 


40. Si!) = Si) + K+ DKF DV! 


i=l i=1 


Exercises 43-52 have more than one correct answer. 


7 6 
43. XOD o SH e+ DP 
k=1 k=0 
S CDI Ly 
Gi G+ DG+D G+bDG+2 ” > kk+ 1) 
5 +. n 
47. XOD 4. Sk 
i=0 k=1 


n—l 
51. X-i) 
i=0 


53. When k = 0, then i = 1. When k = 5, then i = 6. Since 


i = k+ 1, then k = i — 1. Thus, 


kk- 1) = (i—- 1X- 1)- 1) = (i— 1I) — 2), 


and so 


5 6 
Skk-1) = X-i- 2 
k=0 i=1 


SOLUTIONS AND HINTS TO SELECTED EXERCISES A-38 


55. When i = 1, then j = 0. When į = n + 1, then j = n. 
Since j = i — 1, then i = j + 1. Thus, 
Gi) G+D-DÙ F 
in G+1)-n jntn 


(Note that n has the same value in each term of the sum.) 
n+l (i- 2 n F 


oo oD 


j=0 
56. When i = 3, then j = 2. When i = n, then j =n-—1. 
Since j = i— 1, then i = j+ 1. Thus, 


jntn 


n i = 1 j+1 
2 ai Ge 
n— itd 


59. 5 [3(2k — 3) + (4 — 5k)] 


k=1 
= DS [(6k 9)+ (4—5k)] = Dk 5) 
k=l k=1 
4:3-2+ 
62. 4 
3-2-T 
nn= n=) 324 
65. =n 
m=Dm- 32t 1 


6. = 
(n+ Dna =De n(n+1) 


ee — ... . . 2 
ex [n+ Dan- I(n—2)-°°3 Z if: ens 
[n(n—1)(n—2)++-3+2°1] 


n(n— 1)(n—2)***(n—k+ n= n= kK— D2 


69. 
m= tt 
= n(n—1)(n—2)°*-(n—k +1) 
a. G) 5! O 5432T _ 
3) BNY(5—-3)! G-212-1) 
n 3 3! a= 
“o ODB—-O)! DEDA 
75 n \ n! = n(u—tt 
“\n-1)) (n—Din-(n-1))!— (@—Din—-nt+ D! 
= 
1 


77. a. Proof: Let n be an integer such that n = 2. By defi- 
nition of factorial, 
2-1 ifn =2 
n! = 43-2-1 ifn =3 
niles 2-1 aS se. 
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In each case, n! has a factor of 2, and so n! = 2k for 88. 
some integer k. Then O remainder! = r[2]= li 
16/1 remainder 1 = r[1]= 116 
n!+2 = 2k+2 by substitution 16 | 17 remainder 15 = r[0] = F16 
= 2(k+1) by factoring out the 2. 16 | 287 
Since k+ 1 is an integer, n! + 2 is divisible by 2 fas Hence 28719 = 11F 6. 


was to be shown]. 
c. Hint: Consider the sequence m! + 2, m! +3, 


m!+4,...,m!+m. SECTION 5.2 


78. Proof: Suppose n and r are nonnegative integers with 1. a. The statement in part (a) is true because if 
i s i i 1 1 1 
rel = n. The right-hand side of the equation to be (1 = 5)(1 me 3) = $ then 
shown is 


FFE r 


= (ee n! as 


r+1 r!(n-r)! 1 1 13 1 
ner n! fi Jer 4 
r+1 riin—n-(n—r—D! re 
n! 2. a. P is the equation 1 = 1°, which is true. 
= b. P(k) is the equation 1 +3 +5+ -- +(2k-1l) = K. 
a | a c P(k+1) is the equation 1+3+5+ -4 
= (2k + 1)-1) = (k+ 1}? 
(r+ 1)l-(a—(r+ 1)! d. In the inductive step, show that if k is any 
n integer for which 1 +3 +5 + +- +(2k— 1) = K is 
afa true, then 1+3 +5 +  +(2(k+1)- 1) = (k+1)} 


is also true. 
which is the left-hand side of the equation to be shown. 


A+ (21+ 
6 


4 1 : 
3. a. P(1)is “q? = Js» P(1) is true because 


80. a. m-— 1, sum +aļi+ 1 
i ] the left-hand side equals 1° = 1 and the right-hand 


81. 0 remainder = r[6] = 1 side equals 1+ pre FA) = Ae. 
2)1 remainder = r[5] = 0 
2|2 remainder = r[4] = 1 5. a 1 
2-5 remainder = r[3] = 1 b. K 
2/11 remainder = r[2] = 0 ce 14+34+5+-:-+[(2(k+1)-1] 
2° |. 22. remainder = r[1] = 1 d. (k+ 1} 
2 | 45 remainder = r[0] = 0 e. the odd integer just before 2k + 1 is 2k— 1 
2 | 90 f. inductive hypothesis 
Hence 9019 = 1011010). 6. Proof: For the given statement, the property P(n) is the 
equation 
B4. 2+4+6+ + 2n=n +n. KPI) 
a 23 
Show that P(1) is true: 
l 0 | 2 4 To prove P(1), we must show that when 1 is substituted 
q 23 11 5 2 1 0 into the equation in place of n, the left-hand side equals 
r[0] 1 the right-hand side. But when 1 is substituted for n, the 
r{1] 1 left-hand side is the sum of all the even integers from 2 
to 2-1, which is just 2, and the right-hand side is r+ i; 
r[2] 1 which also equals 2. Thus P(1) is true. 
r[3] 0 
r[4] 1 
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Show that for every integer k = 1, if P(k) is true then 
P(k +1) is true: 


Let k be any integer with k = 1, and suppose P(A) is 
true. That is, suppose 


< P(k) 
inductive hypothesis 


We must show that P(k + 1) is true. That is, we must 
show that 


2+44+64°°: 


H2(k+1) = (k 


Because (k+ 1) +(k+1) = K +2k+1+k+1 = 
K +3k+2, this is equivalent to showing that 


244464 = +2k4+1) =K +A3Ik+2, Pk + l) 


Now the left-hand side of P(k + 1) is 


EEE EEEE EN 
=2+4+6+: 


+ 2k + 2(k + 1) 
by making the next-to-last 
term explicit 
= (+k) +2(k+1) 
by substitution from the 
inductive hypothesis 


=k +3k+2 by algebra, 


and this is the right-hand side of P(k + 1). Hence 
P(k + 1) is true. 


[Since both the basis step and the inductive step have been 
proved, P(n) is true for every integer n = 1.] 


Proof: For the given statement, the property P(n) is the 
equation 


Loehrer 2 = 2t], 


<— P(n) 


Show that P(O) is true: 


The left-hand side of P(O) is 1, and the right-hand side is 
2°*!_ 1 =2—1 = 1 also. Thus P(0) is true. 


Show that for every integer k = 0, if P(k) is true then 
P(k +1) is true: 


Let k be any integer with k = 0, and suppose P(k) is 
true. That is, suppose 


-+2 = 21-100 OP 

inductive hypothesis 
We must show that P(k + 1) is true. That is, we must 
show that 


. 2t = glkt4+1_y 


> 


5.2. SOLUTIONS AND HINTS TO SELECTED EXERCISES 


10. 


A-40 


or, equivalently, 


ieee 


EIH =y ey 


Now the left-hand side of P(k + 1) is 


14+24+2?4+--- +21 

ae +244 24t] 
by making the next-to-last 
term explicit 

= (oe —1)+ peel by substitution from the 
inductive hypothesis 

=9:9F t= 1 by combining like terms 

= brea 4 by the laws of exponents, 


and this is the right-hand side of P(k + 1). Hence the 
property is true for n = k+ 1. 


[Since both the basis step and the inductive step have been 
proved, P(n) is true for every integer n = 0.] 


Proof: For the given statement, the property is the 
equation 


ee ae ee ae a 
_ n(n+1)2n+ 1) 


6 <— P(n) 


Show that P(1) is true: 
The left-hand side of P(1) is 1° = 1, and the right-hand 


. . 1+H2-1+1 ; 
side is at us 23 = | also. Thus P(1) 


is true. 
Show that for every integer k = 1, if P(k) is true then 
P(k+1) is true: 


Let k be any integer with k = 1, and suppose P(A) is 
true. That is, suppose 


P4+274+3 4-4 


_ kk+DOK+D = 


6 inductive hypothesis 


We must show that P(k + 1) is true. That is, we must 
show that 


2 


PH +e 4k +1) 


(k+ I(k+1) + (2k +1) +1) 
z : 


or, equivalently, 


PH+ + ++) +(k+1) 


| (k+ K+ 2)(2k + 3) 
6 ; 


<—Pk+1) 
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A-41 


Now the left-hand side of P(k + 1) is 


r 4+ (k +19 


= 177427743745 4Ph 4419 
by making the next- 
to-last term explicit by 
substitution from the 
inductive hypothesis 


E k(k+ 1)(2k+ 1) (k4 1 
6 
O K++) , 6(K+ 1° er oe 
6 6 
= k(k + 1)(2k + 1) + 6(k 4 1) by adding fractions 
6 
k(k-+ D[k(2k + 1) + 6(k + 1)] 


by factoring out (k + 1) 


6 


2 
_ (k+ DOK + 7k+ 6) by multiplying out and 
6 combining like terms 


(k+ IK + 2)(2k + 3) because (k + 2) 


6 (2k+3) = 2K +7k+6, 


and this is the right-hand side of P(k + 1). Hence the 
property is true for n = k+ 1. 


[Since both the basis step and the inductive step have been 
proved, P(n) is true for every integer n = 1.] 


13. Proof: For the given statement, the property P(n) is the 


equation 
no —1)n+1 
SiG+) = WAE DED <—P(n) 
i=l 3 

Show that P(2) is true: 


The left-hand side of P(2) is > iit+1)=1-(1+1) =2, 


n <q. « M= 
and the right-hand side is oe = $ = 2 also. Thus 


P(2) is true. 

Show that for every integer k = 2, if P(k) is true then 
P(k +1) is true: 

Let k be any integer with k = 2, and suppose P(A) is 
true. That is, suppose 


I O kk- Dkt) 
SiG+) : 


i=1 


<— P(k) 

inductive hypothesis 
We must show that P(k + 1) is true. That is, we must 
show that 


Era H H H1)+4 
3 +n- Dk eae 1) D) 


i=1 
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15. 


20. 


22. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


or, equivalently, 


E pan (k+ 1)k(k + 2) 
aie + 1) 3 ; 


Now the left-hand side of P(k + 1) is 


< P(k+ 1) 


k 
Siti +1) 
i=1 
k-1 
= didi + 1)+k(k+ 1) by writing the last term 
i=1 separately 
_ kk 1k +1) kk by substitution from the 
(k+ 1) inductive hypothesis 
3 
= k(k-1)(kK+1) 3k(k+1) because $ = 1 
3 3 
—_ kk Wk + 1) + 3k(K + 1) 


by adding the fractions 


3 


_ kk+ Ik 1) +3] 
3 


by factoring out k(k + 1) 


_ kk+1IXk+2) 
3 


by algebra, 


and this is the right-hand side of P(k + 1). Hence 
P(k + 1) is true. 
[Since both the basis step and the inductive step have been 


proved, P(n) is true for every integer n = 2.] 


Hint: To prove the basis step, show that 

>)_, iG!) = (1 + 1)! - 1. To prove the inductive 
step, suppose that k is any integer such 

that k = 1 and Dt, ii!) = (k+ 1)!— 1, and 


show that pak i(i!) = (k+2)!— 1. Note that 


(k+ D!—1 + (kt DIK+ D!] = 
FDI +EEDI=1. 
4+8+12+16+--- +200 =41+2+3+4 
++ +50) = 42224) = 5,100 
a. 3+4+5+6+--: +1000 
= (1+24+3+4+--» +1000)-(1+2) 
1000-1001 
( 00 2) 3 = 500,497 
2 
b. 3+4+5+6+ =: +m 
=(1+2+3+4+ `- +m)- (1+2) 
_m(m+1) _ am +m 6 
2 2 2 
_ m+m—6 
2 


24. K- D&-1)+1) _ kk-1) 


2 2 
76 4 
25. a. = 27° 1 = 67,108,863 
2-1 
b. 24+27+27+--. +276 
= 2(1+24+27+ +--+ +27) 
= 2-(67108863) by part (a) 
= 134,217,726 


c. Solution 1: 
2427427 +--+ 42" 
=2(14+24+274+27 +--+ 42") 
ge-Dtl_ 4 on] 
=? — 2 = 72"! 2 
2—1 2-1 


Solution 2: 


24277424 +5542" = (1424274234 
-+21 
gntl_y 
= Ji q =2”t1 2 
fA 
Taa 
2 _ ont 1 
28 L, zi pt 2) 
2 2 
— 2 | = 1 
gntl 2 2 J” 


30. General formula: For every integer n = 1, 
11 
1-3 3-5 
Proof (by mathematical induction): Let the property 
P(n) be the equation 
1 1, , 1 o n 
L3 35 Qn—-DQnt1) ntl 
Show that P(A) is true: 
The left-hand side of P(1) equals and the right-hand 


1 o n 
' (2n-1)\2n+1) 2n+1 


side equals zT But both of these equal 7 so P(1) is 


true. 

Show that for each integer k = 1, if P(k) is true then 

P(k +1) is true: 

Suppose that k is any integer with k = 1, and 

1 1 1 eard 1 k <— P(k) inductive 
1-3 3:5 (2k—1)\(2k+1) 2k+1' hypothesis 
We must show that 
1 4 1 l E 1 

1:3 35 Akt- DAk+1) +1 
eed 
~ 2k+D+T 


5.2. SOLUTIONS AND HINTS TO SELECTED EXERCISES 


A-42 


or, equivalently, 
1 1 1 


1:3 3:5) 


_ k+l 
2k+3° 


—P(k+1) 


" (2k + 1)(2k + 3) 
Now the left-hand side of P(k + 1) is 
1 1 1 


13.35  (Qk+1Qk+3) 
i. 4 1 


ea ro" FOR DOKID 


1 
+ —_______. 
(2k + 1)(2k + 3) 
k 1 


{ by inductive 
2k+1  (2k+1)(2k+3) hypothesis 


O kKQK+3) 1 
(2k+1)(2k+3)  (2k+1)(2k +3) 


2 +3k+1 
(2k + 1)(2k +3) 


(2k + 1)(k+1) 
(2k + 1)(2k +3) 


k+1 


= by algebra, 
(2k + 3) 


and this is the right-hand side of P(k + 1) [as was to be 
shown]. 


32. Hint 1: The general formula is 
1-44+9-164 = HODR 
=(-1 "1 +2+3+ +n) 
: ú _4yi-l2 — p1! A, in summation 
Or: ží D SECI (š; notation. 


i=1 


in expanded form 


Hint 2: In the proof, use the fact that 


n +1 
1+2+3+:°::: r= ie ™ ) 


i=l 2 
33. Hint: 
ct+(ct+d)+(ct+2d)+ ++: +(c+nd) 
7 ied Praia 
(n T Je T 2 A 


36. In the inductive step, both the inductive hypothesis and 
what is to be shown are wrong. The inductive hypoth- 
esis should be 


Suppose that for some integer k = 1, 
k(k + 1)(2k + 1) 


?4+2?+--- +P = 
6 
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And what is to be shown should be 


1+2 +--+ +(k+1) 
—&+tD(K+ I+ DQAK+D+) 
6 : 
37. Hint: See the Caution note in Section 5.1, page 262. 


38. Hint: See the subsection Proving an Equality on 
page 284 in Section 5.2. 
AO. Hint: Form the sum n° 4 (n+ D? + (n+ 2 t 
(n+ (p- 1))’, and show that it equals 
ph +2n(1+2+3+: +(p-1)) 
tA+4+9+16+ +p-1. 


SECTION 5.3 


1. Proof: Let the property P(n) be the sentence “n cents 
can be obtained by using 3-cent and 8-cent coins.” 


We will show that P(n) is true for every integer n = 14. 
Show that P(14) is true: 


Fourteen cents can be obtained by using two 3-cent 
coins and one 8-cent coin. 


Show that for every integer k = 14, if P(k) is true then 
P(k + 1) is also true: 


Suppose k is any integer with k = 14 such that k cents can 
be obtained using 3-cent and 8-cent coins. /Inductive 
hypothesis] We must show that k + 1 cents can be 
obtained using 3-cent and 8-cent coins. If the k cents in- 
cludes an 8-cent coin, replace it by three 3-cent coins to 
obtain a total of k + 1 cents. Otherwise the k cents con- 
sists of 3-cent coins exclusively, and so there must be 
least five 3-cent coins (since the total amount is at least 
14 cents). In this case, replace five of the 3-cent coins by 
two 8-cent coin to obtain a total of k + 1 cents. Thus, in 
either case, k + 1 cents can be obtained using 3-cent and 
8-cent coins. /This is what we needed to show.] 


[Since we have proved the basis step and the inductive 


step, we conclude that the given statement is true for every 
integer n = 14.] 


4. a. P(O)is «50 — 1 is divisible by 4.” P(0) is true be- 
cause 5° — 1 = 0, which is divisible by 4. 
b. P (k) is “5*— 1 is divisible by 4.” 
P (k+ 1) is “5**! — 1 is divisible by 4.” 
d. Must show: If k is any integer such that k = 0 and 
5* — 1 is divisible by 4, then 5**! — 1 is divisible by 4. 


A 


6. For each positive integer n, let P(n) be the sentence 


Any checkerboard with dimensions 2 X 3n can be com- 
pletely covered with L-shaped trominoes. 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


a. P(1) is the sentence “Any checkerboard with 
dimensions 2 X 3 can be completely covered with 
L-shaped trominoes.” The following diagram shows 
that P(1) is true: 


b. P(k) is the sentence “Any checkerboard with 
dimensions 2 X 3k can be completely covered with 
L-shaped trominoes.” 

c. P(k+ 1) is the sentence “Any checkerboard with 
dimensions 2 X 3(k+ 1) can be completely covered 
with L-shaped trominoes.” 

d. The inductive step requires showing that for every 
integer k = 1, if any checkerboard with dimensions 
2 X 3k can be completely covered with L-shaped 
trominoes, then any checkerboard with dimensions 
2 X 3(k + 1) can be completely covered with L-shaped 
trominoes. 


Proof (by mathematical induction): For the given state- 


ment, the property is the sentence “5" — 1 is divisible 
by 4.” 

Show that P(0) is true: 

P(0) is the sentence “5° _ 1 is divisible by 4.” Now 
5°—1 = 1-1 = 0, and 0 is divisible by 4 because 

0 = 4-0. Thus P(0) is true. 

Show that for every integer k = 0, if P(x) is true then 
P(k +1) is true: 

Let k be any integer with k = 0, and suppose P(x) is true. 
That is, suppose 5* — 1 is divisible by 4. [This is the induc- 
tive hypothesis.] We must show that P(k + 1) is true. That 
is, we must show that 5‘*! — 1 is divisible by 4. Now 


a Nei 
=5'(44+1)-1=5'-44+(5*-1). ®© 


By the inductive hypothesis, 5* — 1 is divisible by 4, and 
so 5‘—1 = 4r for some integer r. Substitute 4r in place 
of 5*—1 in equation (*), to obtain 


S11 = 5*4 + 4r = 4(5*+ r). 


But 5“ + r is an integer because k and r are integers. 
Hence, by definition of divisibility, 5**! — 1 is divisible 
by 4 [as was to be shown]. 


An alternative proof of the inductive step goes as follows: 


Let k be any integer with k = 0, and suppose that 5k—] 
is divisible by 4. Then 5‘ — 1 = 4r for some integer 7, 
and hence 5% = 4r+ 1. 


Tl. 


13. 


14. 


16. 


It follows that S‘*! = 5%- 5 = (4r + 1)+5 = 20r +5. 
Subtracting 1 from both sides gives that slips 
20r +4 = 4(5r+ 1). Now since 5r + 1 is an integer, by 
definition of divisibility, 5**! _ 1 is divisible by 4. 


Proof (by mathematical induction): For the given state- 
ment, the property P(n) is the sentence “37” — 1 is divis- 
ible by 8.” 


Show that P(0) is true: 


P(O) is the sentence «379 — 1 is divisible by 8.” Observe 
that 37° -1 = 1—1 = 0, and 0 is divisible by 8 because 
0 = 8-0. Thus P(O) is true. 

Show that for every integer k = 0, if P(k) is true then 
P(k +1) is true: 


Let k be any integer with k = 0, and suppose P(k) is 
true. That is, suppose 3°* — 1 is divisible by 8. [This is 
the inductive hypothesis.] We must show that P(k + 1) is 
true. That is, we must show that 3°) _ 1 is divisible 
by 8, or, equivalently, that 3°k*2 _ 1 is divisible by 8. 
Now 


gkt2_ 1 = 32.321 = 3.9] 


= 3.84 1)—-1 = 3%.8 + (3% 


1). 


By the inductive hypothesis 3°* — 1 is divisible by 8, 
and so 3% — 1 = 8r for some integer r. By substitution 
into equation (*), 


e) 


3+2 1] = 3%.8 + 8r = 83% + r). 


Now 3% + ris an integer because k and r are integers, 
and hence, by definition of divisibility, 3°k*2 _ 1 is 
divisible by 8 fas was to be shown]. 

Hint: 


x-yt— yet! 


=x (a-y) + ay) 


Hint 1: 
(k+1) 


(k+1) =K 4+3K4+3k+1-k-1 
= (K —k)+3K + 3k 
= (K — k) +3k(k + 1) 
Hint 2: k(k + 1) is a product of two consecutive integers. 


By Theorem 4.5.2, one of these must be even. 


Proof (by mathematical induction): For the given state- 
ment, let the property P(n) be the inequality 2” < (n+ 1)!. 
Show that P(2) is true: 

P(2) says that 2? < (2 + 1)!. The left-hand side is 2” = 4 
and the right-hand side is 3! = 6. So, because 4 < 6, 
P(2) is true. 


5:3 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


19. 


24. 


A-44 


Show that for every integer k = 2, if P(k) is true then 
P(k +1) is true: 


Let k be any integer with k = 2, and suppose P(A) is 
true. That is, suppose 2% < (k+ 1)!. [This is the inductive 
hypothesis.] We must show that P(k + 1) is true. That is, 
we must show that 2!*! < ((k + 1) + 1), or, equivalently, 
that 2'*! < (k+2)!. By the laws of exponents and the 
inductive hypothesis, 


tI = 9.9% < (k+ 1). (*) 
Since k = 2, then 2 < k + 2, and so 
(k+ 1)! <(kK+2)(k+ 1)! = (k+2)!. (#*) 


Combining inequalities (*) and (**) gives 
Ol < (k +2)! 
[as was to be shown]. 


Proof (by mathematical induction): For the given state- 


ment, let the property P(n) be the inequality n? < 2”. 
Show that P(5) is true: 

P(5) says that 5? < 2°. But 5° = 25 and 2° = 32, and 
25 < 32. Hence P(5) is true. 

Show that for any integer k = 5, if P(k) is true then 
P(k +1) is true: 

Let k be any integer with k = 5, and suppose P(k) is 
true. That is, suppose K < 2. [This is the inductive 
hypothesis.] We must show that P(k + 1) is true. That is, 
we must show that (k + 1 < 2**! Now 


(k+1) =K +2k+1<2%+2k+1 
by inductive hypothesis. 


Also, by Proposition 5.3.2, 
2k+1 <2 Prop. 5.3.2 applies since k = 5 = 3. 


Putting these inequalities together gives 


(k+1)? < 2% 4+2k4+1< 242% = ok! 
[as was to be shown]. 


Proof (by mathematical induction): For the given state- 


ment, let the property P(n) be the equation a, = 3- ces 
Show that P(1) is true: 

The left-hand side of P(1) is a,, which equals 3 by 
definition of the sequence. The right-hand side is 
3-7"! = 3 also. Thus P(1) is true. 

Show that for every integer k = 1, if P(k) is true then 
P(k +1) is true: 

Let k be any integer with k = 1, and suppose P(A) is 
true. That is, suppose a, = 3-7! [This is the inductive 
hypothesis.] We must show that P(k + 1) is true. That is, 
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25 


29 


we must show that a,,; = 3° oO" or, equivalently, 
A+, = 3 -7* But the left-hand side of P(k + 1) is 


ak+1 = Tay by definition of the sequence a), a2, a3, ... 
= 783- gt by inductive hypothesis 
= 3-78 by the laws of exponents, 


and this is the right-hand side of P(k + 1) [as was to 
be shown]. 


Proof (by mathematical induction): According to the 
definition of bo, by, bo, .. 
b, = 4+ b,_, for every integer k = 1. Let the property 
P(n) be the inequality 


., we have that bọ = 5 and 


b, > 4n. 


We will prove that P(n) is true for each integer n = 0. 
Show that P(0) is true: To show that P(0) is true we 
must show that bọ > 4-0. But 4-0 = 0, bọ = 5 (by defi- 
nition of bo, b4, bo, ...), and 5 > 0. So P(O) is true. 
Show that for every integer k = 0, if P(k) is true then 
P(k +1) is true: Let k be any integer with k = 0, and 


suppose that 
by > 4k. 


< inductive hypothesis 


We must show that 
Dy > A(K+ 1). 


Now 
bey = 44+ by by definition of bo, b, bo, ... 
>4+4k because b, > 4k by inductive 
hypothesis 
>41 +k) by factoring out a 4 
> 4(k+1) by the commutative law of 


addition 
[as was to be shown]. 


Proof (by mathematical induction): 


A set L consists of strings obtained by juxtaposing one 
or more of abb, bab, and bba. Let the property P(n) 

be the sentence “Tf a string s in L has length 3n, then s 
contains an even number of b’s.” 


Show that P(1) is true: P(1) is the statement that a 
string s in L of length 3 contains an even number of b’s. 
The only strings in L that have length 3 are abb, bab, 
and bba, and each of these strings has an even number 
of b’s. So P(1) is true. 


Show that for every integer k = 1, if P(k) is true then 
P(k + 1) is true: Let k be any integer with k = 1 and 
suppose that 


if a string s in L has length 3k, then s contains an 


even number of b’s. <— P(k) inductive hypothesis 
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34. 


35. 


37. 


38. 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


We must show that 


if a string s in L has length 3(k + 1), then s contains 
an even number of b’s. <— P(k+1) 


So, suppose s is a string in L that has length 3(k + 1). 
Now 3(k + 1) = 3k +3 and the strings in L are obtained 
by juxtaposing strings already in L with one of abb, 
bab, or bba. Thus, either the initial or the final three 
characters in s are abb, bab, or bba. Moreover, the other 
3k characters in s are also in L by definition of L, and 
so, by inductive hypothesis, the other 3k characters in s 
contain an even number, say m, of b’s. Because each of 
abb, bab, and bba contains 2 b’s, the total number of b’s 
in s is m+ 2, which is a sum of even integers and hence 
is even [as was to be shown]. 


Hint: Consider the problem of trying to cover a 3 X 3 
checkerboard with trominoes. Place a checkmark in 
certain squares as shown in the following figure. 


vV v 


vV v 


Observe that no two squares containing checkmarks 

can be covered by the same tromino. Since there are 

four checkmarks, four trominoes would be needed to 
cover these squares. But, since each tromino cov- 

ers three squares, four trominoes would cover twelve 
squares, not the nine squares in this checkerboard. It 
follows that such a covering is impossible. 


a. Hint: For the inductive step, note that a2 X 3(k+ 1) 
checkerboard can be split into a 2 X 3k checker- 
board and a 2 X 3 checkerboard. 


b. Hint: Consider a 3 X 5 checkerboard, and refer to 
the hint for exercise 32. Figure out a way to place 
six checkmarks in squares so that no two of the 
squares that contain checkmarks can be covered by 
the same tromino. 


Hint: Use proof by contradiction. If the statement 

is false, then there exists some ordering of the 
integers from 1 to 30, say, x), X2, .. ., X39, such 

that x; + x2 + x3 < 45, X2 + x3 + x4 < 45, ..., and 

X39 + xı +X < 45. Evaluate the sum of all these inequali- 
ties using the fact that ae x; = Se i and Theorem 5.2.1. 


Hint: Given k+ 1 a’s and k + 1 b’s arrayed around the 
outside of the circle, there has to be at least one loca- 
tion where an a is followed by a b as one travels in the 
clockwise direction. In the inductive step, temporarily 
remove such an a and the b that follows it, and apply the 
inductive hypothesis. 


40. 


41. 


43. 


44, 


45. 


b. Hint: In the inductive step, imagine dividing a 
2(k +1) X 2(k+ 1) checkerboard into two sections: 
a center checkerboard of dimensions 2k X 2k and 
an outer perimeter of single, adjacent squares. Then 
examine three cases: case | is where both removed 
squares are in the central 2k X 2k checkerboard, 
case 2 is where one removed square is in the central 
2k X 2k checkerboard and the other is on the perim- 
eter, and case 3 is where both removed squares are 
on the perimeter. 


Hint: Let P(n) be the sentence: If (1) 2n + 1 people are 
all positioned so that the distance between any two 
people is different from the distance between any two 
other people, and if (2) each person sends a message to 
their nearest neighbor, then there is at least one person 
who does not receive a message from anyone. Use 
mathematical induction to prove that P(n) is true for 
each integer n = 1. 


a. Hint: 


Two Balls 


WW > B 


WB > W 


BB > B 


b. Hint: In all three cases when the urn initially 
contains an odd number of white balls, there is one 
white ball in the urn at the end of the game, and 
when the urn initially contains an even number of 
white balls, there is one black ball (i.e., zero white 
balls) in the urn at the end of the game. 


Hint: Given a graph G satisfying the given condition, 
form a new graph G” by deleting one vertex v of G 
and all the edges that are incident on v. Then apply the 
inductive hypothesis to G”. 


The inductive step fails for going from n = 1 ton = 2, 
because when k = 1, 


A= {apm} and B= {a} 

and no set C can be defined to have the properties 
claimed for the C in the proof. The reason is that 

C = {a,} = B, and so an element of A, namely ay, is 
not in either B or C. 


Since the inductive step fails for going from n = 1 to 

n = 2, the truth of the following statement is never 
proved: “All the numbers in a set of two numbers are 
equal to each other.” This breaks the sequence of induc- 
tive steps, and so none of the statements for n > 2 is 
proved true either. 


5.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES 
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Here is an explanation for what happens in terms of the 
domino analogy. The first domino is tipped backward 
(the basis step is proved). Also, if any domino from the 
second onward tips backward (the inductive step works 
or n = 2). In this case, however, when the first domino 
is tipped backward, it does not tip the second domino 
backward. So only the first domino falls down; the rest 
remain standing. 


46. Hint: Is the basis step true? 


SECTION 5.4 


1. 


Proof (by strong mathematical induction): Let the prop- 
erty P(n) be the sentence “a,, is odd.” 


Show that P(1) and P(2) are true: 


Observe that a; = 1 and a) = 3 and both | and 3 
are odd. 


Thus P(1) and P(2) are true. 


Show that for every integer k = 2, if P) is true for 
each integer i with 1 Si Sk, then P(k + 1) is true: 


Let k be any integer with k = 2, and suppose a; is odd 
for each integer i with 1 =i < k. [This is the inductive 
hypothesis.] We must show that a; is odd. We know 
that ay4, = ay) + 2q by definition of a1, a2, a3, .... 
Moreover, k — 1 is less than k + 1 and is greater than or 
equal to 1 (because k = 2). Thus, by inductive hypoth- 
esis, az- is odd. Also, every term of the sequence is an 
integer (being a sum of products of integers), and so 2a, 
is even by definition of even. It follows that ap, is the 
sum of an odd integer and an even integer and hence is 
odd by Theorem 4.1.2 (exercise 30, Section 4.1). [This is 
what was to be shown.] 


Proof (by strong mathematical induction): Let the prop- 


erty P(n) be the inequality d, = 1. 
Show that P(1) and P(2) are true: 


10 
Observe that d} = = and d = IT and both <1 


10 
and 77 = 1. Thus P(1) and P(2) are true. 


Show that for every integer k = 2, if P(i) is true for 
each integer i with 1 Si Sk, then P(k +1) is true: 


Let k be any integer with k = 2, and suppose d; = 1 

for each integer i with 1 =i =k. [This is the induc- 

tive hypothesis.] We must show that d,4, = 1. Now, by 
definition of d4, dz, d3, . . ., dk = dy dy_,. Moreover 

d, = 1 and d,_, = 1 by inductive hypothesis because 
both k — 1 and k are less than or equal to k. Consequent- 
ly, dys, = dy: dy_; = 1 because if two positive numbers 
are each less than or equal to 1, then their product is 
less than or equal to 1. [To see why this is so, note that 
if0<ax1land0<b S11, then multiplying a = 1 by 
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10. 


Tl. 


b gives ab S b, and since b = 1, then, by transitivity of 
order, ab = 1.] Thus the inductive step has been proved. 
[Since we have proved both the basis step and the inductive 
step, we conclude that d, = 1 for every integer n = 1.] 


Proof (by strong mathematical induction): Let the prop- 
erty P(n) be the equation e, = 5-3" + 7-2”. 

Show that P(0) and P(A) are true: 

We must show that ey = 5-3°+7-2° and 

e; = 5:3'+7-2'. The left-hand side of the first 
equation is 12 (by definition of eo, e1, e2, ...), and 
its right-hand side is 5-1+7-1 = 12 also. The 
left-hand side of the second equation is 29 (by 
definition of eo, e1, é2, ...), and its right-hand side is 
5:3+7-2 = 29 also. Thus P(0) and P(1) are true. 
Show that for every integer k = 1, if P(i) is true for 
each integer i with 0 Si Sk, then P(k + 1) is true: 
Let k be any integer with k = 1, and suppose 

e; = 5°3'+7-2' for each integer i withO < i < k. 
[Inductive hypothesis] We must show that 

eppi = 5:3147 2t 


Now 


ek+1 = 5e — Oez] by definition of eg, €j, €z, .... 


= 5(5-3t + 7:25- 65-3" 1 +7-2*-1) 
by inductive hypothesis 


= 25-3" + 35-2% 39-341 42.2%! 

= 25-3 + 35-2" — 10:3-37! — 21:2-24! 
= 25-3" + 35-2" — 10-3*— 21-2" 

= (25 — 10)-3* + (35—21)-2* 

= 15-3% + 14-2" 

= 5-3-3 47-2-24 

= 5-301 4724+! by algebra 


[as was to be shown]. 


Hint: In the basis step, show that P(14), P(5), 
and P(16) are all true. For the inductive step, note 
thatk+1 = [(k+1)—3] +3, and if k = 16, then 
(k+1)-3 = 14. 


Proof (by strong mathematical induction): Let the prop- 
erty P(n) be the sentence 


A jigsaw puzzle consisting of n pieces takes 
n— | steps to put together. 


Show that P(A) is true: 


A jigsaw puzzle consisting of just one piece does not 
take any steps to put together. Hence it is correct to say 
that it takes zero steps to put together. 
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12. 


13. 


14. 


16. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Show that for every integer k = 1, if P(i) is true for 
each integer i with 1 Si Sk, then P(k +1) is true: 


Let k be any integer with k = 1 and suppose that for 
each integer i with 1 =i =k, a jigsaw puzzle consisting 
of i pieces takes i— 1 steps to put together. [This is the 
inductive hypothesis.] We must show that a jigsaw puz- 
zle consisting of k + 1 pieces takes k steps to put togeth- 
er. Consider assembling a jigsaw puzzle consisting of 

k +1 pieces. The last step involves fitting together two 
blocks. Suppose one of the blocks consists of r pieces 
and the other consists of s pieces. Then r+s = k+1, 
and 1 =r=kand1=s<k. Thus, by the inductive 
hypothesis, the numbers of steps required to assemble 
the blocks are r— 1 and s — 1, respectively. Then the 
total number of steps required to assemble the puzzle is 
(r-1)+(s-1I)+1=(r+s)-1L=(kK+)D-1=k 


[as was to be shown]. 


Hint: For any collection of cans, at least one must 
contain enough gasoline to enable the car to get to the 
next can. (Why?) Imagine taking all the gasoline from 
that can and pouring it into the can that immediately 
precedes it in the direction of travel around the track. 


Sketch of proof: Given any integer k > 1, either k is 
prime or k is a product of two smaller positive integers, 
each greater than 1. In the former case, the property is 
true. In the latter case, the inductive hypothesis ensures 
that both factors of k are products of primes and hence 
that k is also a product of primes. 


Proof (by strong mathematical induction): Let the prop- 
erty P(n) be the sentence “Any product of n odd integers 
is odd.” 


Show that P(2) is true: 


We must show that any product of two odd integers is odd. 
But this was established in exercise 20 of Section 4.2. 


Show that for every integer k = 2, if P(i) is true for 
each integer i with 2 Si =k then P(k +1) is true: 


Let k be any integer with k = 2, and suppose that for 
each integer i with 2 < i < k, any product of i odd 
integers is odd. [Inductive hypothesis] Consider any 
product M of k + 1 odd integers. Some multiplication 
is the final one that is used to obtain M. Thus, there 
are integers A and B such that M = AB, and each of A 
and B is a product of between 1 and k odd integers. (For 
instance, if M = ((a;a7)a3)a4, then A = (a;d7)a3 and 

B = a4.) By inductive hypothesis, each of A and B is 
odd, and, as in the basis step, we know that any product 
of two odd integers is odd. Hence M = AB is odd. 


Hint: Let the property P(n) be the sentence “If n is 
even, then any sum of n odd integers is even, and if 


17. 


19. 


n is odd, then any sum of n odd integers is odd.” For 
the inductive step, consider any sum S of k+ 1 odd 
integers. Some addition is the final one that is used to 
obtain S. Thus, there are integers A and B such that 

S = A + B, and A is a sum of r odd integers and B is a 
sum of (k + 1) — r odd integers. Consider the two cases 
where k + 1 is even and k + 1 is odd, and for each case 
consider the two subcases where r is even and where r 
is odd. 


4! = 4, 4? = 16,4? = 64, 4* = 256, 4° 
4° = 4096, 4’ = 16384, and 4° = 65536. 
Conjecture: The units digit of 4” equals 4 if n is odd 
and equals 6 if n is even. 


1024, 


Proof by strong mathematical induction: Let the prop- 
erty P(n) be the sentence “The units digit of 4” equals 4 
if n is odd and equals 6 if n is even.” 


Show that P(1) and P(2) are true: 


When n = 1, 4” = 41 = 4, and so the units digit is 4. 
When n = 2, then 4” = 4° = 16, and so the units digit 
is 6. Thus, P(1) and P(2) are true. 


Show that for every integer k = 2, if the property is 
true for each integer i with 1 Si Sk then it is true 
fork +1: 


Let k by any integer with k = 2, and suppose that for 
each integer i with 0 <i = k, the units digit of 4’ equals 
4 if i is odd and equals 6 if i is even. [Inductive hypoth- 
esis] We must show that the units digit of 4«*! equals 4 
if k+ 1 is odd and equals 6 if k + 1 is even. 


Case 1 (k + 1 is odd): In this case, k is even, and 

so, by inductive hypothesis, the units digits of 4* 

is 6. Thus 4° = 10g + 6 for some nonnegative inte- 
ger q. It follows that 4**! = 4*-4 = (10g+6)-4 = 
40q + 24 = 10(4q +2) + 4. Thus, the units digit of 
41 is 4, 

Case 2 (k +1 is even): In this case, k is odd, and 

so, by inductive hypothesis, the units digit of 4“ is 

4. Thus 4* = 10g + 4 for some nonnegative inte- 

ger q. It follows that 4**! = 4*-4 = (10g+4)-4 = 
40q + 16 = 10(4g + 1) + 6. Thus, the units digit of 
4 is 6. 

Conclusion: Because cases | and 2 are the only pos- 
sibilities and 4**! has one of the required forms in each 
case, we have shown that P(k + 1) is true. 


Proof (by strong mathematical induction): Let a), a>, 
a3, ... be a sequence that satisfies the recurrence relation 


a = 2+ ayn, for every integer k = 2, with initial condi- 
tion a, = 1, and let the property P(n) be the inequality 


a, =N. <— P(n) 
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22. 
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We will show that P(n) is true for each integer n = 1. 
Show that P(A) is true: a, = 1 and 1 = 1. So P(1) 

is true. 

Show that for every integer k = 1, if P(i) is true for 
each integer i from 1 through k, then P(k + 1) is true: 
Let k be any integer with k = 1, and suppose that 


a; = i for each integer i with 
1sisk. < inductive hypothesis 
We must show that 


Ag+] <k+l1. 
Now 


ak+1 = 2° AK + 1)/2 
by definition of aj, a2, a3, ... 


= 2-\(k+1)/2| 
< on if k is odd 
2:(k/2) if kis even 
by inductive hypothesis 
= fk+1 if kis odd 
k if kis even 
< k+l because both k = k+ 1 and 
k+1Sk+1. 


Thus a+; = k+ 1 [as was to be shown]. 


Proof (by strong mathematical induction): Let P(n) be 
the sentence 


In this version of NIM, if both piles initially 
contain objects, the player who goes second can 
always win. 


We will prove that P(n) is true for every integer n = 0. 
Show that P(0) is true: 

If neither pile contains any objects, the player who goes 
first automatically loses because of not being able to 


make a move. So the second player wins the game by 
default. Thus P(0) is true. 


Show that for every integer k = 0, if P(i) is true for 
each integer i from 1 through k, then P(k + 1) is true: 


Let k be any integer with k = 0, and suppose: 
In this version of NIM, for every integer i with 


0 < i < k, if both piles initially contain i objects, 
the player who goes second can always win. 


We must show that 


In this version of NIM, if both piles initially con- 
tain k+ 1 objects, the player who goes second can 
always win. <—PK+1) 
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So suppose both piles contain k + 1 objects, where 
0 =i < k, and suppose the first player removes r ob- 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


prime is false.] Hence p is prime, and we have shown the 
existence of a prime number that divides n. 


jects from pile #1, where 1 = r < 3. If the second player 
removes r objects from pile #2, then both piles will 
have the same number of objects, namely (k + 1) —r 
and (k+ 1)— r =k because r = 1. Thus, by inductive 
hypothesis, the second player can win. 


28. a. Proof: Suppose r is any rational number. [We need 
to show that there is an integer n such that r < n.] 
Case 1 (r = 0): In this case, take n = 1. Then 
r<n. 
Case 2 (r > 0): In this case, r = z for some posi- 


23. a. tive integers a and b (by definition of rational and 
3 zu 7 307 z1 because r is positive). Note that r = A < nif, and 
1 2 4 3 1-2+4-3 14 only if, a < nb. Letn = 2a. Multiply both sides of 
i 2 2-1 2 1-14+2-2+1-2]| 7 the inequality 1 < 2 by a to obtain a < 2a, and 
H}ijiil A Ee ia a el A EE multiply both sides of the inequality 1 < b by 2a to 


TOTAL 45 obtain 2a < 2ab = nb. Thus a < 2a < nb, and so, 


24. a. The results are shown both diagrammatically and in by transitivity of order, a < nb. Dividing both sides 


a table. by b gives that F < n, or, equivalently, that r < n. 
Initial State After the Ist Round After the 2nd Round Hence, in both cases, r < n [as was to be shown]. 
16 1 2 1 1 29. Hint: If r is any rational number, let S be the set of all 
15 3 : i 
i4 3 integers n such that r < n. Use the results of exercises 
b z i s. de 3 28(a), 28(c), and the well-ordering principle for the inte- 
i2 A gers to show that S has a least element, say v, and then 
1i 7 show that v~- 1 Sr <v. 
109 8 9 9 


30. Proof: Let S be the set of all integers r such that 


After the 3rd Round i : 5 
n = 2'-r for some integer i. Then n E S because 


After the 4th Round 


1 1 ; ; 
n= 2-n, and so S # Ø. Also, since n = 1, each rin 


S is positive, and so, by the well-ordering principle, S 

has a least element m. This means that n = 2*-m (*) for 
some nonnegative integer k, and m = r for every rin S. 
We claim that m is odd. The reason is that if m is even, 
then m = 2p for some integer p. Substituting into equa- 


Round Eliminated Remaining tion (*) gives 
1 2, 4, 6, 8, 10, 12, 1,3, 5,7,9, n = 2m = %-2p = (2*-2)p = Wp. 
14, 16 11, 13 It follows that p E S and p < m, which contradicts the 
3; 711,15 1,5, 9, 13 fact that m is the least element of S. Hence m is odd, 
5,13 1,9 and so n = m-2* for some odd integer m and nonnega- 
4 9 1 tive integer k. 


34. Hint: In the inductive step, divide into cases depending 


26. Proof: Let n be any integer greater than 1. Consider the 


set S of all positive integers other than 1 that divide n. Oni whether k Gani De wetten ag ene Os 


; : : k = 3x + 2 for some integer x. 
Since n | n and n > 1, there is at least one element in S. 3 8 


Hence, by the well-ordering principle for the integers, 35. Hint: In the inductive step, let an integer k = 0 be given 
S has a smallest element; call it p. We claim that p is 
prime. For suppose p is not prime. Then there are inte- 
gers a and b with 1 <a < p, 1 < b < p, and p = ab. 
By definition of divides, a | p. Also p | n because p is in 
S and every element in S divides n. Therefore, a | p 

and p | n, and so, by transitivity of divisibility, 


and suppose that there exist integers q’ and r’ such that 
k = dq' +r and0 =r’ < d. You must show that there 
exist integers q and r such that 
k+1=dq+rand0sr<d. 

To do this, consider the two cases r’ < d—1 and 

a | n. Consequently, a E S. But this contradicts the fact r=d-l. 
that a < p, and p is the smallest element of S. [This 
contradiction shows that the supposition that p is not 


36. Hint: Given a predicate P(n) that satisfies conditions 
(1) and (2) of the principle of mathematical induction, 
let S be the set of all integers greater than or equal to a 
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for which P(n) is false. Suppose that S has one or more 
elements, and use the well-ordering principle for the 
integers to derive a contradiction. 


37. Hint: Suppose S is a set containing one or more inte- 


gers, all of which are greater than or equal to some inte- 
ger a, and suppose that S does not have a least element. 
Let the property P(n) be the sentence “i € S for any 
integer i with a = i = n.” Use mathematical induction 
to prove that P(n) is true for every integer n = a, and 
explain how this result contradicts the supposition that 
S does not have a least element. 


SECTION 5.5 


1. 


Proof: Suppose the predicate m +n = 100 is true before 
entry to the loop. Then 


Mold F Noid = 100. 


After execution of the loop, 


Mew = Mold + 1 and Mew = Mold 7 l, 


So 


Mew Mew 7 (Mota 1) (Nota 1) 

= Moat Nog = 100. 
Proof: Suppose the predicate m > r is true before 
entry to the loop. Then 

3 2 

Mid > Nola 
After execution of the loop, 
and 


Mew = 3° Mola New = 5 "Notas 


so 


3 


3 3 3 2 
Mew 7 (3 ‘Moa) aa 27: Ma >27 “Nold- 


A l 
Now since new = S*Noig, then nod = 37new Hence 


3 2 1 a 1 2 
Miew > 27° Nog = 27° g”new = 27:5” new 


27 3 2 
= ew = news 


25 


Proof: [The wording of this proof is almost the same as 
that of Example 5.5.2.] 
I. Basis Property: /1(0) is true before the first iteration 

of the loop.] 
1(0) is “exp = x° and i = 0.” According to the pre- 
condition, before the first iteration of the loop exp = 1 
and i = 0. Since x° = 1, (0) is evidently true. 

Il. Inductive Property: /If G A^ K(k) is true before a 
loop iteration (where k = 0), then (k + 1) is true 
after the loop iteration.] 


A-50 


Suppose k is any nonnegative integer such that 

G ^ K(k) is true before an iteration of the loop. 
Then as execution reaches the top of the loop, 

i #m, exp = x, and i = k. Since i # m, the guard 
is passed and statement 1 is executed. Now before 
execution of statement 1, 


— k 
eXPod = X, 
so execution of statement 1 has the following effect: 
= k, kt 
eXPnew — EXPoa X = XX = X . 
Similarly, before statement 2 is executed, 
lold = k, 
so after execution of statement 2, 
liew = iati = k+1 


Hence after the loop iteration, the two statements 
exp = x+; andi = k+ 1 are true, and so Z(k + 1) is 
true. 

Ill. Eventual Falsity of Guard: [After a finite number of 
iterations of the loop, G becomes false.] 


The guard G is the condition i # m, and m is a non- 
negative integer. By I and II, it is known that 


for every integer n = 0, if the loop is 
iterated n times, then exp = x" andi = n. 


So after m iterations of the loop, i = m. Thus 
G becomes false after m iterations of the loop. 

. Correctness of the Post-Condition: /If N is the 
least number of iterations after which G is false and 
I(N) is true, then the value of the algorithm variables 
will be as specified in the post-condition of the loop.] 
According to the post-condition, the value of exp 
after execution of the loop should be x”. But when 
G is false, i = m. And when /(N) is true, i = N and 
exp = x. Since both conditions (G false and I(N) 
true) are satisfied, m = i = N and exp = x", as 
required. 


Proof: 

l. Basis Property: /(0) is “i = 1 and sum = A[1].” 
According to the pre-condition, this statement is true. 

Il. Inductive Property: Suppose k is a non- 

negative integer such that G ^ I(k) is true be- 

fore an iteration of the loop. Then as execution 

reaches the top of the loop, i # m, i = k+ 1, 

and sum = A[1]+A[2]+ --- +A[k+ 1]. Since 

i # m, the guard is passed and statement 1 

is executed. Now before execution of state- 


ment 1, isa = k+ 1. So after execution of state- 
ment l, iew = fog +1 = (K+1)+1=k4+2. 
Also before statement 2 is executed, 

suMoq = ALL] + A[2]+ -+ + A[k+ 1]. 
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Execution of statement 2 adds A[k + 2] to this 
sum, and so after statement 2 is executed, 

SUMpey = AL1]+A[2]+ --- + A[A+1]+ A[k+ 2]. 
Thus after the loop iteration, /(k + 1) is true. 

Ill. Eventual Falsity of Guard: The guard G is the 
condition i # m. By I and II, it is known that for 
every integer n = 1, after n iterations of the loop, 
(n) is true. Hence, after m — 1 iterations of the loop, 
I(m) is true, which implies that i = m and G is false. 

IV. Correctness of the Post-Condition: Suppose 
that N is the least number of iterations after 
which G is false and J(N) is true. Then (since G 
is false) i = m and (since J(N) is true) i = N+ 1 
and sum = A[1]+ A[2]+ --- +A[N+ 1]. Put- 
ting these together gives m = N+ 1, and so 
sum = A[1]+A[2]+ --- +Alm], which is the 
post-condition. 


10. Hint: Assume G ^ I(k) is true for a nonnegative integer 


k. Then agg # O and boia # O and 

(1) doa and boag are nonnegative 
gcd(doia, Dora) = gcd(A, B). 

(2) At most one of aoia and boja equals 0. 

(3) OS asat dog = A+B-k. 

It must be shown that /(k + 1) is true after the loop 

iteration. That means it is necessary to show that 

(1) Gpew and bnew are nonnegative integers with 
gcd(Ayews Bnew) = ged(A, B). 

(2) At most one of anew and by. equals 0. 

(3) OS dyew + Drew SAt+B-(k+ 1). 

To show (3), observe that 


Anew + brew T | 


integers with 


Goa Porat Poa IEdog = boa 


boa doa Foa tora < Bora 


[The reason for this is that when asiq = boia, then 
anew = Gold 7 boia and Drew = bow and when old < Dola 
then b đa 


if aoia = Dota 


new T boia ~ Gold and anew = 


_ } old 
new + Drew = | 


a ; i 
boa ifaoa < boia 


Now since asa # 0 and baa # 0, and since aog and 

boia are nonnegative integers, then dog = 1 and 

baa = 1. Hence, asa — 1 = 0 and baa — 1 = 0, and so 
dold = Agia + bora — 1 and boa S Bboid + oa — 1. It follows 
that iew + anew = Goia + Bog — 1 S (A+ B-k)— 1 by 
noting that (3) is true when going into the kth itera- 
tion. Thus, anew + bnew < A + B — (k + 1) by algebraic 
simplification. 


SECTION 5.6 


1. 


a = l,a = 2a +2 =2:1+2=4, 
a3 = 2a, +3 = 2-443 = 11, 
a4, = 2a; +4 = 2:11 +4 = 26 
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15. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


eo = 1c = l(c = 1-0) = 1, 

ey = 2c) =2 (1° = 2, 

C3 = 3(cy)” = 3- (27 = 12 

So = 1,5, = 1, 59 = sı +259 = 142-1 = 3, 


$3 = 89 +25; =34+2-1=5 


1, u = 3u — u; = 3-1-1 =2, 


1=7 


u = l,m = 
W = 4:2 


u4 = 4u3 

By definition of ag, a), a2,..., for each integer k = 1, 
(*) a, = 3k+1~ and 

Cc) ay, = 3(k- 1) +1. 


Then ap; +3 


= 3(k—1)+1+3 by substitution from (**) 
= 3k-3+1+3 

= 3k+1 by basic algebra 

=a by substitution from (*). 


By definition of co, C1, C2,..-, Cn = 2" — 1, for each inte- 
ger n = 0. Substitute k and k — 1 in place of n to get 


(*) Ch = 2*—1 and 
C) cy = 21-1 
for every integer k = 1. Then 


2c,- +1 = 2(2* 1-1) 41 


by substitution from (**) 


=2-24+1 
= oF] by basic algebra 
CR by substitution from (*). 


By definition of to, ti, to, ..., t = 2 +n, for each integer 
n= 0. Substitute k, k — 1, and k— 2 in place of n to get 


(*) = 2+k, 
(*) tı =2+(k-1), and 
ce) th = 2+ (k—2) 
for each integer k = 2. Then 
2th — tk-2 
= 222+(k—-1))-(2+(k—2)) 
by substitution from (**) and (***) 
=2(k+1)—-k 
=2+k by basic algebra 
= tk by substitution from (*). 


Hint: Mathematical induction is not needed for the 
proof. Start with the right-hand side of the equation and 


use algebra to transform it into the left-hand side of the 
equation. 
17. a. a, =2 


a = 2 (moves to move the top disk from pole A to 
pole C) 


+ 1 (move to move the bottom disk from 
pole A to pole B) 


+ 2 (moves to move top disk from pole C 
to pole A) 


+ 1 (move to move the bottom disk from 
pole B to pole C) 


+2 (moves to move top disk from pole A 
to pole C) 


=8 
az = 8+14+8+1+4+8 = 26 
c. For every integer k = 2, 


ay = G,—1 (moves to move the top k — 1 disks from 
pole A to pole C) 


+ 1 (move to move the bottom disk from 
pole A to pole B) 
+ a,—, (moves to move the top disk 
from pole C to pole A) 


+ 1 (move to move the bot- 
tom disks from pole B 


to pole C) 

+ a, — 1 (moves to move 
the top disks 
from pole A to 
pole C) 

= 3ak—1 +2. 
18. b. b, = 40 


e. Hint: One solution is to use mathematical induc- 
tion and apply the formula from part (c). Another 
solution is to prove by mathematical induction that 
when a most efficient transfer of n disks from one 
end pole to the other end pole is performed, at some 
point all the disks are on the middle pole. 


19. a. s =1,5)=1+1+1=3, 
s, =s +(1+1+1)+s;=5 
b. Sq = Sq (d 1+ 1) 5, =9 


20. b. Call the poles A, B, and C. Compute c3 by using the 
following sequence of steps to transfer two disks 


from A to B: 


1 (move to transfer the top disk from A to B) 
+ 1 (move to transfer the top disk from 
BtoC) 
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21. 
22. 


23. 


25. 


26. 


27. 


32. 
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+ 1 (move to transfer the bottom disk from A to B) 

+ 1 (move to transfer the top disk from C to A) 

+ 1 (move to transfer the top disk from A to B). 
This sequence of steps is the least possible, and so 
Cy = 5. 

A tower of 3 disks can be transferred from A to B 
by using the following sequence of steps: 


1 (move to transfer the top disk from A to B) 

+ 1 (move to transfer the top disk from B to C) 

+ 1 (move to transfer the middle disk from A to B) 
+ 1 (move to transfer the top disk from C to A) 

+ 1 (move to transfer the middle disk from B to C) 
+ 1 (move to transfer the top disk from A to B) 

+ 1 (move to transfer the top disk from B to C). 


After these 7 steps have been completed, the bottom 
disk can be transferred from A to B. At that point 
the top two disks are on C, and a modified version 
of the initial seven steps can be used to transfer 
them from C to B. Thus the total number of steps is 
74+1+7 = 15,and 15 < 21 = 4c + 1. 


b. R= 14 
b. r= 1, =1,m=14+4:1=5,7;=5+4-1=9, 
r4 =9+4:5 = 29, r; = 29+4-9 = 65, 


r6 = 65+4-29 = 181 


c. There are 904 rabbit pairs, or 1,808 rabbits, after 
12 months. 


a. Each term of the Fibonacci sequence beyond 
the second equals the sum of the previous two. 
For any integer k = 1, the two terms previous 
to F+ are F, and F,_;. Hence, for every integer 
k = 1, Fey = Fg + Fy. 


By repeated use of definition of the Fibonacci sequence, 
for each integer k = 4, 


Fp = Fp-1 + Pea = (Fk-2 + Fy) + (Fag + F-a) 
= ((Fy-3 + Fk-4) + Fy—3) + (Fk-3 + Fea) 
= 3F;—-3 +2 + Fra. 


For each integer k = 1, 
Fi— Fi- 

= (Fy — Fp- XF + Fg-1) by basic algebra (difference 
of two squares) 


by definition of the 
Fibonacci sequence. 


= (Fk — Fy) Fett 
= FP ao FF eet 


Hint: Use mathematical induction. In the inductive step, 
use Lemma 4.10.2 and the fact that F,4. = Fy, + Fi, to 
deduce that 


gcd(Fk+2, Pepi) = ged(Fk+1, Fy). 
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š P Fri 
34. Hint: Let L = lim 


35. 


37. 


39. 


41 


and show that L = it il. 
Deduce that L = : Ne 


Hint: Use the result of exercise 30 to prove that the 


Py 

Fp 
F, F; Fs 

that the infinite sequence F Fy Fp Fe 


n : Fo F, ; : : 
infinite sequence Fp Fe . 1s strictly decreasing and 


. is strictly increas- 


ing. The first sequence is bounded below by 0, and the 
second sequence is bounded above by 1. Deduce that 
the limits of both sequences exist, and show that they 
are equal. 


a. Because the 4% annual interest is compounded 
quarterly, the quarterly interest rate is (4%)/4 = 1%. 
Then Ry = Ry} + 0.01 R} =“ 1.01R;—1. 

b. Because one year equals four quarters, the amount 
on deposit at the end of one year is Ry = $5,203.02 
(rounded to the nearest cent). 

c. The annual percentage yield (APY) for the account 


. $5203.02 — $5000.00 
is $5000.00 ~ 4.0604%. 


When one is climbing a staircase consisting of n stairs, 
the last step taken is either a single stair or two stairs 
together. The number of ways to climb the staircase and 
have the final step be a single stair is c,_,; the number 
of ways to climb the staircase and have the final step 

1 +C,—2. Note 
also that c} = 1 and c, = 2 [because either the two 


be two stairs is c, >. Therefore, c, = c,— 


stairs can be climbed one by one or they can be climbed 
as a unit]. 


Proof (by mathematical induction); Let the property, 
P(n), be the equation x 14; i=1 Gj, Where a), ao, 
da, and c are any real numbers. 


Show that P(A) is true: 


Let a; and c be any real numbers. By the recursive 


A3, s.. 


definition of sum, > (ca) = ca, and È ð =q]. 
Therefore, Dj (cai) = c> a and so P(1) is true. 
Show that for every integer k = 1, if P(k)is true, 
then P(k + 1)is true: 


Let k be any integer with k = 1. Suppose that 
for any real numbers a4, a2, 43, ..., a, and c, 
ay (ca;) = c> a [This is the inductive hypothesis.] 


[We must show that for any real numbers aj, ap, a3, ..., 
k+l 


ay.+, and c, Ditea) = = cD 14i] 


Let a), a2, 43, ..., Ag+] and c be any real numbers. Then 


k 
= X cait cas 
i=1 
k 
=¢> a+ 
i=] 


by the recursive 
definition of È 


Cak+1 by inductive hypothesis 
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by the distributive law 
for the real numbers 


ll 
io) 
aa 
M> 
s 
+ 
a 
> 
+ 
en 


by the recursive 
definition of È. 


44. Hint: Let the property be the inequality 


45 


n 


> 


i=1 


$a aj. 


To prove the inductive step, note that because 
Etla] = = ie 


inequality for absolute value e (Theorem 4.5.6) to deduce 


k k 
(Ea; +F aial = BA 


We give two proofs for the given statement, one less 
formal and the other more formal. 


Proof 1 (by mathematical induction): For the basis 

step observe that any “sum” of one even integer is the 
integer itself, which is even. For the inductive step 

we suppose that for an arbitrarily chosen even integer 
r = 1., the sum of any r even integers is even. Then we 
must show that any sum of r + 1 even integers is even. 
But any sum of r + 1 even integers is equal to a sum of 
r even integers, which is even (by inductive hypothesis), 
plus another even integer. The result is a sum of two 
even integers, which is even (by Theorem 4.1.1) /as was 
to be shown]. 


Proof 2 (by mathematical induction): Let P(n) be the 
sentence: 


If ay, A2, AZ, «+65 
n 

X, a;is even. 
i=l 


We will show that P(n) is true for every integer n = 1. 


Show that P(n) is true for n = 1: 


da, are any even integers, then 


<— P(n) 


Suppose a, is any even integer. Then SS 1d; = a, 
which is even. So P(1) is true. 
Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 
Let k be any integer with k = 1, and suppose that 
If ay, do, a3, ..., ap are any even integers, 
then 2o a; is even. 
We must show that 


<— P(k) inductive hypothesis 


If ay, ao, 43, .++, Ups are any even integers, 


then pai , 4; is even. <—P({k+1) 


So suppose qj, a2, a3, .. 
then 


., aç+1 are any even integers, 


k+l k 


X a= X ait ag 
=I i=l 


by writing the final term of the sum separately. Now, by 
inductive hypothesis, Des a; is even, and, by assump- 
tion, a,41 is even. Therefore, ze ; a; is the sum of two 
even integers, which is even (by Theorem 4.1.1) /as was 
to be shown]. 


47. Hint: Use proof by contradiction or proof by 
contraposition. 


SECTION 5.7 


1. a. 14243 4--+(k-1) 
_ (k= DkK- D+) _ k= Dk 
2 2 
b. 54+24+44+6+8+4-:-+2n 
=5+2(1+2+3+: +n) 
n(n+ 1) 
=5+2 = 5+n(n+1) 
2 
=n +n+5 
; ni-D+l_ 
2. a 1+2+2 et 2 = =2'-1 
2-1 
ec. 2742°-7.3 4.9" 3.34 +++ 427-342-343 
ee ia +203 we 2421) 
= 24301424274 ++ $273 4-9") 


7 j 3 gin-2)41 s1 
2=1 


= 2” +3(2"7! —1) 
= 2.2"! } 3.2071 3 


= 5-2" 1-3 
3. aj = 1 
a,=l-a9=1-1=1 
a, = 2a, = 2°1 


a; = 3a =3:2:1 
a4 = 4a3 = 4-3-2-1 


Guess: 
a, = n(n—1)°°°3-2-1=n! 


5. cq=l 
Cy = 3c, +1 =3-14+1=3+41 
C3 = 3c +1 =3-(3+1)+1=3°+3+1 


5:7 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


6. 


9. 


10. 


A-54 


c4 = 30, +1 = 3-37? +34+D41 


=3+3°4+341 
Guess: 
Ca = 37143"? e 33H31 
arc , 
= 3-1 by Theorem 5.2.2 with r = 3 
3"-1 
“9 
Hint: 
d, = 2"+2"-7.3 42" 3.34 +++ 427-342-343 
= 5-2" 1-3 


for every integer n = 1. 


Hint: For any positive real numbers a and b, 


h=V —hy = 22 -(2?-2! 41) 
2—2?+2'-1 

hy = 24 -h, = 2- (V -2 +2- 1) 
24*—23+2?-2' +1 


Guess: 
hy, =73" gn-l bres +( ii 
=(= [1-2+2— --- +(—1)"-2"] 


= (-1"[1+(-2) 
+ (—2)?— +++ +(—2)"] 


—?2 n+1 | 
( yt! : | 


by basic algebra 


CJI by Theorem 5.2.2 


7 (iy “[(-2)"*! = 1] 
(—D-(-3) 


grt =(— jj! 


= by basic algebra 
3 
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12. 5) =3 18. Proof (by mathematical induction): Let d be any fixed 
sı = so+2:-1 =3+2-1 constant, and let ap, a), a2, ... be the sequence defined 
Sy = 5, +2:2 =[3+2-1]+2-2 recursively by a, = a,_, +d for each integer k = 1. 

= 34+2-(14+2) The property P(n) is the equation a„ = dy + nd. We 
Sq = 8 +2°3 = [3+2 (1 42)]+2°3 show by mathematical induction that P(n) is true for 

i E i 2-(1+2+3) every integer n = 0. 

S4 = Sy 2:-4=[3 2-(1 24 3)] + 2-4 Show that P(0) is true: 

= 342-(14+24+3+4+4) When n = 0, the left-hand side of the equation is do, 

: and the right-hand side is aọ + 0-d = ag, which equals 
Guess: the left-hand side. Thus P(0) is true. 

Show that for every integer k = 0, if P(k) is true, 

Sy = 342-1 42434 -->+(—1) +n) then P(k +1) is true: 

=349. n(n + 1) by Theorem 5.2.1 Suppose k is any integer such that k = 0 and 

2 
=3+n(n+1) by basic algebra a = ao + kd. 
i [This is the inductive hypothesis. ] 

14. %1 = We must show that a,4,; = dg + (k + 1)d. Now 
X: Z 3x, +2 = 3+2 
x = 3 +3 = 3(3 +2) 4 3 = 32?+3-2+3 +) = atd by definition of ao, 44, ay, ... 
x4 = 3x3 +4 = 3(37+3-24+3)4+4 = [ay tkd]+d by substitution from the 

= 3°4+37-243-34+4 inductive hypothesis 
xs = 3x, +5 = 3(33 32-243-34 4)+5 = ao+ (k+ ld by basic algebra 
a 34433-2432-343-445 [as was to be shown]. 
x6 = 3x5 +6 19. Let U,, = the number of units produced on day n. Then 
= 3(34+37-2+37-34+4-3+5)+6 
( ) U, = Up, +2 for each integer k = 1, 


= 3°4+37-2439-3 437-443-546 
Up = 170. 


Guess: Hence Up, U, U, ... is an arithmetic sequence with 
x, = 3" 143772243" 3.34 -5543(n-Dtn fixed constant 2. It follows that when n = 30, 


= 37143772 4 377? 4.373 4 3" 4 373 4 U,, = Uy tn-2 = 170+ 2n = 170 +2-30 


2 times 3 times = 230 units. 


SSeS it ee Thus the worker must produce 230 units on day 30. 
(n—1) times n times 24. 2o 5k = = L= 1.192 X 10 = 


= (37 1435774 -e 4374341 119,200,000,000,000 = 119 trillion people (This is 
+ (3724 373 4 oe 4324341) 40°: about 20,000 times the current population of the earth!) 


(3? +34+1)+(34+1)4+1 26. b. Hint: Before simplification, 
3n] gai] 3-1 An = py + 200[(1.0025)" F 
A H 5 peed 5 (1.0025)""° + +++ +(1.0025)° + 1.0025 + 1]. 
3-1. S=] d. A49 = $67,481.15, Aggo = $188,527.05 
2 2 e. Hint: Use logarithms to solve the equation 


A, = 10,000, where A, is the expression found after 
simplifying the result in part (b). 


ae n—l 4 n-2 | | fi 
=3BB +3" 7+: +3+1)—n] 27. a. Hint: APY = 19.6% 

j ( (= _ ) c. Hint: approximately two years 
=1/3 

2 


28. Proof (by mathematical induction): Let ao, a), 


_ lagnt -3 —2n) a, ... be the sequence defined recursively by ag = 1 
4 and a; = ka,— for each integer k = 1, and let the 


property P(n) be the equation a, = n!. We show by 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


30. 


mathematical induction that P(n) is true for every 
integer n = 0. 
Show that P(0) is true: 


When n = 0, the right-hand side of the equation is 

0! = 1, and by definition of do, aj, a, ..., the left-hand 
side of the equation, dp, is also 1. Thus the property is 
true for n = 0. 


Show that for every integer k = 0, if P(k) is true, then 
P(k +1) is true: 


Suppose k is any integer with k = 0 and 
ay, = kt. 

[This is the inductive hypothesis.] 

We must show that a4, = (k + 1)!. Now 

Apa, = (k+ 1) ay 


= (k+1)-k! by substitution from the 
inductive hypotheses 
=(k+1)! by definition of factorial. 


[Hence if P(k) is true, then P(k + 1) is true.] 


Proof (by mathematical induction): Let c4, C2, C3, ... 
be the sequence defined recursively by c; = 1 and 
cy, = 3cy_1 + 1 for each integer k = 2. 


Let the property P(n) be the equation c, = eo We 


show by mathematical induction that P(n) is true for 
every integer n = 1. 


Show that P(A) is true: 


When n = 1, the right-hand side of the equation is 


gr St ae 
z =~ = l, and by definition of c4, c2, C3, .. 


., the 


left-hand side of the equation, c1, is also 1. Thus the 
property is true for n = 1. 


Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 


Suppose that k is any integer with k = 1 and 


3—1 
Ck = z 
[This is the inductive hypothesis.] 
ktl] 
We must show that c+; = 7 Now 


Cka = 3,41 
_ 3k — 1 by substitution from the 
=3 2 +1 inductive hypothesis 


grt 23. 9 
Fee 
2 2 
k+l 
= a Eh by basic algebra. 
2 


by definition of do, a), do, ... 


by definition of c1, C2, C3, ... 


5.7 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


A-56 


35. Hint: 
“tH k+l Cp! 
3 
z 3.2kt1 ktl - (=t! 
3 3 
2-241 4 (1! 2+2 (yy? 
3 3 
37. Hint: 


39 


[3 +k(k+1)]+2(k+1) 

= 34h +k4+2k+2 =34+[K+3k42] 
= 3+(k+1)(k+2) 
=3+(kt+I[(k+ 1) +1) 


Proof (by mathematical induction): Let x1, x2, x3, ... 


be the sequence defined recursively by x; = 1 and 

Xg = 3x,-; + k for each integer k = 2. Let the property, 
un 3 We show by 
mathematical induction that P(n) is true for every integer 
n=l. 

Show that P(A) is true: 

When n = 1, the right-hand side of the equation is 


gitl_9-1-3 _ 3-2-3 oe 
q = 4 = |, and by definition of x1, x2, 


f —2n- 
P(n), be the equation x, = = 


X3, ..., the left-hand side of the equation, xj, is also 1. 
Thus P(1) is true. 


Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 


Suppose that k is any integer with k = 0 and 


31 2k = 3 
Xx = =~~ g —. [Inductive hypothesis] We must show 
that 
3@*D41_ 9k +1)-3 
Xk+1 = , or, equivalently, 
3h? 2k 5 
Xk+1 = = Now 
Xg+1 = 3x, + k by definition 
, of x1, X2, X3, 
(+1 
=3 3t —2k-3 EEI by inductive 
4 hypothesis 
3-341 3-2k-3-3 4(k+1) 
4 4 
_ 3? _6k-9+4k+4 
4 
k+2 
= Sa 2k 5 by algebra 
4 


[as was to be shown]. 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


43 a. a =2 
ao 2 2 
a = = 
2ag-1 22=1 3 
2 z 
a 3 3 
a l = =—=2 
ml 2a 
3 3 3 
a 2 2 
a3 
“  2Qag9-1 2-2-1 3 
= 2 
a 3 3 
ai 2A 1 
Foa 2 
2 ifnis even 
Guess: a, = 2 
3 ifnis odd 
b. Proof (by strong mathematical induction): Let do, 


a), dy, ... be the sequence defined recursively by 


ap . 
Xo = 2 and a; = T - 7 for each integer k = 1. Let 


the property, P(n), be the equation 

2 ifniseven 

an5 1. an. 

3 Ifnis odd. 
We show by strong mathematical induction that 
P(n) is true for every integer n = 1. 
Show that P(0) and P(1) are true: 
The results of part (a) show that P(0) and P(1) are true. 


Show that for every integer k = 0, if P(k) is true 
for each integer i with 0 Si Sk, then P(k +1) 
is true: 


Let k be any integer with k = 0, and suppose that 
for each integer i withO < i S k, 


2 ifiiseven 
aj 7 2 
3 if iis odd. 


We must show that 


[Inductive hypothesis] 


2 ifkiseven 
A+1 = oe 
A if k is odd. 
Now 
ag =í 
ak1 => by definition of 
2a,—1 Ag, Ay, A, ... 
~ 2. ifti 
72-1 if k is even 
= 5 by inductive 
z if k is odd hypothesis 
2-3-1 
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Fi if k is even 
=42 
3 r r 
T if k is odd 
3 
a. . because k + 1 is 
3 ifk+ 1 is odd odd when k is even 


and k + 1 is even 


2 ifk+1iseven shen kiwcdd 


[as was to be shown]. 


45. v =1 
Vo = Vo/2)+ 43/2) +2 = vi +v +2 
=1+1+2 
v3 = V3/2) + Ya +2 = vi +v +2 
=14+(1+14+2)+2=3+2-2 
V4 = Wat Ys + 2 = vo + v2 +2 
=(1+14+2)+(1+1+2)+2 
=4+3-2 
Vs = Y5/2)+ Vio/2j + 2 = Va +v; +2 
= (34+2-2)+(1+14+2)+2 
=5+4-2 
V6 = Vo/2) + Y7/2) + 2 = v3 + v3 +2 
= (342-2)+(84+2-2)+2 
=6+5°2 
Guess: 
Vv, = n+2(n—1) = 3n—2 for every integer n = 1 
b. Proof (by strong mathematical induction): Let v4, 


V2, V3, ... be the sequence defined recursively by 
vı = Land vy = Vesa) + Yas1y/2) + 2 for each integer 
k = 1. Let the property, P(n), be the equation 

Vv, = 3n—2. 
We show by strong mathematical induction that 
P(n) is true for every integer n = 1. 
Show that P(1) is true: 
When n = 1, the right-hand side of the equation is 
3-1—2 = 1, which equals vı by definition of v4, v2, 
V3, .... Thus P(1) is true. 
Show that for every integer k = 1, if P) is true 
for each integer i with 0 Si Sk, then P(k + 1) 
is true: 


Let k be any integer with k = 1, and suppose that 
for each integer i with 1 < i S k, v; = 3i-2. 


5.8 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-58 


[This is the inductive hypothesis.] We must show that 
vk+1 = 3(k+ 1) 2=3k+1. 


Vk+1 = Wek+1)/2) > YK +2)/2) +2 


SECTION 5.8 


by definition of 1. (a), (d), and (£) 


Ai Pis Fas axe 0 
er 3. a. a= C2 


eH ned 
-hi 


D=2C+D=3 


w 
eee 


c=2 
< < 
TE. es -1 


a = 2-2? +(-1) =7 


k k+2 a 
GS )-2 if k is even A. a. Do C-3° D-( 2) =C+D=0 
k+l k+l b, = C-3'+D-(-2)' = 3C-2D =5 
H- if k is odd 
2 2 D=-C C=1 
=> => 
(22) -2 3C—2(-C) =5 D=-1 
j by = 3 +-1-27 =9-4=5 
=3(k+1)-2 


5. Proof: Given that a, = C:2"+ D, then for any choice of 
C and D and an integer k > 2, 
a, = C:2*+D 
ay, = C:2*!+D 
ap = C2? +D. 


by the laws of 
algebra 


=3k+1 


[as was to be shown]. 


46. Hint: Show that for every integer n = 0, 52, = 2” and 


$ . : 
Son41 = 2"*'. Then combine these formulas using the 
Hence 


3ay—1 — 2a, = 3(C-2* 1 + D) — 2(C-2" 7 + D) 


ceiling function to obtain s, = gins 


48. a. Hint: 
axe = 3C-:2'''+3p-2¢-2**-2p 
("5 ) if n is odd =3¢.21-¢c.2F 4D 
= = . k-1 
W= fn _ =2C:2!+D 
7 z7! if n is even = CŽ +D = aņ 


8. a. If, foreach k > 2, É = 27! +3? and t # 0, 
then f = 2t +3 [by dividing by °], and so 
f —2t—3 = 0. And, since ? — 2t- 3 = 
(t—3)(t+ 1), then t = 3 ort = —1. 

. It follows from (a) and the distinct roots theorem 


49. a. Hint: Express the answer using the Fibonacci se- 
quence. 


50. Performing the inductive step for a proof by math- 
ematical induction of the formula, involves sub- b 


52. 


stituting (k — 2)? in place of ap- in the expression 

ay, = 2a,-; +k— 1 in hopes of showing that a, equals 
(k— 1)’. However, solving 2(k— 2)? +k—1 = (k- 1° 
for k, gives that k — 5k + 6 = 0, which implies that 

k = 2 or k = 3. It turns out that the sequence a), az, 


a3 ,... does satisfy the given formula for k = 2 and 
k = 3, but when k = 4, ay = 2:4+(4—1) = 11, and 
11 # (4— 1}. Hence the sequence does not satisfy the 


formula for n = 4. 


a. Hint: The maximum number of regions is obtained 
when each additional line crosses all the previ- 
ous lines, but not at any point that is already the 
intersection of two lines. When a new line is added, 
it divides each region through which it passes into 
two pieces. The number of regions a newly added 
line passes through is one more than the number of 
lines it crosses. 


that for some constants C and D, do, ay, a2, ... satis- 
fies the equation 


a, = C:3" + D-(—1)" for every integer n = 0. 


Since dy = 1 and a, = 2, then 


dy = C-3° 
a, = C-3' 


D=1-C 
=> 
3C-(1- ©) =2 
D=1=C 
=> 
4c-1=2 


C= 3/4 
D=1/4. 


D-(-1)=C+D=1 
D-(-1)' =3C-D=2 


Thus a, = 33") + k- 1)” for every integer n = 0. 


53. Hint: The answer involves the Fibonacci numbers! 
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Tl. 


13. 


16. 


19. 


Characteristic equation: —4 = 0. Since? — 4 = 
(t—2)(t+ 2), t = 2 and t = —2 are the roots. By the 
distinct roots theorem, for some constants C and D 


d, = C-(2") + D+(—2)" 


for every integer n = 0. 
Since dọ = 1 and d; 


D-(—2)?=C+D=1 
D:(-2)! = 2C-2D = -1 


= — 1, then 


Thus d, = 12” + x —2)" for every integer n = 0. 


Characteristic equation: P-2t+1=0. By the qua- 
dratic formula, 
2+V4-4-1 2 


t= = = 1. 
2 2 


By the single root theorem, for some constants C and D 
r, = C- (1”) + Dn-(1") 
= C+nD for every integer n = 0. 


Since rọ = 1 andr, = 4, then 


my =C+0-D=C=1 A c=1 
rn =C+1:D=C+D=4 1+D=4 


C=1 
=> 
= 


= 1] + 3n for every integer n = 0. 


Thus r, 
Hint: For every integer n = 0, 


o vV3+2 i P 
2V3 cer 2V3 


Proof: Suppose r, s, a9, and a, are numbers with r # s. 
Consider the system of equations 


2a V3)". 


C+D = a 
Cr+ Ds = a}. 
By solving for D and substituting, we find that 
D= dy — G 
Cr+ (a — C)s = ay. 
Hence 


C(r— s) = a, — aos. 
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21. 


22. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Since r # s, both sides may be divided by r — s. Thus 
the given system of equations has the unique solution 


ay — aos 
CS 
p= 
and 
ay — aos 
D = a C= a 
r= 
ag — agos — a taos agra 
r=s r—-s ` 


Alternative solution: Since the determinant of the 
system is 1-s—r-1 = s —r and since r # s, the given 
system has nonzero determinant and therefore has a 
unique solution. 


Hint: Use strong mathematical induction. First note that 
the formula holds for n = 0 and n = 1. To prove the in- 
ductive step, suppose that k is any integer such that k = 2 
and the formula holds for every integer i with 0 < i S k. 
Then show that the formula holds for k + 1. Use the proof 
of Theorem 5.8.3 (the distinct roots theorem) as a model. 


The characteristic equation is -2+2 =0. By the 
quadratic formula, its roots are 
_2&V4-8_2+Ł+2i_]l+ti 
2 2 [=i 
By the distinct roots theorem, for some constants C 
and D 


an = C +D” + DA — i)” 


for every integer n = 0. 


Since dy = | and a, = 2, then 


=C(1+)°+D0-)° =C+D=1 
a= CA +)! +DA -—i)' 
=C1+ğ)+D-i)=2 
D=1-C 
cdi+i)+0d-Qd-i =2 
=1-C 
<€ 
[o i—1+i)+1 = 
D C 
=> 
eae 
D=1-C 
=> 
1+i I+i i r= 7 
2i % i =32 2 
| l-i 2-14+i 1+i 
D=1 = = 
2 3 2 
<] 
1-i 
C= 
2 


Thus for every integer, n = 0, 


1-i jpt 1+i i-p 
an = > (1+i)'4 5 A=; 


SECTION 5.9 


1. 


a. (1) p, q,r, and s are Boolean expressions by I. 

(2) ~s is a Boolean expression by (1) and II(c). 

(3) rv ~s is a Boolean expression by (1), (2), and 
TI(b). 

(4) (rv ~s) is a Boolean expression by (3) and 
II(d). 

(5) q^(rv ~s) is a Boolean expression by (1), (4), 
and II(a) 

(6) (qA(rv ~s)) is a Boolean expression by (5) 
and II(d) 

(7) ~p is a Boolean expression by (1) and II(c). 

(8) ~p v (q/A(rv ~s)) is a Boolean expression by 
(6), (7), and I(b). 


- a. (1) ()isinCbyl. 


(2) (()) isin C by (1) and Ha). 
(3) OO is in C by (1), (2), and I(b). 


. a. (1) By Theorem 5.9.1, a and b are strings in S 


because a and b are in A. 

(2) By (1) and part II(c) of the definition of string, 
a(bc) = (ab)c because a and b are strings in $S 
and c is in A. 


a. (1) MTJis inthe MIU system by I. 

(2) MITisin the MIU system by (1) and I(b). 

(3) MIIII is in the MIU system by (3) and II(b). 

(4) MITITITITis in the MIU system by (3) and 
II(b). 

(5) MIUITI Tis in the MIU system by (4) and 
II(c). 

(6) MIUUTisin the MIU system by (5) and II(c). 

(7) MIUTis inthe MIU system by (6) and II(d). 


a. (1) 2,0.3, 4.2, and 7 are arithmetic expressions 


by I. 
(2) (0.3 — 4.2) is an arithmetic expression by (1) 
and II(d). 


(3) (2-(0.3 —4.2)) is an arithmetic expression by 
(1), (2), and IIe). 

(4) (-—7) is an arithmetic expression by (1) and 
II(b). 

(5) ((2:(0.3 —4.2)) + (—7)) is an arithmetic ex- 
pression by (3), (4), and I(c). 


Proof (by structural induction): By the definition of $ in 
exercise 6, the only integer in the base for S is 5, and the 
recursion rule states that for every integer n in S,n+4 
is in S. Given any integer n in S, let property P(n) be the 
sentence, “n mod 2 = 1.” 
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Show that P(n) is true for each integer n in the base 
for S: 


The only integer in the base for S is 5, and P(S5) is true 
because 5 mod 2 = 1 since 5 = 2-:2+1. 


Show that for each integer n in S, if P(n) is true and 
if m is obtained from n by applying a rule from the 
recursion for S, then P(m) is true: 

Suppose n is any integer in S such that P(n) is true, or, 
in other words, n mod 2 = 1. [This is the inductive hy- 
pothesis.] The recursion for S consists of only one rule, 
and when the rule is applied to n, the result is n + 4. To 
complete the inductive step, we must show that P(n + 4) 
is true, or, equivalently, that (n + 4) mod 2 = 1. Now 
since n mod 2 = 1, then 


n = 2k + 1 for some integer k. 

Hence 
(n+ 4) mod 2 = [(2k+ 1)+4] mod 2 
= [2(k+2)+ 1] mod 2 


=1 because k +2 
is an integer, 


by substitution 


by basic algebra 


and so P(n + 4) is true /as was to be shown]. 


Conclusion: Because there are no integers in S other 
than those obtained from the base and the recursion 
for S, we conclude that every integer n in S satisfies the 
equation n mod 2 = 1. 


Proof (by structural induction): By the definition of S 
in exercise 7, the only element in the base is 1, and the 
recursion rules II(a) and II(b) state that for every string 
sin S, Os and 1s are in S. Given any string s in S, let 
property P(s) be the sentence “s ends ina 1.” 


Show that P(a) is true for each string a in the base 
for S: 


The only string in the base for S is 1, which ends in a 1, 
and so P(1) is true. 


Show that for each string x in S, if P(x) is true and 
if y is obtained from x by applying a rule from the 
recursion for S, then P(y) is true: 


The recursion for $ consists of two rules: II(a) and H(b). 
Suppose s is any string in S such that P(s) is true, which 
means that s ends in a 1. [This is the inductive hypoth- 
esis.] To complete the inductive step, we must show 
that applying either of the two recursion rules to s also 
results in a string that ends in 1. 


Now when rule II(a) is applied to s, the result is Os and 
when rule II(b) is applied to s, the result is 1s. Because s 
ends in a 1, so do Os and 1s, which means that P(Os) and 
P(1s) are true and completes the inductive step. 


Conclusion: Because there are no strings in S other 
than those obtained from the base and recursion for S, 
we conclude that every string in S ends ina 1. 
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9. 


10. 


11. 


13. 


Proof (by structural induction): By the definition of S 


in exercise 9, the only element in the base is A, and the 
recursion rules II(a)—II(d) state that for every string s in S, 
bs, sb, saa, and aas are in S. Given any string, s in S, let 
property P(s) be the sentence “s contains an even number 
of a’s.” 


Show that P(a) is true for each string a in the base 
for S: 


The only string in the base for S is A, which contains 0 
a’s. Since 0 is an even number, P(A) is true. 


Show that for each string x in S, if P(x) is true and 
if y is obtained from x by applying a rule from the 
recursion for S, then P(y) is true: 


Suppose s is a string in S such that P(s) is true, or, in 
other words, suppose that s has an even number of a’s. 
[This is the inductive hypothesis.] 


When either rule II(a) or I(b) is applied to s, the result 
is either bs or sb, each of which contains the same num- 
ber of a’s as s and hence an even number of a’s. Thus 
both P(bs) and P(sb) are true. 


When either rule II(c) or II) is applied to s, the result 
is either aas or saa, each of which contains two more 
a’s than does s. Because two more than an even number 
is an even number, both aas and saa contain an even 
number of a’s. Hence both P(aas) and P(saa) are true. 
This completes the inductive step because I(a)—I(d) 
are the only rules in the recursion. 


Conclusion: Because there are no strings in S other 
than those obtained from the base and the recursion for 
S, we conclude that every string in § contains an even 
number of a’s. 


Hint: For each string s in S, let property P(s) be the 
sentence: “s represents an odd integer.” In the decimal 
notation, a string represents an odd integer if, and only 
if, it ends in 1, 3, 5, 7, or 9. 


Hint: By divisibility results from Chapter 4 (exercises 
15 and 16 of Section 4.4), if both k and m are divisible 
by 5, then so are k + m and k — m. 


Hint: Can the number of /’s in a string in the M I U sys- 
tem be a multiple of 3? How do rules IH(a)-IH(d) affect 
the number of /’s in a string? 


15. a. The parenthesis structure ()(() is not in C. To see 


why this is so, we will prove that every parenthe- 
sis structure in C has an equal number of left and 
right parentheses. It will follow that, since ()(() 
has 3 left parentheses and 2 right parentheses, ( )(() 
cannot be in C. 


Proof (by structural induction): 


Define a function f : C — Z as follows: For each 
parenthesis structure x in C, let 
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the number of left the number of right 
fœ = i z i i 
parentheses in x parentheses in x 


Given any parenthesis structure x in C, let property 
P(x) be the sentence, “ fx) = 0.” 


Show that P(a) is true for each parenthesis struc- 
ture ain the base for C: 


The only parenthesis structure in the base for C is 
(), and f [0] = 0 because () has one left parenthesis 
and one right parenthesis. Hence P[()] is true. 


Show that for each parenthesis structure x in C, if 
P(x) is true and if y is obtained from x by applying a 
rule from the recursion for C, then P(y) is true: 

The recursion for C consists of two rules, denoted 
II(a) and II(b). 

Suppose u and v are any parenthesis structures 

in C such that P(u) and P(v) are true. This means 
that f(u) = O and f(v) = 0. [This is the inductive 
hypothesis. ] 

Let k and m be the numbers of left and right pa- 
rentheses, respectively, in u, and let n and p be the 
numbers of left and right parentheses, respectively, 
in v. Then k—m = Oandn-—p = 0 by definition of f. 


When rule II(a) is applied to u, the result is (u), and 
fi(u)] = 0 because (u) has one more left paren- 
thesis and one more right parenthesis than u, and 
(k+ 1)—(m+ 1) = 0. Hence, P(u) is true. 

When rule II(b) is applied to u and v, the 

result is uv. Now uv has k +n left parentheses 
and m + p right parentheses. Hence f(uv) = 
(k+n)—(m+p) = (k-—m)+(n—-p) =0+0=0. 
Hence, f(uv) = 0, and so P(uv) is true. 


Conclusion: Because there are no parenthesis 
structures in C other than those obtained from the 
base and the recursion for C, we conclude that given 
any parenthesis structure x in C, f(x) = 0. There- 
fore, every parenthesis structure in C has the same 
number of left and right parentheses. 

b. Hint: This parenthesis structure is not in C either 
even though it has equal numbers of left and right 
parentheses. 


16. Let S be the set of all strings of 0’s and 1’s with the 


same number of 0’s and 1’s. The following is a recursive 
definition for S. 
I. Base: The null string A € S. 
Il. Recursion: If s E S, then 
a. OlsES b. s01 ES c. l0sES 
d. s10ES e. OSLES f. I0ES 
III. Restriction: There are no elements of S other that 
those obtained from the base and recursion for S. 


18. 


20. 


22. 


23. 


25. 


26. 


Let T be the set of all strings of a’s and b’s that contain 
an odd number of a’s. The following is a recursive 
definition of T. 


l. Base: a E T. 
Il. Recursion: If t E T, then 
a. bt ET b. bET c. aat ET 


d. aa ET œe. taaET 


- Restriction: There are no elements of T other than 
those obtained from the base and recursion for T. 


a. Suppose a is any character in A. [We must show that 
L(a) = 1.] Then 


L(a) = L(À +a) by part II(b) of the definition of string 
= L(A)+1 by part (b) of the definition of the 
length function 
=0+1 by part (a) of the definition of the 
length function 
=1 by definition of 0. 


Hint: If S is the set of all strings over a finite set 

A, then for any string u in S, let the property P@) 

be the sentence “If v is any string of length n, then 
Rev(uv) = Rev(v)Rev(u).” For the basis step you will 
show that P(A) is true by showing that Rev(uA) = 
Rev(A)Rev(u). For the inductive step you will as- 
sume that x is any string for which P(x) is true, and 
you will show that if y is the result of applying rule 
I(a) to x, then P(y) is true. 


a. M(86) = M(M(97)) since 86 < 100 


= M(M(M(108))) since 97 < 100 
= M(M(98)) since 108 > 100 
= M(M(M(109))) since 98 < 100 
= M(M(99)) since 109 > 100 
= M(91) by Example 5.9.7 


a. A(1, 1) = A(O, AC, 0)) by (5.9.3) with m = 1 and 


n=1 


= A(1,0)+ 1 by (5.9.1) with n = A(1, 0) 
= A(O, 1)+1 by (5.9.2) with m = 1 
=(1+1)+1 by (5.9.1) with n = 1 


Alternative solution: 


AC, 1) = A(O, AC, 0)) by (5.9.3) with m = 1 andn = 1 


= A(O, A(O, 1)) by (5.9.2) with m = 1 
= A(0, 2) by (5.9.1) with n = 1 
=3 by (5.9.1) with n = 2 


a. Proof by mathematical induction: Let the property, 
P(n), be the equation A(1, n) = n+2. 


Show that P(0) is true: 


6.1 


SOLUTIONS AND HINTS TO SELECTED EXERCISES 


A-62 


When n = 0, 
A(1, n) = ACI, 0) by substitution 
= A(0, 1) by (5.9.2) 
=1+1 by 6.9.1) 
= 2. 


On the other hand, n +2 = 0+2 also. Thus 
A(1,n) =n+2 forn = 0. 

Show that for every integer k = 0, if P(k) is true, 
then P(k +1) is true: 


Let k be an integer with k = 1 and suppose P(x) 

is true. In other words, suppose A(1, k) = k +2. 
[This is the inductive hypothesis.] We must show that 
P(k + 1) is true. In other words, we must show that 
AU, k+1) = (k+1)+2 = k+3. Now 


Ad, k+ 1) = AQ, AC, k)) by (5.9.3) 
=A(,k)+1 by (5.9.1) 
=(k+2)+1 by inductive hypothesis 
=k+3 


[as was to be shown]. 


[Since both the basis and the inductive steps have been 
proved, we conclude that the equation holds for every 
nonnegative integer n.] 


28. Suppose F is a function. Then F(1) = 1, F(2) = FC) = 1, 


F(3) = 1+ F(5:-3— 9) = 1 + F(6) = 1 + F(3). Subtract- 
ing F(3) from the extreme left and extreme right of this 
sequence of equations gives 1 = 0, which is false. Hence F 
is not a function. 


SECTION 6.1 
1. a. A= {2, {2}, (VD} = {2, {2}, 2} = {2, {2} 


2. 


and B = {2, {2}, {{2}}}. So A C B because every 
element in A is in B, but B Z A because {{2}} E B 
and {{2}} É A. Thus A is a proper subset of B. 

c. A= {{1,2}, {2,3} and B = {1, 2,3}. So A B 
because {1, 2} E A and {1, 2} B. Also B GA 
because 1 € B and 1 EA. 

e. A = {V16, {4} = {4, {4} and B = {4}. Then 
B C A because the only element in B is 4 and 4 is in 
A, but A È B because {4} € A and {4} É B. Thus B is 
a proper subset of A. 


Proof That B C A: 


Suppose x is a particular but arbitrarily chosen element 
of B. 
[We must show that x E A. By definition of A, this 
means we must show that x = 2: (some integer).] 


By definition of B, there is an integer b such that 
x=2b-2: 
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[Given that x = 2b — 2, can x also be expressed as 2° 
(some integer)? That is, is there an integer—say, a— 
such that 2b — 2 = 2a? Solve for a to obtaina = b—1. 
Check to see if this works.] 


Leta =b-1. 


[First check that a is an integer.] 


We know that a is an integer because it is a difference 
of integers. 


By substitution, 2a = 2(b 


[Then check that x = 2a.] 


1) = 2b-2=x. 


Thus, by definition of A, x is an element of A, 


[as was to be shown]. 


- RCT because there are elements in R that are not 


in T. For example, the number 2 is in R but 2 is not 
in T since 2 is not divisible by 6. 


- T C R because every element in T is in R since 


every integer divisible by 6 is divisible by 2. To 

see why this is so, suppose n is any integer that is 
divisible by 6. Then n = 6m for some integer m. 
Since 6m = 2(3m) and since 3m is an integer (being 
a product of integers), it follows that n = 2-(some 
integer), and, hence, that n is divisible by 2. 


C C D because every element in C is in D. To see 
why this is so, suppose n is any element of C. Then 
n = 6r—5 for some integer r. Let s = 2r — 2. Then 
s is an integer (because products and differences of 
integers are integers), and 

3s+1 = 3(2r—2)+1=6r—6+1=6r—S, 
which equals n. Thus n satisfies the condition for 
being in D. Hence, every element in C is in D. 


. D Ç C because there are elements of D that are not 


in C. For example, 4 is in D because 4 = 3-1 + 1. 
But 4 is not in C because if it were, then 4 = 6r—5 
for some integer r, which would imply that 9 = 6r, 
or, equivalently, that r = 3/2, and this contradicts 
the fact that r is an integer. 


Sketch of proof that B C C: If ris any element of 
B then there is an integer b such that r = 10b — 3. 
To show that r is in C, you must show that there is 
an integer c such that r = 10c + 7. In scratch work, 
assume that c exists and use the information that 
10b — 3 would have to equal 10c + 7 to deduce 

the only possible value for c. Then show that this 
value is (1) an integer and (2) satisfies the equation 
r = 10c +7, which will allow you to conclude that r 
is an element of C. 


Sketch of proof that C C B: If s is any element of C 
then there is an integer c such that s = 10c + 7. To show 
that s is in B, you must show that there is an integer b 
such that s = 10c — 3. In scratch work, 


Tl. 


13. 


15. 


assume that b exists and use the information that 

10c + 7 would have to equal 10b — 3 to deduce the only 
possible value for b. Then show that this value is (1) an 
integer and (2) satisfies the equation s = 10b — 3, which 
will allow you to conclude that s is an element of B. 


- In words: The set of all x in U such that x is in A 


and x is in B. 


In symbolic notation: A N B. 


x€Aandx€B 
AUB= (Il, 3,5, 6, 7, 9} 

AN B= {3,9} 

AUC = {1, 2, 3, 4, 5, 6, 7, 8, 9} 
ANC=6 

A-B = {1,5,7} 
AUB={xER|0<x<4} 
ANB={xER|1 <x <2} 
AS = {x ER|x <Oorx> 2} 
AUC={xER|0<x<20r3 <x <9} 
ANC=6 


Bo = {x ER|x < lorx=4} 

ASN Bo = {x ER|x <0 orx=4} 
ASU BS = {x ER|x<lorx>2} 
(AN BY = {x E R|x < lorx> 2} 
(A U BY = {x ER|x <0 orx= 4} 


AN B= {1111} 
A—B = {1110, 1000, 1001} 


& 


U 


16. a. AU(BNC) = {a,b,c}, (AUB) NC = 


{b,c}, and (A UB)N (AUC) = 
{a,b,c,d} N {a, b, c, e} = {a,b,c}. 
Hence A U(BNC)=(AUB)N(AUC). 
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17. a. 


18. a. The number 0 is not in Ø because Ø has no 
elements. 
b. No. The left-hand set is the empty set; it does not 
have any elements. The right-hand set is a set with 
one element, namely Ø. 


19. A, = {1, 17} = {1}, A> (2, 27} = {2,4}, 
A3 = {3, 3°} = {3, 9}, A4 = {4, 4} = {4, 16} 
a. Ay UA, U A3 U A4 = {1} U {2,4} U {3, 9} 
U {4, 16} = {1, 2, 3, 4, 9, 16} 
b. Ay MAM A3 MA, = 
{1} NA {2,4} N {3,9} N {4, 16} = Ø 
c. Aj, A2, A3, and A4 are not mutually disjoint, because 


21. Cy = {0, —0} = {0}, Ci = {1, -1}, C = {2, —2}, 
C3 = (3, —3}, C4 = (4, —4} 


a. Uo=ut. ue, —2} U 43, —3} U 


i=0 


{4, -4 = 1-4, = 3, —2, —1, 0, 1, 2, 3, 4} 


4 
b. MG = {0} N {1,—1} N {2,—2} N {3,-3} N 
i=0 
{4, —4 = Ø 
c. Co, Ci, Ch, . . . are mutually disjoint because no 
two of the sets have any elements in common. 


d. JG = 


i=0 


{—n, —(n—1),...,-—2,-1,0,1,2,..., (a— 1), n} 


M 


. NC = 6 
i=0 


f. (JC; = Z, the set of all integers 


i=0 
g- are == Ø 
i=0 


22. Do = [—0, 0] = {0}, Dj = [-1, 1], D2. = [-2, 2], 
D; = [—3, 3], Dy = [—4, 4] 


a. UD, = FUL 1,1] U [-2, 2] U [-3, 3] U 


i=0 


[-4,4]=[-4,4] 
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24. 


27. 


28. 


31. 


= 


g. 


a. 
b. 


AD =0A1 1,1] N[-2, 2] N[-3, 3] N 


i=0 


[—4, 4] = {0} 

Do, Dj, Do, ..., are not mutually disjoint. In fact, 
each D} C Dy+4. 

UD; = [—n, n] 

i=0 

QP: = {0} 

i=0 


(JD; = R, the set of all real numbers 


i=0 


ND: = (0) 


= (0, %), W, = (1, œ), W, = (2, ©), W3 = (3, ©), 
= (4%) 


4 
U W; = (0, %) U (1, %) U (2, œ) U (3, %) U 
i=0 


(4, œ) = (0, 2) 


4 
NM W: = 0, 2) N (1, %) N (2, %) N (3, 2) N 
i=0 


(4, ©) = (4, œ) 
Wo, Wi, W2, . . . are not mutually disjoint. In fact, 


W,+, C W; for every integer k = 0. 


UW; = (0, %) 
i=0 
AW; = (n, ©) 
i=0 
UW; = (0, %) 
i=0 


nw, = O 
i=0 


No. The element d is in two of the sets. 
No. None of the sets contains 6. 


Yes. Every integer is either even or odd, and no integer 
is both even and odd. 


a. 
b. 
c. 


d. 


ANB = {2}, so P(A N B) = {Ø, {2}. 
A = {1,2}, so P(A) = {Ø, {1}, {2}, {1, 2}}. 
AUB = {1,2,3}, so P(A U B) = {Ø, {1}, 
{2}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3} 
A X B= {(1, 2), (1, 3), 2, 2), (2, 3)}, so 
P(A X B) = {Ø, (1, D}, {A D} (2, D}, 12, 3}, 
1d, 2), d, 3)}, {C, 2), 2, 2)}, 
(d, 2), (2, 3), {(, 3), (2, 2}, {CL 3), 2, 3)}, 
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{2, 2), (2, 3), {, 2), d, 3), 2, D}, 

{C, 2), A, 3), 2, 3), 

a, 2), 2, 2), (2, 3)}, {, 3), 2, 2), 2, 3)}, 
{C, 2), CL 3), (2, 2), (2, 3)}}. 


P(A X B) = {O, {Cw}, {Cv}, (A, W, Cv} 
33. b. PPD) = PUD) = 10, {P}} 
34. a. 


32. a. 


A, U (A; X A3) 
= {1} U {(u, m), (u, n), (v, m), V, n)} 
= {1, (u, m), (u, n), (v, m), (v, n)} 
AX(BUC) = {a,b} X {1,2,3} 
= {(a, 1), (a, 2), (a, 3), (b, 1), (b, 2), (b, 3)} 
b. (A X B) U (A X C) = {(a, 1), (a, 2), (b, 1), 
(b, 2), (a, 2), (a, 3), (b, 2), (b, 3)} 
= {(a, 1), (a, 2), (b, 1), (b, 2), (a, 3), (b, 3)} 


35. a. 


36. 
i ert 2 ee 
J 1] 2 )3)] 1 )2) 3.)4) 152 
found no | yes no yes no | yes 
answer |ACB 


SECTION 6.2 


1. a. (DA 

b. ()ANB 
Oe 

d. (I)(AUB)NC 


Q)BUC 


(QAU(BNC) 


. ()A-B DA (3)A 4B 


. by definition of subset (because A C B) 
b. by definition of subset (because B C C) 
c. by definition of subset 


5. Proof: Suppose A and B are any sets. 


Proof that B— A C B N A‘: Suppose x E€ B — A. By 
definition of set difference, x € B and x & A. It follows 
by definition of complement that x € B and x € A‘, 
and so by definition of intersection, x E B N A®. [Thus 
B—A CBNA‘ by definition of subset.] 

Proof that B N A®° C B —A: Suppose x € B N A‘. By 
definition of intersection, x € B and x € A“. It follows 
by definition of complement that x E B and x ¢ A, and 
so by definition of set difference, x E B — A. [Thus 

B N A‘ C B—A by definition of subset.] 

[Since both subset relations have been proved, 

B-—A = BNA‘ by definition of set equality.] 


6. Partial answers 
dd) a(ANB)U(ANC) 
d.x E€(ANB)U(ANC) 


bA cxEC 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Hint: This is somewhat similar to the proof in 
Example 6.2.3. 


Proof: Suppose A and B are any sets. 
Proof that (A N B) U (A N B®) CA: Suppose 


x E (A N B) U (A A B°). [We must show that x E A.] 
By definition of union, x E A N B or x E (A N B®). 


Case 1 (x E A N B): In this case x is in A and x is in B, 
and so, in particular, x € A. 


Case 2 (x E A N B°): In this case x is in A and x is not 
in B, and so, in particular, x € A. 


Thus, in either case, x € A [as was to be shown]. So 
(A N B) U (AN B®) CA [by definition of subset]. 
Proof that A C (A N B) U (A N B°): Suppose 

x E A. [We must show that x E (A N B) U (AN B°).J 
Either x € B or x É B. 


Case 1 (x € B): In this case we know that x is in A and 
we are also assuming that x is in B. Hence, by definition 
of intersection, x E A N B. 


Case 2 (x E A N B°): In this case we know that x is in 
A and we are also assuming that x is in B®. Hence, by 
definition of intersection, x E A N B®. 


Thus, in either case x E A N B or x E A N B‘, and so, 
by definition of union, x € (A N B) U (A N B°) [as was 
to be shown]. So A C (A N B) U (A N B°) [by definition of 
subset]. 


Conclusion: Since both subset relations have been 
proved it follows by definition of set equality that 
(AN B)U (AN B‘) =A. 


Partial proof: Suppose A, B, and C are any sets. To 
show that (A — B) U (C — B) = (A U C) -— B, we must 
show that (A — B) U (C — B) C (A U C) —B and that 
(A U C)—B C (A-B) U (C-B). 

Proof that (A — B) U (C — B) C (A U C) — B: 
Suppose that x is any element in (A — B) U (C — B). [We 
must show that x E (A U C) -— B.] By definition of union, 
xE&A-—-BorxE C-B. 


Case I (x E A — B): Then, by definition of set dif- 
ference, x € A and x É B. Now because x € A, we 
have that x E A U C by definition of union. Hence 

x E A U Cand x ¢ B, and so, by definition of set dif- 
ference, x € (A U C) —B. 


Case 2 (x E C — B): Then, by definition of set dif- 
ference, x E C and x É B. Now because x E C, we 
have that x E A U C by definition of union. Hence 

x E A U C and x ¢ B, and so, by definition of set dif- 
ference, x € (A U C)—B. 


Thus, in both cases, x E (A U C)-— B [as was to 
be shown]. So (A— B) U (C— B) C (AU C)-B 
[by definition of subset]. To complete the proof that 
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10. 


14. 


15. 


16. 


19. 


(A—B)U(C—B) = (A U C) —B, you must show that 
(AU C)—-BC(A—B) U(C—B). 


Proof: Suppose A, B, and C are any sets. 

We will show that (A UB) 1 CCA U(BNC),. 
Suppose x is any element (A U B) N C. 

By definition of intersection x is in A U B and x is in C. 


Then by definition of union x is in A or x is in B, and 
in both cases x is in C. It follows by definition of 
union that in case x is in A and x is in C, then x is in 
A U (BN C) by virtue of being in A. And in case x is 
in B N C, then xis in A U (B N C) by virtue of being 
in B AN C. Thus in both cases x is in A U (B A C), 
which proves that every element in (A U B) N Cis in 
AU(BNC). 

Hence (A U B) N C CA U (B N C) by definition of 
subset. 


Partial proof: Suppose A and B are any sets. We will 
show that A U (A N B) C A. Suppose x is any element 
in A U (A N B). [We must show that x € A.] By defini- 
tion of union, x € A or x E A N B. In the case where x 
E A, clearly x € A. In the case where x E A N B, x E 
A and x € B (by definition of intersection), and so, in 
particular, x E A. Hence, in both cases x € A [as was 
to be shown]. Thus A U (A N B) CA by definition of 
subset. 
To complete the proof that A U (A N B) = A, you must 
show that A CA U (BNA). 


Proof: Let A be any set. [We must show that AU © = A.] 
Proof that A U Ø CA: Suppose x E A U Ø. Then 

x E A or x E Ø by definition of union. But 

x € Ø since Ø has no elements. Hence x € A. 

Proof that A C A U ©: Suppose x E A. Then the 
statement “x E A or x E @” is true. Hence x E A U Ø 
by definition of union. [Alternatively, A C A U Ø by the 
inclusion in union property.] 

Since A U Ø CA and A CA U Ø, then A U Ø = A by 
definition of set equality. 


Proof: Suppose A, B, and C are any sets such that A C B. 
Let x E A N C. By definition of intersection, x E A and 
x E C. Now since A C B and x € A, then x € B. Hence 
x E B and x E C, and so, by definition of intersection, 

x E B N C. [Thus A N C C B N C by definition of subset.] 


Hint: The proof has the following outline: 


Starting point: Suppose A, B, and C are any sets such 
that A C BandA CC. 


To show: AC BNC. 


6.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-66 


21. Proof: Suppose A, B, and C are arbitrarily chosen sets. 


A X (BUC) C(A x B) U(A x C): Suppose 
(x,y) EA X (BUC). [We must show that 

(x, y) E (A X B) U (A X C).] Then x E€ A and 

y E B U C. By definition of union, this means that 
yEBoryEC. 


Case 1 (y € B): Then, since x € A, (x,y) EA X B 
by definition of Cartesian product. Hence 
(x, y) E (A X B) U (A X C) by definition of union. 


Case 2 (y E C): Then, since x E A, x,y) EA X C 
by definition of Cartesian product. Hence 
(x, y) E (A X B) U (A X C) by definition of union. 


Hence, in either case, (x, y) E (A X B) U (A X C) [as 
was to be shown]. 


Thus A X (B U C) C (A X B) U (A X C) by definition 
of subset. 


(A xXx B)U(A X C) CA X (B U C): Suppose 
(x, y) E (A X B) U (A X C). Then (x,y) EA X Bor 
wy EA XC. 


Case I ((x,y) EA X B): In this case, x E A and y € B. 
Now since y € B then y € B U C by definition of 
union. Hence x € A and y € B U C, and so, by defini- 
tion of Cartesian product, (x, y) E A X (B U C). 


Case 2 ((x, y) EA X C): In this case x E€ A and y E C. 
Now since y € C, then y € B U C by definition of 
union. Hence x € A and y € B U C, and so, by defini- 
tion of Cartesian product, (x, y) E A X (B U C). 

Thus, in either case, (x, y) E A X (B U C). 

[Hence, by definition of subset, 
(A{AXB)U(AXC)CAX(BUC)] 

[Since both subset relations have been proved, we can con- 
clude that A X (B U C) = (A X B) U (A X C) by defini- 
tion of set equality.] 


. There is more than one error in this “proof.” The most 


serious is the misuse of the definition of subset. To say 
that A is a subset of B means that for every x, if x E A 
then x € B. It does not mean that there exists an ele- 
ment of A that is also an element of B. The second error 
in the proof occurs in the last sentence. Even if there is 
an element in A that is in B and an element in B that is 
in C, it does not follow that there is an element in A that 
is in C. For instance, suppose A = {1, 2}, B = {2, 3}, 
and C = {3,4}. Then there is an element in A that is in 
B (namely 2) and there is an element in B that is in C 
(namely, 3), but there is no element in A that is in C. 


Hint: The words “and so x € A U B” do not necessar- 
ily follow from “x Æ A or x É B.” Try to think of an 
example of sets A and B and an element x for which “x 
€ A or x É B” is true and “x € A U B” is false. 
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26. 


The shaded region is A U (B N C). 


U 


The most darkly shaded region is 
(AUB)N(AUC). 


27. (a) (A-B) N (B—A) 


28. 


30. 


32. 


(b) intersection 

(c) B-A 

(d) B 

(e) A 

(f) A 

(g) (A-B) N (B-A) = Ø 


Proof by contradiction: Suppose not. That 

is, suppose there exist sets A and B such that 

(A N B) N (A N B®) # Ø. Then there is an element x 
in (A N B) N (A N B°). By definition of intersection, 

x E (A N B) and x E (A N B®). Applying the definition 
of intersection again, we have that since x € (A N B), 

x E A and x € B, and since x € (A N B®), x E A and 

x & B. Thus, in particular, x E€ B and x € B, which is 

a contradiction. It follows that the supposition is false, 
and so (A N B) N (A A B5 = Ø. 


Proof: Let A be a subset of a universal set U. Suppose 
ANA‘ # Ø, that is, suppose there is an element x such 
that x E A N A“. By definition of intersection, x € A and 
x € A“, and so by definition of complement, x € A and 

x É A. This is a contradiction. [Hence the supposition is 
false, and we conclude that A N AS = ©.] 


Proof: Let A be a set. Suppose A X Ø + Ø. Then there 
would be an element (x, y) in A X Ø. By definition of 
Cartesian product, x E A and y € Ø. But there are no 
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33. 


36. 


39. 


40. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


elements y such that y E Ø. Hence there are no ele- 
ments (x, y) in A X Ø, which is a contradiction. [Thus 
the supposition is false, and so A X © = ©.] 


Proof: Let A and B be sets such that A C B. [We must show 
that A N B® = ©.] Suppose A N B| # Ø; that is, suppose 
there were an element x such that x E A N BÀ. Then 

x E A and x E B“ by definition of intersection. So x € 

A and x € B by definition of complement. But A C B by 
hypothesis, and, since x € A, then x € B by definition of 
subset. Thus x É B and also x € B, which is a contradic- 
tion. Hence the supposition that A N BF # Ø is false, and 
so A N Bo= Ø. 


Proof: Let A, B, and C be any sets such that C C B — A. 
Suppose A N C # Ø. Then there is an element x such 
that x E A N C. By definition of intersection, x € A 
and x € C. Now since x € C and C C B — A, then x 

€E B and x € A. So x € A and x É A, which is a 
contradiction. Hence the supposition is false, and thus 
ANC=Ø. 


a. Start of proof that 
A U B C (A-B) U (B—A) U (A N B): Given any 
element x in A U B, by definition of union x is in at 
least one of A and B. Thus x satisfies exactly one of 
the following three conditions: 


(1) x E A and x € B (xis in A only) 
(2) x E Band x € A (x is in B only) 
(3) x E A and x E B (xis in both A and B) 

b. To show that (A — B), (B — A), and (A N B) are mu- 
tually disjoint, we must show that the intersection of 
any two of them is the empty set. Now, by definition 
of set difference and set intersection, saying that 
x E A—B means that (1) x E A and x É B, saying 
that x E B—A means that (2) x E€ B and x É A, 
and saying that x © A N B means that (3) x E A and 
x E B. Conditions (1)—(3) are mutually exclusive: no 
two of them can be satisfied at the same time. Thus 
no element can be in the intersection of any two of 
the sets, and, therefore, the intersection of any two 
of the sets is the empty set. Hence, (A — B), (B — A), 
and (A M B) are mutually disjoint. 


Suppose that n is any positive integer and that A and B}, 
Bo, B3,..., B, are any sets. 


Proof that AN (Wa, C Ja N B): 
i=1 i= 


i=1 
Suppose x is any element in A N (Wa). [We must 
i=1 


show that x E |_)(A N B).] By definition of intersection, 


i=1 


41. 


43. 


n n 
x € A and x E |_JB;. Since x E |_)B;, the definition of 
i=1 i=l 
general union implies that x € B; for some i = 1, 
2,...,n, and so, since x € A, the definition of intersec- 


tion implies that x E A N B;. Thus, by definition of 


general union, x € U (A N B;) [as was to be shown]. 
i=l 


Proof that | )(ANB;)GAN ( B); 
i=1 i=l 


Suppose x is any element in (_J(A N B). [We must show 
i=1 


ai) ] By definition of 
=1 


that x ean( 


i= 


general union, x € A N B; for some i = 1,2,..., n. 
Thus, by definition of intersection, x E A and 


x E B, Since x € B; for some i = 1, 2, .. . , n, then 


by definition of general union, x E (_)8B;. 
i=l 
Thus we have that x € A and x € U B;, and so, 
i=1 


n 
by definition of intersection, x E A N (ú B.) 
[as was to be shown]. i=] 


Conclusion: Since both subset relations have been 


proved, it follows by definition of set equality that 


A a( z.) = UNB). 
i=1 i=l 


i= 


Proof sketch: If x E \_)(A;— B), then x E A;— B for 


i=1 


some i = 1,2,...,n, and so, (1) for some i = 1,2,...,n, 


x € A; (which implies that x € (Wa) and (2) x É B. 
i=l 
Conversely, if x € ( Ua) — B, then x € (Wa) 
i=1 i=] 
and x Æ B, and so, by definition of general union, 
x E A; for some i = 1,2,...,n, and x É B. 
This implies that there is an integer 7 such that 


x E A;— B, and thus that x E |_J (A;— B). 
i=1 
Suppose that n is any positive integer and that A and B}, 


Bo, B3,..., B, are any sets. 


L 


n n 

Proof that |_)(A xX B) GA x ( Bi} 

i=1 =1 

Suppose (x, y) is any element in (_)(A X Bj). [We must 
i=1 


show that (x, y) E A X (ús) ] By definition of general 
i=1 


i 
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union, (x, y) E A X B; for some i = 1, 2, .. . , n. By 
definition of Cartesian product, this implies that (1) x 
€ A and (2) y E B; for some i = 1, 2, . . . , n. By defini- 


tion of general union, (2) implies that y © |_)B;. Thus 
i=1 


x E A and y E (JB; and so by definition of Cartesian 


i=1 


product, (x, y) E A X ( Ua [as was to be shown]. 


i=1 


(A x B): 
i=1 


Proof that A X (a, Cc 
i=1 


Suppose (x, y) is any element in A X (Wa). [We must 


i=1 


show that (x, y) E (_J(A X B)).] By definition of Carte- 
i=1 


sian product, (1) x € A and (2) y E |_)8;. By definition 
i=l 
of general union, (2) implies that y € B; for some 
i= 1,2,...,n. Thus x E A andy € B; for some 
i = 1,2,...,n, and so, by definition of Cartesian 
product, (x, y) E A X B; for some i = 1,2,...,7. 
It follows from the definition of general union that 
(x, y) E [J (A X B) [as was to be shown]. 
i=l 
Conclusion: Since both subset relations have been 
proved, it follows by definition of set equality that 


UJiAxB) =4x (ús) 
i=l i=1 


SECTION 6.3 


1. 


Counterexample: A, B, and C can be any sets 

where A has an element that is not in C. For 

instance, let A = {1,2}, B = {2}, and C = {2}. 
Then (A U B) NA C= ({1, 2} U {2}) N {2} = 

{1,2} N {2} = {2},andA U (BN C) = 

{1,2} U ({2} NA {2}) = {1,2} U {2} = {1, 2}. Thus 
1 EAU (BN ©)but 1 € (A U B) N C, and hence 
(A U B) N C +A U (B N C) by definition of subset. 


Counterexample: A, B, and C can be any sets where 

A C Cand B contains at least one element that is not in 
either A or C. For instance, let A = {1}, B = {2}, and 
C = {1, 3}. Then A ZG B and B £ Cbut A C C. 


False. Counterexample: A, B, and C can be any sets where 
all three sets have an element in common or where A and 
C have a common element that is not in B. For instance, 
let A = {1, 2, 3}, B = {2, 3}, and C = {3}. Then B — C = 
{2}, and so A — (B — C) = {1, 2,3} — {2} = {1, 3}. 
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14. 


15. 
17. 


18. 


On the other hand, A — B = {1, 2,3} — {2,3} = {1}, 
and so (A — B)— C = {1} — {3} = {1}. Since 
{1,3} # {1}, A-(B-C) + (A-B)-C. 


True. Proof: Let A and B be any sets. 

Proof that A N (A U B) CA: Suppose 

x E A N (A U B). By definition of intersection, x E A 
and x € A U B. In particular, x E A. Thus, by definition 
of subset, A N (A U B) CA. 

Proof that A C A N (A U B): Suppose x € A. Then 
by definition of union, x E A U B. Hence x E€ A 

and x E A U B, and so, by definition of intersec- 
tion x E A N (A U B). Thus, by definition of subset, 
ACAN (AUB). 

Because both A N (A UB) CAandA CAN (AUB) 
have been proved, we conclude that A N (A U B) = A. 


True. Proof: Suppose A, B, and C are any sets such 
that A C Cand B C C. Let x E A U B. By definition 
of union, x € A or x €E B. But if x € A then 

x E C (because A C C), and if x E B then x E C 
(because B C C). Hence, in either case, x E C. [So, by 
definition of subset, A U B C C.] 


Hint: The statement is false. Consider sets U, A, B, and 
C as follows: U = {1, 2, 3, 4}, A = {1,2}, B = {1, 2, 3}, 
and C = {2}. 


Hint: The statement is true. Observe that if 
x€AN(B-O), then x É C, and so x € A N C. Con- 
versely, if x E (AM B)— (A N C), then x É A N C, and 
so, in particular, x € C. 


Hint: The statement is true. Sketch of part of proof: 
Suppose x € A. [We must show that x E B.] Either x € 
C or x É C. In case x E C, make use of the fact that 
ANCCRBANA C to show that x € B. In case x É C, 
make use of the fact that A U C C B U C to show that 
xEB. 


Hint: The statement is false. 


True. Proof: Suppose A and B are any sets with 

A C B. [We must show that P(A) C P(B).] So suppose 
X E P(A). Then X C A by definition of power set. And 
because A C B, we also have that X C B by the transi- 
tive property for subsets. Thus, by definition of power 
set, X E P(B). This proves that for all X, if X E P 

(A) then X E P(B), and so P(A) C P(B) [as was to be 
shown]. 


False. Counterexample: For any sets A and B, the 

only sets in P(A) U P(B) are subsets of either A or B, 
whereas a set in P(A U B) can contain elements from 
both A and B. Thus, if at least one of A or B contains 
elements that are not in the other set, P(A) U P(B) and 
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P(A U B) will not be equal. For instance, let A = {1} 
and B = {2}. Then {1, 2} E P(A U B) but {1, 2} 
E P(A) U P(B). 


Hint: The statement is true. To prove it, suppose A and 
B are any sets, and suppose X E P(A) U P(B). Note 
that if X C A, then X C A U B, and so X E€ P(A U B). 


a. Statement: V set S, 3 a set T such that S N T = ©. 
Negation: 3a set S such that V set T, S A T # ©. 
The statement is true. Given any set S, take T = S°. 


Then S N T=S N S° = Ø by the complement law 
for N. Alternatively, T could be taken to be Ø. 


Hint: Sy = {Ø}, Sı = {{a}, {b}, {c}} 


a. Si = {, {t}, {u}, {v}, {t, u}, {t, v}, fu, v}, {t, u, v} 
b. S, = {{w}, {t, w}, {u, w}, {v, w}, {t, u, w}, 

{t, v, w}, {u, v, w}, {t, u, v, w}} 
c. Yes 


Hint: The proof uses the same basic idea as the proof of 
Theorem 6.3.1. In this case let P(n) be the sentence “If a 
set S has n elements, the number of subsets of S with an 
even number of elements equals the number of subsets 
of S with an odd number of elements.” 


Hint: Use mathematical induction. In the inductive 
step, you will consider the set of all nonempty subsets 
of {2,..., k} and the set of all nonempty subsets of 
{2,...,k+1}. Any subset of {2,...,k+1} either 
contains k + 1 or does not contain k + 1. Thus 


the sum of all products 
of elements of nonempty 
subsets of {2,...,k +1} 


the sum of all products 
of elements of nonempty 
subsets of {2,...,k +1} 
that do not contain k+ 1 


the sum of all products 
of elements of nonempty 
subsets of {2,...,k+ 1} 
that contain k + 1 


Now any subset of {2,...,k +1} that does not con- 
tain k+ 1 is a subset of {2,..., k}. And any subset 
of {2,...,k+1} that contains k+ 1 is the union of a 
subset of {2,..., k} and {k+ 1}. 


a. commutative law for N 
b. distributive law 
c. commutative law for N 


Partial answer: 

a. set difference law 

b. set difference law 

c. commutative law for N 
d. De Morgan’s law 


29. Hint: Remember to use the properties in Theorem 6.2.2 
exactly as they are written. For example, the distribu- 
tive law does not state that for all sets A, B, and C, 
(AAUB)NC=(ANC)U(BNC). 


30. 


31. 


Proof: Let sets A, B, and C be given. Then 


(ANB)UC 
=CU(ANB) 

= (CUA) N (CUB) 
=(AUC)N(BUC) 


by the commutative law for U 
by the distributive law 


by the commutative law for U. 


Proof: Suppose A and B are sets. Then 


AU (B-A) 
=AU(BNA) 
=(AUB)N(AUA 
=(AUB)NU 
=AUB 


by the set difference law 
by the distributive law 
by the complement law for U 


by the identity law for N. 


36. Proof: Let A and B be any sets. Then 


39. 


(4° U BY) — Ay 
= ((AS U BS) NAS’ 


by the set difference law 


= (A£ U BEY U (Ac) 
= (9 NB) UAY 


by De Morgan’s law 
by De Morgan’s law 


=(ANB)UA by the double 

complement law 
=AU(ANB) by the commutative law for U 
=A by the absorption law. 


Partial proof: Let A and B be any sets. Then 


(A—B) U (B-A) 


41. 


=(ANB)U (BNA) 
=[(AN B°)UBIN[AN BUA] 

by the distributive law 
(AN BS)] A [AS U (A N B*)] 

by the commutative law for U 
=[(BUA)N(BUB‘)]N[(AS UA) N (AS UB*)] 
by the distributive law 
=[AUB)N (BU B®)] A [(4 UA‘) A (ASU B° )] 


by the commutative law for U. 


by the set difference law 


=[(BU 


Hint: The answer is Ø. 


44. a. Proof (by contradiction): Suppose not. That is, suppose 


there exist sets A and B such that A — B and B are not 
disjoint. Then (A — B) N B # Ø, which means there is 
an element x in (A — B) N B. By definition of intersec- 
tion, x E A— Band x € B, and by definition of set 
difference, x E A and x € B. Hence x € B and 

x É B, which is a contradiction. Thus the supposition 
is false, and so A — B and B are disjoint. 


6.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


46. 


47. 


48. 


51. 


52. 
53. 
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b. Let A and B be any sets. Then 


(A-B)NB 
=(AMB°)NB bythe set difference law 
=ANM(B°CB) _ by the associative law for N 
=AN(BNMB‘) by the commutative law for N 
=AND by the complement law for N 
=Ø by the universal bound law for N. 


Thus A — B and B are disjoint. 


a. AAB = (A-B) U (B—A) = {1,2} U {5,6} = 
{1, 2, 5, 6} 


Proof: Let A and B be any subsets of a universal set. By 
definition of A, showing that AA B = BAA is equivalent 
to showing that (A — B) U (B— A) = (B—A) U (A — B). 
This follows immediately from the commutative law 

for U. 


Proof: Let A be any subset of a universal set. Then 
AA® 


= (A-Ø) U (Ø-A) by definition of A 


=AND)UMWNA‘) 
=(ANU)U(ASN Ø) 


by the set difference law 


by the complement of U law 


and the commutative law for N 


=AU Ø by the identity law for N and 
the universal bound law for N 
=A by the identity law for U. 


Hint: First show that for any sets A and B and for any 
element x, 


x €AABS (x EA and x É B) or (x E€ B and x É A) 
and 
x AAB & (x É A and x È B) or (x € B and x € A). 


Same hint as for exercise 51. 


Start of proof: Suppose A and B are any subsets of a 
universal set U. By the universal bound law for union, 
B U U = U, and so, by the commutative law for union, 
U U B = U. Take the intersection of both sides of the 
equation with A. 


SECTION 6.4 


1. 


because | is an identity for - 
by the complement law for + 
by the distributive law for + over - 


angy 


by the complement law for - 
e. because 0 is an identity for + 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


4. Proof: For every a in B, 


6. 


a:0 = a: (a-a) by the complement law for - 
=(a-‘a)-a by the associative law for - 
=a by exercise 1 
=0 by the complement law for -. 


a. Proof: 0-1 = 0 because 1 is an identity for -, 
and 0+ 1 = 1+0 = 1 because + is commutative 
and 0 is an identity for +. Thus, by the uniqueness 
of the complement law, 0 = 1. 


Proof: Suppose 0 and 0’ are elements of B both of 
which are identities for +. Then both 0 and 0’ satisfy 
the identity, complement, and universal bound laws. 
[We will show that 0 = 0'.] By the identity law for +, for 
every a E B, 


a+O=a(*) and a+0 =a(**), 


It follows that 


0’ = 0'+0 by (*) with a = 0’ 
=0+0' by the commutative law for + 
=0 by (**) with a = 0. 


[This is what was to be shown.] 


Hint: Suppose 1 and 1’ are elements of B both of 

which are identities for -. Then for every a € B, by the 
identity law for -, a-1 = a and a: 1' = a. It follows that 
a:l = a: 1', and thus a +a:1 = a +a: 1', and so forth. 


Proof: Suppose B is a Boolean algebra and a 
and b are any elements of B. We first prove that 
(a-b)+(a+b) = 1. 
a:b+(a+b) 
= ((a:b)-a@) + ((a:b):b) 
by the commutative law for + 
= ((b-a)-a):@-(b-b)) 
by the distributive law of + over - 
= ((b+a)+a)-(@+ (b+b)) 


by the commutative and associative laws for + 


= (b+ @t+a))-@+(b+b)) 

by the associative and commutative laws for + 
= (b+ (a+ @)-@+1) 

by the commutative and complement laws for + 
=(b+1)-1 by the universal bound laws for + 
=1-1 by the universal bound law for + 
=1 by the identity law for -. 


Next we prove that (a-b): (a +b) = 0. 
(a-b): (@ +b) 
= ((a-b)-a) + ((a:b)-b) 


by the distributive law of: over + 
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10. 


Tl. 


12. 


13. 


15. 


16. 


= ((b-a)-a@) + (a: (b-b)) 

by the commutative and associative laws for - 
= (b-(a-@) + (a0) 

by the associative and complement laws for - 
= (b-0) +0 


by the complement and universal bound laws for - 


=0+0 

=0 
Because both (a-b) + (a + b) = 1 and (a-b): (@ + b) = 0, 
it follows, by the uniqueness of the complement law, that 
a-b=atb. 


by the universal bound law for - 


by the identity law for +. 


Hint: One way to prove the statement is to use the result 
of exercise 3. Some stages in the proof are the following: 


ygt y= na Ha yt@y =o 


H=c(xty) =o =z 


a. (i) Because S has only two distinct elements, 0 and 
1, we only need to check that 0+ 1 = 1+ 0. This is 
true because both sums equal 1. 


(v) Partial answer: Show that for all a, b, and c in B, 


at(b-c) =(atb)-(at+b). 

0+ (0-0) = 0+0 = 0 and (0+ 0)-(0 +0) 
=0:0=0 [a=b=c=0)] 

0+(0-1) = 0+0 = Oand (0+0): (0+1) 
=0:1=0 [a=b=0,c=1] 

0+(1-0) = 0+0 = Oand (0+ 1): (0+0) 
=1:0=0 [a=c=0,b=1] 

0+(1-1)=0+1=1and(0+1)-0+1) 
=1-1=1 [a=0,b=c=1] 


b. Hint: Verify that 0 +x = x and that 1-x = x for 
every x E S. 
Proof: Suppose a is any element of a Boolean algebra B. 


a+l=(atl):l 


because 1 is an identity for - 


= (a+1)-(a+a) bythe complement law for + 
at+l-a by the distributive law for + over - 
=ata-l by the commutative law for - 
=ata because | is an identity for - 
=1 by the complement law for +. 


Start of proof: Suppose a and b are any elements of a 
Boolean algebra B. 


a‘b+a=a-b+a:1_ because 1 is an identity for - 


For part (1), show that both sides of the equation equal 
a. For part (2), show that both sides of the equation 
equal a- (b+c). 


The sentence is not a statement because it is both true 
and false. If the sentence were true, then because it 

declares itself to be false, the sentence would be false. 
Therefore, the sentence is not true. On the other hand, 


17. 


20. 


23. 


24. 


28. 


if the sentence were false, then it would be false that 
“This sentence is false,” and so the sentence would be 
true. Consequently, the sentence is false. It follows that 
the sentence is both true and false. 


This sentence is a statement because it is true. Recall that 
the only way for an if-then statement to be false is for 

the hypothesis to be true and the conclusion false. In this 
case the hypothesis is not true. So regardless of what the 
conclusion states, the sentence is true. (This is an example 
of a statement that is vacuously true, or true by default.) 


This sentence is not a statement because it is both 

true and false. If the sentence is true, then, by defini- 
tion of an or statement, either the sentence is false or 
1+1 = 3. But 1 +1 #3, and so the sentence is false. 
On the other hand, if the sentence is false, then (by 
DeMorgan’s law) both of the following must be true: 
“This sentence is false” and “1 + 1 = 3.” But it is not 
true that 1 + 1 = 3. So it is impossible for the sentence 
to be false and hence the sentence is true. Consequently, 
the sentence is both true and false. 


Hint: Suppose that apart from statement (ii), all of 
Nixon’s other assertions about Watergate are evenly 
split between true and false. 


No. Suppose there exists a computer program P that has 
as output a list of all computer programs that do not list 
themselves in their output. If P lists itself as output, then 
it would be on the output list of P, which consists of all 
computer programs that do not list themselves in their 
output. Hence P would not list itself as output. But if P 
does not list itself as output, then P would be a member 
of the list of all computer programs that do not list them- 
selves in their output, and this list is exactly the output 
of P. Hence P would list itself as output. This analysis 
shows that the assumption of the existence of such a pro- 
gram P is contradictory, and so no such program exists. 


Hint: Show that any algorithm that solves the printing 
problem can be adapted to produce an algorithm that 
solves the halting problem. 


SECTION 7.1 


1. 


a. domain of f= {1, 3,5}, co-domain of 


f= {s, t, u, v} 
b. f0) =v, fB) = 5, fG) =v 
c. range of f= {s, v} 
d. yes, no 
e. inverse image of s = {3}, inverse image of u = Ø, 


inverse image of v = {1, 5} 
f. {(1,v), (3, s), (5, v)} 
a. True. The definition of function says that for any 


input there is one and only one output, so if two 
inputs are equal, their outputs must also be equal. 


7.1 


4. 
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c. True. The definition of function does not prohibit 
this occurrence. 


a. There are four functions from X to Y as shown below. 
X Y X Y 
X Y X Y 


9. 


10. 


Tl. 


12. 


a. I,(e) =e 
b. 1,(bj) = bř 


- a. The sequence is given by the function 


f: 7°" — R defined by the rule 


=1)" 
f(r) = S for each nonnegative integer n. 

a. F({1, 3, 4}) = 1 [because {1, 3, 4} has an odd number 
of elements] 

c. F({2, 3}) = 0 [because {2, 3} has an even number of 
elements] 

a. F(0) = (0° +2-0+4) mod 5 = 4 mod 5 = 4 

b. F(1) = (12 +2-14+4) mod5 =7mod5 =2 

a SQ) =1 b. S(15)=1+3+5+15 = 24 


c S(17)=1+17=18 


a. T(1)={1} b. 7(15) = {1, 3,5,15} 
T(17) = {1,17} 


a. F(4,4) = (2-4 
b. F(2, 1) = (2-2 


1,3-4—2) = (9, 10) 
1, 3-1=2):= 6,1) 


a. G(4, 4) = ((2-4+ 1) mod 5, (3-4 — 2) mod 5) = 
(9 mod 5, 10 mod 5) = (4, 0) 

b. G(2, 1) = (2:2 +1) mod5, (3:1 — 2) mod 5) = 
(5 mod 5, 1 mod 5) = (0, 1) 


x fæ g(x) 
0 | 4mod5=1 (0° +3-0+1) mod5 =1 
1 | 5mod5=0 (17+3-1+1) mod5 =0 


2 | 6 mod5=1 (27+3:2+1) mod5 = 1 
3 | Tmod5=4 (37+3:3+1) mod 5 = 4 
4 | 8&mod5=4 (4°+3-4+1) mod5 = 4 
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15. 


17. 
18. 


19. 


21. 


22. 


23. 


25. 
26. 


27. 


The table shows that f(x) = g(x) for every x in Js. Thus, 
by definition of equality of functions, f = g. 


F-G and G-F are equal because for every real number x, 


(F-G)(x) = F(x): G(x) by definition of F: G 


= G(x): F(x) by the commutative law for 
multiplication of real numbers 
= (G:F X(x) by definition of G- F. 


a 2=8 « 4)=4 


a. log;81 = 4 because = 81 
c. logs(35) = —3 because 3? = 5 


Let b be any positive real number with b # 1. Since 
b! = b, then log,b = 1 by definition of logarithm. 


Proof: Suppose b and u are any positive real numbers 
with b # 1. [We must show that log,(z) = —log,(u).] 
Let v = log, (4). By definition of logarithm, b” = r 
Multiplying both sides by u and dividing by b” 

gives u = b ”, and thus, by definition of logarithm, 

—v = log,(u). When both sides of this equation are 
multiplied by —1, the result is v = —log,(u). Therefore, 
logy) = — log, (u) because both expressions equal v. 
[This is what was to be shown.] 


Hint: Use a proof by contradiction. Suppose log37 is 
rational. Then log3;7 = ; for some integers a and b with 
b#0. 


Apply the definition of logarithm to rewrite log3;7 = E 
in exponential form. 

Suppose b and y are positive real numbers with 

log, y = 3. 

By definition of logarithm, this implies that b= y. 


Then 
1 1 1\° 
jars Za. ` 


b b 
Thus, by definition of logarithm (with base 1/b), 
log jp) = —3. 


a. p,(2, y) = 2, p65, x) = 5, range of pı = {2, 3, 5} 


a. mod(67, 10) = 7 and div(67, 10) = 6 since 
67 = 10-6 +7. 


Ff (aba) = 0 


[because there are no b’s to the left 
of the left-most a in aba] 

Ff (bbab) = 2 [because there are two b’s to the left 
of the left-most a in bbab] 


[because the string b contains no a’s] 


f(b) =0 
range of f = Zronneg 
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29. 
30. 


32. 


33. 


35. 


38. 


40. 


41. 


43. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


- a. E(0110) = 000111111000 and 


D(111111000111) = 1101 
a. (10101, 00011) = 3 


a. f(1, 1, 1) = (4-143-14+2:1) mod2 


= 9 mod2 = 1 
f(0,0, 1) = (4-0+3-0+2-1) mod 2 
=2mod2=0 


If g were well defined, then g(1/2) = 9(2/4) be- 
cause 1/2 = 2/4. However, g(1/2) = 1-2 = —1 
and g(2/4) = 2—4 = —2. Since —1 2. 
g(1/2) # g(2/4). Thus g is not well defined. 


Student B is correct. If R were well defined, then R(3) 
would have a uniquely determined value. However, on 
the one hand, R(3) = 2 because (3-2) mod 5 = 1, and, 
on the other hand, R(3) = 7 because (3:7) mod 5 = 1. 
Hence R(3) does not have a uniquely determined value, 
and so R is not well defined. 


>< 


at 


b. f(A) = iv}, FX = {tv}, FO) = {c}, 
f(D) = {a,b}, f'E) = Ø, fO) = 
{a,b,c} =X 


Partial answer: (i) y E F(A) or y E F(B) 


(ii)some (iii)AUB (iv) F(A U B) 


The statement is true. Proof: Let F be a function from 

X to Y, and suppose A C X, B C X, and A C B. Let 

y E F(A). [We must show that y E F(B).] By definition 
of image of a set, y = F (x) for some x € A. Thus since 


A C B, x E B, and so y = F(x) for some x E€ B. Hence 
y E F(B) [as was to be shown]. 


The statement is false. Counterexample: Let 
X = {1,2,3}, let Y = {a, b}, and define a function F: 
X — Y by the arrow diagram shown below. 


— 


Let A = {1,2} and B = {1,3}. Then F(A) = 
{a, b} = F(B), and so F(A) N F(B) = {a, b}. 
But F(A N B) = F({1}) = {a} + {a, b}. And so 
F(A) N F(B) £ F(A N B). 


(This is just one of many possible counterexamples.) 


45. The statement is true. Proof: Let F be a function from 
a set X to a set Y, and suppose C C Y, D C Y, and 
C CD. [We must show that F-\(C) C F~\(D).] Suppose 
xE FO). Then F(x) E C. Since C C D, F(x) E€ D 


also. Hence, by definition of inverse image, x € F~ '(D). 


[So F(C) C F~\(D).] 


46. Hint: xE F (CU D se F(x) ECUDS FW EC 
or F(x) E D 


51. a. $(15)=8 [because 1,2, 4,7,8, 11, 13, and 14 have 
no common factors with 15 other than +1] 
b. 6(2)=1 [because the only positive integer less 
than or equal to 2 having no common 
factors with 2 other than +1 is 1] 
c. 6(5)=4 [because 1, 2,3, and 4 have no common 
factors with 5 other than +1] 


52. Proof: Let p be any prime number and n any integer 
with n = 1. There are p” ' positive integers less than 
or equal to p” that have a common factor other than +1 
with p”—namely, p, 2p, 3p,..., (p 
are p"—p" | 


nat 


yp. Hence, there 
positive integers less than or equal to p” 


that do not have a common factor with p” except for +1. 


53. Hint: Use the result of exercise 52 with p = 2. 


SECTION 7.2 


1. The second statement is the contrapositive of the first. 
2. a. most 


3. Hint: One counterexample is given and explained 
below. Give a different counterexample and accompany 
it with an explanation. Counterexample: Consider the 
function defined by the following arrow diagram: 


— 


Observe that a is sent to exactly one element of Y, 
namely, u, and b is also sent to exactly one element of 
Y, namely, u also. So it is true that every element of X is 
sent to exactly one element of Y. But fis not one-to-one 
because f(a) = f(b) whereas a # b. [Note that to say, 
“Every element of X is sent to exactly one element of Y” is 
just another way of saying that in the arrow diagram for 


7.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-74 


10. 


Tl. 


the function there is only one arrow coming out of each 
element of X. But this statement is part of the definition of 
any function, not just of a one-to-one function.] 


Hint: The statement is true. 


Hint: One of the incorrect ways is (b). 


- a. fis not one-to-one because f(1) = 4 = f(9) and 


1 # 9. fis not onto because f(x) # 3 for any x in X. 
b. gis one-to-one because g(1) # g(5), g(1) # g(9), 

and g(5) # g(9). g is onto because each element 

of Y is the image of some element of X: 3 = g(5), 

4 = g(9), and 7 = g(1). 


a. F is not one-to-one because F(c) = e = F(d) 
and c # d. F is onto because each element 
of Y is the image of some element of X: 
e = F(c) = F(d), f = F(a), and g = F(b). 


a. One example of many is the following: 


a. (i) fis one-to-one. Proof: Suppose f(n,) = f(m) 
for some integers nı and nz. [We must show that 
nı = n>] By definition of f, 2n} = 2m, and divid- 
ing both sides by 2 gives n; = m [as was to be 
shown]. 


(ii) fis not onto. Counterexample: Consider 1 € Z. 
We claim that 1 # f(n), for any integer n, 
because if there were an integer n such that 
1 = f(n), then, by definition of f, 1 = 2n. Divid- 
ing both sides by 2 would give n = 1/2. But 
1/2 is not an integer. Hence 1 # f(n) for any 
integer n, and so fis not onto. 

b. his onto. Proof: Suppose m € 2Z. [We must show 
that there exists an integer such that h of that integer 
equals m.] Since m E 2Z, m = 2k for some integer 
k. Then h(k) = 2k = m. Hence there exists an inte- 
ger (namely, k) such that h(k) = m [as to be shown]. 


Hints: a. (i) gis one-to-one (ii) g is not onto 

b. Gis onto. Proof: Suppose y is any element of R. 
[We must show that there is an element x in R such 
that G(x) = y. What would x be if it exists? Scratch 
work shows that x would have to equal (y + 5)/4. 
The proof must then show that x has the necessary 
properties.] Let x = (y + 5)/4. Then (1) x E€ R, and 
(2) GX) = Gy + 5)/4) = 410 + 5)/41-5 = 
(y+5)—5 = y [as was to be shown]. 
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13. 


14. 


15. 


16. 


19. 


20. 


a. (i) His not one-to-one. Counterexample: 
H(1) = 1 = H(—1) but 1 il, 


Gi) H is not onto. Counterexample: H(x) # —1 for 
any real number x because H(x) = x and no 
real numbers have negative squares. 


The “proof” claims that fis one-to-one because for 
each integer n there is only one possible value for f(n). 
But to say that for each integer n there is only one pos- 
sible value for f(n) is just another way of saying that f 
satisfies one of the conditions necessary for it to be a 
function. To show that fis one-to-one, one must show 
that any integer n has a different function value from 
that of the integer m whenever n # m. 


fis one-to-one. Proof: Suppose f(x,) = f(x) where x, 
and x» are nonzero real numbers. [We must show that 


xı = X>.] By definition of f, 
yti æti 


xj X2 


Cross-multiplying gives 
Xo + XQ = Xia FX 


and so 


Xy =X, by subtracting xx from both sides. 


[This is what was to be shown.] 
fis not one-to-one. Counterexample: Note that 
xX) X 


atl tl 


24 = 24 
=> XX. Tt X] XXi T X2 


>x = m- x] 


> x2 x) = XQ Hy 


>x% =X or xx, = 1. 


Thus for a counterexample take any x, and x with 
xı # x but xx = 1. For instance, take x, = 2 
and x, = 1/2. Then f(x,) = f(2) = 2/5 and 

f(%2) = f(1/2) = 2/5, but 2 # 1/2. 


a. Note that because 1/302” = 59614581.7 and 


417302072 — 7-59614581 = 5, H(417302072) = 
417302072 mod 7 = 5. But position 5 is already 
occupied, so the next position is checked. It is free, 
and thus the record is placed in position 6. 


Recall that |x| = that unique integer n such that 

nsx<ntl. 

a. Floor is not one-to-one. Counterexample: 
Floor(0) = 0 = Floor(1/2) but 0 # 1/2. 

b. Floor is onto. Proof: Suppose m € Z. [We must 
show that there exists a real number y such that 
Floor(y) = m.] Let y = m. Then Floor(y) = Floor(m) 
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24. b. 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


= m since m is an integer. (Actually, Floor takes 
the value m for all real numbers in the interval 
m =x <m+1.) Hence there exists a real number y 
such that Floor(y) = m [as was to be shown]. 


a. Lis not one-to-one. Counterexample: L(0) = L(1) = 1 
but 1 # 0. 

b. Lis onto. Proof: Suppose n is a nonnegative integer. 
[We must show that there exists a string s in S such 
that L(s) = n.] Let 


E ba null string) ifn =0 
=> |00...0 ifn>0. 
n0’s 
Then L(s) = the length of s = n [as was to be 


shown]. 


. a. Fis not one-to-one. Let A = {a} and B = {b}. 


Then F(A) 1 but A # B. 


F(B) 


N is not onto. The number —1 is in Z but N(s) # —1 
for any string s in S because no string has a negative 
number of a’s. 


26. S is not one-to-one. Counterexample: $(6) = 1 + 


27. 
28. 


2+3+6 = 12and S(11) =1+11 = 12. So 

S(6) = S(11) but 6 # 11. 

S is not onto. Counterexample: In order for there to be 
a positive integer n such that S(n) = 5, n would have to 
be less than 5. But S(1) = 1, S(2) = 3, S(3) = 4, and 
S(4) = 7. Hence there is no positive integer n such that 
S(n) = 5. 


Hint: a. Tis one-to-one. b. Tis not onto. 


a. Gis one-to-one. Proof: Suppose (x, y1) and 
(X92, y2) are any elements of R X R such that 
G(x, y1) = Ga, y2). [We must show that 
(x1, yD) = (Xo, y2).] Then, by definition of G, 
(2y,, —x,) = (2y2, —X), and, by definition of 
ordered pair, 


2y; =2y, and ~x = ~x. 


Dividing both sides of the equation on the left by 
2 and both sides of the equation on the right by —1 
gives that 


yzy and x, =X, 


and so, by definition of ordered pair, (x), y1) = 
(X2, y2) [as was to be shown]. 
b. Gis onto. Proof: Suppose (u, v) is any element of 
R X R. [We must show that there is an element 
(x, y in R X R such that G(x, y) = (u, v).] Let 
(x, y) = (~v, u/2). Then (1) (x, y) E R X R and (2) 
G(x, y) = (2y, —x) = (2(u/2), =V) = (u, v) 


[as was to be shown]. 


30. a. 


31. 


32. 


33. 


35. 


36. 


Hint: Use properties of rational numbers from 

Section 4.3 and facts about rational and irrational 

numbers from Sections 4.7 and 4.8. 

b. Hint: Show that V3 is not the image of any ordered 
pair in Q x Q. 

a. Hint: F is one-to-one. Use the unique factorization 

of integers theorem in the proof. 


a. Let x = logg27 and y = log,3. [The question is: Is 
x = y?] By definition of logarithm, both of these 
equations can be written in exponential form as 


8°=27 and 2 =3. 


Now 8 = 2°. So 
gr = (PF = pam 


Also 27 = 3° and 3 = 2”. So 
27 = 3° = (XP = 2”. 


Hence, since 8° = 27, 


3, 
By (7.2.5), then, 
3x = 3y, 
and so 
x=y. 


But x = logg27 and y = log,3, and so logg27 = y = 
log23 and the answer to the question is yes. 


Proof: Suppose that b, x, and y are any positive real 
numbers such that b # 1. Let u = log,(x) and v = 
log, (y). By definition of logarithm, b“ = x and b” = y. 


By substitution, 3 = 9 = b"™” [by (7.2.3) and the fact 
that b” = vi Translating ý = b“~" into logarithmic 


form gives logy (¥) = u — yv, and so, by substitution, 


log, (#) = log, (x) — log, (y) [as was to be shown]. 


Start of Proof: Suppose a, b, and x are any [particular 
but arbitrarily chosen] real numbers such that b 


and x are positive and b # 1. [We must show that 
log, (x^) = a log, x.] Let 


r = log,(x“) and s = log, x. 


No. Counterexample: Define f: R — Rand g: R —> R 
as follows: f(x) = x and g(x) = —x for every real num- 
ber x. Then fand g are both one-to-one /because for all 
real numbers x, and x», if f (x1) = f(x2) then x; = x, and 
if g(x) = g(X2) then —x, = —X9, s0 X; = xX in this case as 
well]. But f+ g is not one-to-one [because f+ g satis- 
fies the equation (f+ g)(x) = x +(—x) = 0 for every real 
number x, and so, for instance, (f+ g)(1) = (f+ g)(2) but 
142). 


7.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


38. 


40. 


42. 


44. 


45. 


46. 


47. 
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Yes. Proof: Let fbe a one-to-one function from R to R, 
and let c be any nonzero real number. Suppose 

(cf) (x1) = (c-f) (2). [We must show that x; = xz] It 
follows by definition of (c-f) that c-(f(x,)) = c- (f@)). 
Since c # 0, we may divide both sides of the equation 
by c to obtain f(x,) = f(x2). And since fis one-to-one, 
this implies that x; = x /as was to be shown]. 


a. Hint: The assumption that F is one-to-one is needed 
in the proof that F'(F(A)) CA. If F(r) E F(A), 
the definition of image of a set implies that there is 
an element x in A such that F(r) = F(x). 

b. Hint: The assumption that F is one-to-one is needed 
in the proof that F(A,) O F(A2) C F(A, N Ao). If 
u E F(A,) and u E F(A3), then the definition of im- 
age of a set implies that there are elements x, in A, 
and x, in A, such that F(x,) = u and F(x) = u and, 
thus, that F(x) = F(x). 


F-! 
Lo 
= 


The function is not a one-to-one correspondence be- 
cause it is not onto. 


The answer to exercise 10(b) shows that h is onto. To 
show that h is one-to-one, suppose h(n) = h(n). By 
definition of h, this implies that 2n; = 2n,. Dividing 
both sides by 2 gives n; = n>. Hence h is one-to-one, 
and so A is a one-to-one correspondence. 


Given any even integer m, if m = h(n), then by defini- 
tion of h, m = 2n, and son = m/2. Thus 


h` '(m) = 5 for every m E 2Z. 


The function g is not a one-to-one correspondence 
because it is not onto. For instance, if m = 2, it is im- 
possible to find an integer n such that g(n) = m. (This 
is because if g(n) = m, then 4n — 5 = 2, which implies 
that n = 7/4. Thus the only number n with the property 
that g(n) = mis 7/4. But 7/4 is not an integer.) 


The answer to exercise 11b shows that G is onto. 

In addition, G is one-to-one. To prove this, sup- 
pose G(x,) = G(x) for some x; and x, in R. 

[We must show that x, = x>.] By definition of G, 

4x, —5 = 4x. —5. Add 5 to both sides of this equa- 
tion and divide both sides by 4 to obtain xı = x2 [as 
was to be shown]. We claim that G '(y) = (vy +5)/4 
for each y in R. By definition of inverse function, 
this is true if, and only if, G( + 5)/4) = y. But 
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50. 


52. 


53. 


57. 


58. 


G((y + 5)/4) = AO + 5)/4) —5 = (vy +5) —5 = y, and 
so it is the case that GO) = (y + 5)/4 for each y in R. 


The function L is not a one-to-one correspondence 
because it is not one-to-one. 


The answer to exercise 15 shows that fis one-to-one, 
and if the co-domain is taken to be the set of all 

real numbers not equal to 1, then fis also onto. The 
reason is that given any real number y # 1, if we take 
x= rae then x is a real number and 


1 
—+1 


_f 1 \_y-t _ 1+0-D_ 
fx) ( -) T y. 


y= 


1 
y-l 


Thus f '(y) = for each real number y # 1. 


The answer to exercise 16 in Appendix B shows that 
fis not one-to-one. Therefore, it is not a one-to-one 
correspondence. 


Hint: Let a function F be given and suppose the 
domain of F is represented as a one-dimensional array 
a{1], a[2],..., a[n]. Introduce a variable answer whose 
initial value is “one-to-one.” The main part of the body 
of the algorithm could be written as follows: 
while (i = n— 1 and answer = “‘one-to-one’’) 
ji=itl 
while (j = n and answer = “one-to-one”) 
if (F(ali]) = F(a{j]) and afi] # a[j]) 
then answer := “not one-to-one” 
j:=j+1 
end while 
i:=i+1 
end while 


What can you say if execution reaches this point? 


Hint: Let a function F be given and suppose the 
domain and co-domain of F are represented by the 
one-dimensional arrays a[1], a[2],..., a[n] and b[1], 
b[2],..., bim], respectively. Introduce a variable 
answer whose initial value is “onto.” For each 

b[i] from i = 1 to m, make a search through a[1], 
a(2],..., a[n] to check whether b[i] = F(a[j]) for 
some a[j]. Introduce a Boolean variable to indicate 
whether a search has been successful. (Set the vari- 
able equal to 0 before the start of each search, and 
let it have the value 1 if the search is successful.) 
At the end of each search, check the value of the 
Boolean variable. If it is 0, then F is not onto. If all 
searches are successful, then F is onto. 
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SECTION 7.3 


1. 


7. 


9. 


10. 


g ° f is defined as follows: 

(g °f)C) = 800) = 8G) = 1 

(g °f)G) = 80G) = 88) = 5 

(ge NS = g(f(5)) = gC) = 3. 
fe gis defined as follows: 

(f° gC) = f(g) = fGB) = 3 

(f° 86) = fG) =fG) = 1 

Fe 8X5) = f(gG)) = fC) = 5. 
Then g ° f # f° g because, for example, (g ° f) 
(1) # F° 8)0). 


- (G° F\(x) = GEF(x)) = G(x) =x-1 for every real 


number x. 

(F ° G\(x) = F(G(x)) = F(x— 1) = (x— 1)? for every 
real number x. 

G ° F + F ° G because, for instance, (G ° F)(2) = 
2—1 = 7, whereas (F ° G)(2) = (2-19 = 1. 


(G ° F)(0) = G(F(0)) = G(7-0) = GO) 


=0mod5=0 

(Ge F)(1) = GU) = G(7- 1) = G(7) 
= 7mod5=2 

(G ° F)(2) = G(F(2)) = G7-2) = G4) 
= 14mod5=4 

(G ° FX) = G(F(3)) = G7-3) = G21) 
= 21 mod5= 1 

(G ° F)(4) = G(F(4)) = G7-4) = G(28) 
= 28 mod 5 = 3 


a. Partial answer: 
(L° M)(12) = L(M(12)) = L(12 mod 5) = L(2) 


== 
(M ° L)(12) = M(L(12)) = M(12”) = M(144) 
= 144 mod 5 = 4 


a. (T° L)(abaa) = T(L(abaa)) = T(4) 
=4mod3= 1 


a. (Go F\(2) = G(F(2)) = G(2?/3) = G(4/3) = 
[4/3] = 1 


a. (Go F)(8) = G(F(8)) = G(16) = [16/2] = 8 
(F ° G)(8) = F(G(8)) = F(8/2) = F(4) = 8 
(G ° FXG) = G(F(3)) = G6) = [6/2] = 3 
(F ° G)(3) = F(G(3)) = F(3/2)) = FC) = 2 
b. G° F + F ° G because (G ° F)(3) = (F ° G)(3)= 2 
and 3 # 2. 


12. (F ° Fœ) = F (FQ) = F 1x42) 


Bx+2)-2 3 
= 3 = 3 = x = Ra) 


15. 


16. 


18. 
19. 


21. 


22. 


24. 


for every x in R. Hence F!loF=Ip by definition of 
equality of functions. 


=p 
(Fe F0) = FFO) = (7) 


(2) 2-6 


=y= RO) 


for every y in R. Hence F ° F | = Ig by definition of 
equality of functions. 


a. By definition of logarithm with base b, for each real 
number x, log, (b*) is the exponent to which b must 
be raised to obtain b”. But this exponent is just x. So 
log, (b') = x. 


Hint: Suppose fis any function from a set X to a set Y, 
and show that for every x in X, (Iy ° f)(x) = f(). 

a. Sk = Sim 

The answer is no. 


Counterexample: Define f and g by the arrow diagrams 
below. 


Then g ° f is one-to-one but g is not one-to-one. (This 
is one counterexample among many. Can you construct 
a different one?) 


Hint: Suppose f: X — Y and g: Y — Z are func- 

tions and g ° f is one-to-one. Given xı and x2 in X, if 
flay) = fla) then (g ° f1) = (8 ° f(a). (Why?) Then 
use the fact that g ° f is one-to-one. 


Hint: Suppose f: X — Y and g: Y — Z are functions 
and g ° f is onto. Given z € Z, there is an element x in 
X such that (g ° f)(x) = z. (Why?) If y = f(x), what can 
you deduce about g(y)? 


True. Proof: Suppose X is any set and f, g, and h are 
functions from X to X such that h is one-to-one and 

he f= hoe g. [We must show that for every x in X, 

f(x) = g(x).] Suppose x is any element in X. Because 
ho f= he g, we have that (h ° f Xx) = (h ° g)(x) by 
definition of equality of functions. Then, by definition 
of composition of functions, h(f(x)) = h(g(x)). And 
since h is one-to-one, this implies that f(x) = g(x) [as 
was to be shown]. 


7.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES 
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26. 

X gof a Z igei E 
-~a a 
Ca ce eae 
FA g7! Y Y fo x 
-~a — 
ie \ ee} 
p=) | 

Z fog! X 


-i 
=<) 
er 


The functions (g ° f) ' and f 'og ' are equal. 


29. Hints: (1) Theorems 7.3.3 and 7.3.4 taken together insure 


that g ° f is one-to-one and onto. (2) Use the inverse 
function property: F !(b) =a F(a) = b, for every a 
in the domain of F and every b in the domain of F E 


SECTION 7.4 


1. 


The student should have replied that for A to have the 
same cardinality as B means that there is a function 
from A to B that is one-to-one and onto. A set cannot 
have the property of being onto or one-to-one another 
set; only a function can have these properties. 


. Define a function f: Z* — S as follows: For every posi- 


tive integer k, f(k) = K. 

fis one-to-one: [We must show that for all k; and 

ky EZ", if f(k) = f(k) then k, = ky.] Suppose k, and kz 
are positive integers and f(k,) = f(k2). By definition of 
f. (kO? = (kY, so ky = +k». But kı and ky are positive. 
Hence kı = ko. 

fis onto: [We must show that for each n E S, there 

exists k E Z* such that n = f(k).] Suppose n € S. By 
definition of S, n = K? for some positive integer k. Then 
by definition of f, n = f(k). 


Since there is a one-to-one, onto function (namely, f) 
from Z” to S, the two sets have the same cardinality. 
Define f: Z — 3Z by the rule f(n) = 3n for each 
integer n. 


The function fis one-to-one because for any integers n 
and m, if f(n) = f(m) then 3n, = 37, and so ny = ny. 
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10. 


Tl. 


13. 


16. 


Also fis onto because if m is any element in 3Z, then 

m = 3k for some integer k. Then f(k) = 3k = m by defi- 
nition of f. Thus, since there is a function f: Z — 3Z that 
is one-to-one and onto, Z has the same cardinality as 3Z. 


Hint: If m E 2Z, show that J(m) = J(m + 1) = m. 
b. For each positive integer n, F(n) = (— Dll 


It was shown in Example 7.4.2 that Z is countably 
infinite, which means that Z* has the same cardinal- 
ity as Z. By exercise 3, Z has the same cardinality as 
3Z. It follows by the transitive property of cardinality 
(Theorem 7.4.1 (c)) that Z* has the same cardinality as 
3Z. Thus 3Z is countably infinite /by definition of count- 
ably infinite], and hence 3Z is countable [by definition of 
countable]. 


Proof: Define f: S — U by the rule f(x) = 2x for each 
real number x in S. Then fis one-to-one by the same 
argument as in exercise 10a of Section 7.2 with R in 
place of Z. Furthermore, fis onto because if y is any 
element in U, then 0 < y < 2 and so 0 < y/2 < 1. Con- 
sequently, y/2 E€ S and f(y/2) = 2(y/2) = y. Hence fis 
a one-to-one correspondence, and so S and U have the 
same cardinality. 


Hint: Define h: S — V as follows: h(x) = 3x + 2, for 
every x E S. 


It is clear from the graph that fis one-to-one (since it is 
increasing) and that the image of fis all of R (since the 
lines x = 0 and x = 1 are vertical asymptotes). Thus S$ 
and R have the same cardinality. 


In Example 7.4.4 it was shown that there is a one-to-one 
correspondence from Z* to Q*. This implies that the 
positive rational numbers can be written as an infinite 
sequence: F1, F2, F3, 4,.... Now the set Q of all rational 
numbers consists of the numbers in this sequence 
together with O and the negative rational numbers: 


SF is. ~T —1 33: irse 
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22. 


23. 


25. 


26. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Let rọ = 0. Then the elements of the set of all rational 
numbers can be “counted” as follows: 


Fo, Fi, TF1, F2, 125 F3, F3, F4, ~V4y 


In other words, we can define a one-to-one correspon- 
dence as follows: for each integer n = 1, 


Tn/2 if nis even 


a= | if n is odd 


=r (n—1)/2 
Therefore, Q is countably infinite and hence countable. 


Hint: See the hints for exercises 18 and 19 in Section 4.3. 


© Hint: No. 


. Hint: Suppose r and s are real numbers with 


é 2 
s >r > 0. Letn be an integer such that n > s2 
and let m = Pou 1. Show that m >“ = m- 1, 


V2. V2 
and use the fact that s = r + (s — r) to conclude that 
ye <s, 


Hint: Use the unique factorization of integers theorem 
(Theorem 4.4.5) and Theorem 7.4.3. 


a. Define a function G: Z’"°""® — Zs x Zrennes 
as follows: Let G(0) = (0, 0), and then follow 
the arrows in the diagram, letting each succes- 
sive ordered pair of integers be the value of G for 
the next successive integer. Thus, for instance, 
G(1) = (1, 0), G(2) = (0, 1), GG) = (2, 0), 
G(4) = (1, 1), GS) = (0, 2), G(6) = (3, 0), 
G(7) = (2, 1), G(8) = (1, 2), and so forth. 

b. Hint: Observe that if the top ordered pair of any given 
diagonal is (k, 0), the entire diagonal (moving from top 
to bottom) consists of (k, 0), (k — 1, 1), (k— 2, 2),..., 
(2, k— 2), (1, k— 1), (0, k). Thus for every ordered pair 
(m, n) within any given diagonal, the value of m + n is 
constant, and as you move down the ordered pairs in 
the diagonal, starting at the top, the value of the second 
element of the pair keeps increasing by 1. 


Hint: There are at least two different approaches 

to this problem. One is to use the method dis- 

cussed in Section 4.3. Another is to suppose that 
1.999999 ... < 2 and derive a contradiction. (Show 
that the difference between 2 and 1.999999 ... can be 
made smaller than any given positive number.) 


Proof: Let A be an infinite set. Construct a countably 
infinite subset a4, a2, a3,...of A, by letting a, be any ele- 
ment of A, letting ay be any element of A other than a4, 
letting a3 be any element of A other than a, or az, and so 
forth. This process never stops (and hence ay, a2, a3, . . . 
is an infinite sequence) because A is an infinite set. More 
formally, 


27. 


29. 


30. 


31. 


32. 


1. Let a; be any element of A. 


2. For each integer n # 2, let a, be any element of 
, 4,—1}. Such an element exists, for 
. ,A,—1} would be empty 


A- {a], a2, A3,... 
otherwise A — {4}, do, 43, . . 
and A would be finite. 


Proof: Suppose A is any countably infinite set, B is any 
set, and g: A — B is onto. Since A is countably infinite, 
there is a one-to-one correspondence f: Z* —> A. Then, 
in particular, fis onto, and so by Theorem 7.3.4, g ° f 

is an onto function from Z* to B. Define a function h: 

B — Z as follows: Suppose x is any element of B. Since 
g ° f is onto, {m E Z” |(g ° f) (m) = x} + Ø. Thus, by 
the well-ordering principle for the integers, this set has 

a least element. In other words, there is a least positive 
integer n with (g ° f)(n) = x. Let h(x) be this integer. 

We claim that h is a one-to-one. Suppose h(x) = 

h(x) = n. By definition of h, n is the least positive inte- 
ger with (g ° f)(n) = xı. Moreover, by definition of h, n 
is the least positive integer with (g ° f )(n) = x. Hence 
xı = (8 ° f Xn) = x2. 

Thus / is a one-to-one correspondence between B and a 
subset S of positive integers (the range of h). Since any 
subset of a countable set is countable (Theorem 7.4.3), 

S is countable, and so there is a one-to-one correspon- 
dence between B and a countable set. It follows from 
the transitive property of cardinality that B is countable. 


Hint: Suppose A and B are any two countably infinite 
sets. Then there are one-to-one correspondences 

fi: Z — A andg: Z* SB. 

Case 1 (A N B = Ø): In this case define h: Z* > AUB 
as follows: For every integer n = 1, 


_ [fD 
a Le +1)/2) 


if n is even 
if n is odd. 


Show that h is one-to-one and onto. 

Case 2 (A N B € ©): In this case let C = B — A. Then 
AUB=AUCandAN C= Ø. If C is countably 
infinite, use the result of case 1 to complete the proof. 
If C is finite, use the result of exercise 28 to complete 
the proof. 


Hint: Use proof by contradiction and the fact that the 
set of all real numbers is uncountable. 


Hint: Consider the following cases: (1) A and B 

are both finite, (2) at least one of A or B is infinite 

and A N B = ©, (3) at least one of A or B is in- 

finite and A N B + Ø. In case 3 use the fact that 

A U B = (A — B) U (B — A) U (A N B) and that the sets 
(A — B), (B — A), and (A N B) are mutually disjoint. 


Hint: Use the one-to-one correspondence F: Z* — Z of 
Example 7.4.2 to define a function G: Z* X Z* > Z X Z 


8.1 


34. 


35. 


37. 
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by the formula G(m, n) = (F(m), Fn). Show that G is a 
one-to-one correspondence, and use the result of exer- 
cise 22 and the transitive property of cardinality. 


Hint for Solution 1: Define a function f: P(S) —> T as 
follows: For each subset A of S, let f(A) = xa, the char- 
acteristic function of A, where 

xa: S — {0, 1} is defined by the rule 


1 ifxEA 


Xa) = l if x € A for every x € S. 


Show that fis one-to-one (for all subsets A, and A, in 

S, if Xa, = Xa, then Aj = A) and that fis onto (given 
any function g: S — {0, 1}, there is a subset A of S such 
that g = xa). 

Hint for Solution 2: Define H: T — P(S) by letting 
H(f) = {x E S | f(x) = 1}. Show that H is a one-to-one 
correspondence. 


Partial proof (by contradiction): Suppose not. Suppose 
there is a one-to-one, onto function f: S — P(S). Let 


A={x€S|x€f@}. 


Then A E€ P(S) and since fis onto, there exists z E S$ such 
that A = f(z). [Now derive a contradiction!] 


Hint: Since A and B are countable, their elements can 
be listed as 


A: dj, 4, 43,.... and B: by, bo, b3,... 


Represent the elements of A X B in a grid: 
(ai, bı) (a, b2) (ay, b3) « 
(ao, bı) (do, bo) (a, b3) PPN 
(43, bı) (az, b2) (az, b3)... 


Now use a counting method similar to that 
of Example 7.4.4. 


SECTION 8.1 


1. 


a. 0 E 0 because 0— 0 = 0 = 2-0, so 2 | (0 — 0). 
5 É 2 because 5 — 2 = 3 and 3 # 2k for any integer 
k so2 (5-2). 
(6, 6) E E because 6 — 6 = 0 = 2-0, so 2 | (6 — 6). 
(—1, 7) E E because —1 —7 = —8 = 2: (—4), so 
2|(-1-7). 


. Hint: To prove a statement of the form p <+> (q v r), you 


need to prove both (1) p — (q v r) and (2) (q v r) > p. 
The easiest way to prove p — (q v r) is to prove the 
logically equivalent statement form (p ^A ~q) — r. And 
the easiest way to prove (q V r) —> p is to prove the 
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logically equivalent statement form (q —> p) ^ (r —> p). 16. Hint: See Example 8.1.6. 


In this case, suppose m and n are any integers, and let 


p be “m —n is even,” let q be “both m and n are even,” 19. A X B = {(2, 6), (2, 8), @, 10), 4, 6), 4, 8), 4, 10)} 


and let r be “both m and n are odd.” R = {(2, 6), (2, 8), 2, 10), 4, 8)} 
S = {(2, 6), 4, 8)} 
3. a. 1071 because 10— 1 = 9 = 3-3, and so RUS=R, RAS=S 
3| (10-1). ; 
1 T 10 because 1 — 10 = —9 = 3-(—3), and so an 7 
3| -— 10). d 


2 T 2 because 2—2 = 0 = 3-0, and so 3 | (2—2). 
8 7 1 because 8 — 1 = 7 # 3k, for any integer k. So 


4 
+° The shaded region 
/’«— is R. The dashed 


3 1 (8-1). p line is not included. 
b. One possible answer: 3, 6, 9, —3, —6 Je" sx 
e. Hint: All integers of the form 3k + 1, for some p 
integer k, are related by T to 1. ; y 
4. a. Yes, because 15 and 25 are both divisible by 5, , 
which is prime. d 
b. No, because 22 and 27 have no common prime á 
factor. 
y 
5. a. Yes, because both {a, b} and {b, c} have two A 
elements. 
6. a. No, because {a} N {c} = Ø. S consists of the 
points on this line. 
7. a. Yes. 1 R (—9) $ 5| °- (-9°). But 
1?°—(—9)} = 1—81 = —80, and 5 | (—80) because aa 
—80 = 5-(—16). 
8. a. Yes, because both abaa and abba have the same 
first two characters ab. 
b. No, because the first two characters of aabb are dif- 
ferent from the first two characters of bbaa. 
y 
9. a. Yes, because the sum of the characters in 0121 is 4 A 


and the sum of the characters in 2200 is also 4. 
b. No, because the sum of the characters in 1011 is 3, 
whereas the sum of the characters in 2101 is 4. 


The shaded region 
<— is RUS. The line 
y =x is included. 


10. R = {(3, 4), 38, 5), (3, 6), (4, 5), (4, 6), 6, 6)} 
R™' = {(4, 3), 6, 3), ©, 3), (5, 4), (6, 4), (6, 5)} 


>x 


Note that the union of the 

“less than” relation, <, and 
the “equals” relation, =, is 
the “less than or equal to” 

relation, <. 


12. a. No. If F: X — Y is not onto, then F™! fails to be 
defined on all of Y. In other words, there is an ele- 
ment y in Y such that (y, x)  F`' for any x € X. 
Consequently, F T! does not satisfy property (1) of 
the definition of function. 


The graph of the intersection of R and S is obtained by 
finding the set of all points common to both graphs. 
But there are no points for which both x < y and x = y. 
Hence R N S = Ø and the graph consists of no points 
at all. 


24. a. 574329 Tak Kurosawa 
011985 John Schmidt 


Ke 
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SECTION 8.2 


1. 


Ri: 
a. 


2 —— 
b. R; is not reflexive: 2 R,2. 


c. R is not symmetric: 2 R} 3 but 3 K, 2. 
d. R; is not transitive: 1 R, 0 and 0 R} 3 but 1 R, 3. 


a. (Oe e] 


>) 

b. R; is not reflexive: (0, 0) € R3. 

c. R; is symmetric. (If R3 were not symmetric, there 
would be elements x and y in A = {0, 1, 2, 3} such 
that (x, y) E R; but (y, x) É R3. It is clear by inspec- 
tion that no such elements exist.) 

d. R; is not transitive: (2, 3) E R3 and 
(3, 2) E R; but (2, 2) Æ R3. 


Re: 
a. 0 el 


20 e3 

b. Rg is not reflexive: (0, 0) É Re. 

Rg is not symmetric: (0, 1) E Re but (1, 0) É Re- 

d. Rg is transitive. (If Rẹ were not transitive, there 
would be elements x, y, and z in {0, 1, 2, 3} such 
that (x, y) © Re and (y, z) E Re and (x, z) É Re. It is 
clear by inspection that no such elements exist.) 


9 


R is reflexive: R is reflexive + for every real number x, 
x R x. By definition of R, this means that for every real 
number x, x = x. In other words, for every real number 
x, x > x or x = x, which is true. 

R is not symmetric: R is symmetric + for all real num- 
bers x and y, if x R y then y R x. By definition of R, this 
means that for all real numbers x and y, if x = y then 
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Tl. 


12. 


15. 
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y = x. The following counterexample shows that this is 
false. x = 1 and y = 0. Then x = y, but y 4 x because 
1=Oand0 # 1. 


R is transitive: R is transitive <> for all real numbers 
x, y, and z, if x R y and y R z then x R z. By definition 
of R, this means that for all real numbers x, y, and z, if 
x = yand y = z then x = z. This is true by definition 
of = and the transitive property of order for the real 
numbers. (See Appendix A, T18.) 


D is reflexive: For D to be reflexive means that for ev- 
ery real number x, x D x. By definition of D, this means 
that for every real number x, xx = r= 0, which is true. 


D is symmetric: For D to be symmetric means that for 
all real numbers x and y, if x D y then y D x. By defini- 
tion of D, this means that for all real numbers x and y, 
if xy = 0 then yx = 0, which is true by the commutative 
law of multiplication. 


Dis not transitive: For D to be transitive means that 
for all real numbers x, y, and z, if x D y and y D z then 
x D z. By definition of D, this means that for all real 
numbers x, y, and z, if xy = 0 and yz = 0 then xz = 0. 
This is false because there exist real numbers x, y, and 
z such that xy = 0 and yz = 0 but xz # 0. As a coun- 
terexample, let x = 1, y = 0, and z = —1. Then x Dy 
and y D z because 1:0 = 0 and 0-(—1) = 0. But x Ø z 
because 1-(—1) # 0. 


E is reflexive: [We must show that for every integer m, m 
E m.] Suppose m is any integer. Since m — m = 0 and 

4 | 0, we have that 4 | (m — m). Consequently, m E m by 
definition of E. 


E is symmetric: [We must show that for all integers m 
and n, ifm En then n E m.] Suppose m and n are any 
integers such that m E n. By definition of E, this means 
that 4 | (m—n), and so, by definition of divisibility, 
m—n = 4r for some integer r. Now n—m = —(m—n). 
Hence, by substitution, (4r) = 4(—r). It fol- 
lows that 4 | (n — m) by definition of divisibility (since 
—r is an integer), and thus n E m by definition of E. 


m = 


E is transitive: [We must show that for all integers m, n, 
and p ifm E n andn E p then m E p.] Suppose m, n, and p 
are any integers such that m E n and n E p. By defini- 
tion of E this means that 4 | (m — n) and 4 | (n — p), 

and so, by definition of divisibility, m — n = 4r for 
some integer r and n — p = 4s for some integer s. Now 
m—p = (m—n)+(n-—p). Hence, by substitution, 
m—p = 4r + 4s = 4(r + s). It follows that 4 | (m — p) by 
definition of divisibility (since r + s is an integer), and 
thus m E p by definition of E. 


D is reflexive: [We must show that for every positive inte- 
germ, m D m.] Suppose m is any positive integer. Since 
m = m: 1, by definition of divisibility m | m. Hence 

m D m by definition of D. 
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18. 


20. 


23. 


25. 


Dis not symmetric: For D to be symmetric would 
mean that for all positive integers m and n, if m Dn 
then n D m. By definition of divisibility, this would 
mean that for all positive integers m and n, if m | n then 
n |m. A counterexample shows that this is false. Let 

m = 2andn = 4. Then m|n because 2 | 4 but n { m 
because 4 f 2. 


D is transitive: To prove transitivity of D, we must 
show that for all positive integers m, n, and p, if m Dn 
and n D p then m D p. By definition of D, this means 
that for all positive integers m, n, and p, if m | n and 

n | p then m | p. But this is true by Theorem 4.4.3 (the 
transitivity of divisibility). 


Hint: Q is reflexive, symmetric, and transitive. 


E is reflexive: E is reflexive + for every subset A of 
X, A E A. By definition of E, this means that for every 
subset A of X, A has the same number of elements as A, 
which is true. 


E is symmetric: E is symmetric = for all subsets A 
and B of X, if A E B then B E A. By definition of E, this 
means that if A has the same number of elements as B, 
then B has the same number of elements as A, which is 
true. 


Eis transitive: E is transitive = for all subsets A, B, and 
C of X, if A E B and BE C then A E C. By definition of E, 
this means that for all subsets, A, B, and C of X, if A has 
the same number of elements as B and B has the number 
of elements as C, then A has the same number of elements 
as C, which is true. 


S is reflexive: S is reflexive + for every subset A of 

X, A S A. By definition of S, this means that for every 
subset A of X, A C A. This is true because every set is a 
subset of itself. 


S is not symmetric: S is symmetric + for all subsets A 
and B of X, if A S B then B S A. By definition of S, this 
means that for all subsets A and B of X, if A C B then 
B C A. This is false because X # Ø and so there is an 
element, say a, in X. As a counterexample, take A = Ø 
and B = {a}. Then A C B but B CA. 


S is transitive: S is transitive <+ for all subsets A, B, 
and C of X, if A S B and B S C, then A S C. By defini- 
tion of S, this means that for all subsets A, B, and C of 
X, if A C Band B C C then A C C, which is true by the 
transitive property of subsets (Theorem 6.2.1 (3). 


R is reflexive: Suppose s is any string in A. Then s R s 
because s has the same first two characters as s. 


R is symmetric: Suppose s and t are any strings in A 
such that s R t. By definition of R, s has the same first 
two characters as t. It follows that t has the same first 
two characters as s,andsotRs. 
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R is transitive: Suppose s, t, and u are any strings in A 
such that s R t and t R u. By definition of R, s has the 
same first two characters as ft and t has the same first 
two characters as u. It follows that s has the same two 
characters as u, and so s R u. 


l is reflexive: [We must show that for every statement p, 

p \p.] Suppose p is a statement. The only way a condi- 
tional statement can be false is for its hypothesis to be 
true and its conclusion false. Consider the statement 

p — p. Both the hypothesis and the conclusion have the 
same truth value. Thus it is impossible for p — p to be 
false, and so p — p must be true. 


lis not symmetric: | is symmetric + for all statements 
p and q, if p l q then q | p. By definition of I, this means 
that for all statements p and q, if p — q then q > p. 
But this is false: there are statements p and q such that 
p — q is true and q — p is false. For instance, let p 

be “10 is divisible by 4” and let q be “10 is divisible by 
2.” Then p — q is “Tf 10 is divisible by 4, then 10 is 
divisible by 2.” This is true because its hypothesis, p, 
is false. On the other hand, g — p is “If 10 is divisible 
by 2, then 10 is divisible by 4.” This is false because its 
hypothesis, q, is true and its conclusion, p, is false. 


lis transitive: [We must show that for all statements p, 

q, andr, ifp |q and q I r then p \r.] Suppose p, q, and r 
are statements such that p I q and q l r. By definition of 
l, this means that p — q and q > r are both true. By 
transitivity of if-then (Example 2.3.6 and exercise 20 of 
Section 2.3), we can conclude that p — r is true. Hence, 
by definition of I, p Ir. 


Fis reflexive: F is reflexive = for all elements (x, y) in 
R X R, (a, y) F (x, y). By definition of F, this means that 
for all elements (x, y) in R X R, x = x, which is true. 


Fis symmetric: [We must show that for all elements 

(x1, y) and (Xp, y2) in R X R, if œ, y1) F (x2, y2) then 

(X, Yo) F Œ, y,)./ Suppose (x), y1) and (x5, y2) are elements 
of R X R such that (4, yı) F (9, y2). By definition of F, 
this means that x, = x». By symmetry of equality, 

x2 = xı. Thus, by definition of F, (x2, y2) F (@y, y1). 


Fis transitive: [We must show that for all elements 

(x1, y1), 2, Y2), and (x3, y3) in R X R, if Œ, y1) F (2, y2) 
and (Xp, y2) F (x3, y3) then (x1, y1) F (x3, y3).] Suppose 

(x1, Y1), (%2, Y2), and (x3, y3) are elements of R X R 
such that (x), yı) F @2, y2) and (%2, y2) F (x3, y3). By 
definition of F, this means that x, = x and x = x3. By 
transitivity of equality, x; = x3. Hence, by definition of 
F, (x), y1) F (x3, y3). 


R is reflexive: R is reflexive + for every person p 

in A, p R p. By definition of R, this means that for 
every person p living in the world today, p lives within 
100 kilometers of p, which is true. 


34. 


37. 


38. 
41. 


43. 


45. 


R is symmetric: [We must show that for all people p and 
qin A, ifp R q then q R p.] Suppose p and q are people in 
A such that p R q. By definition of R, this means that p 
lives within 100 kilometers of q. This implies that q lives 
within 100 kilometers of p. So, by definition of R, q R p. 


R is not transitive: R is transitive + for all people p, 

q, and r, if p R q and q R r then p R r. This is false. As a 
counterexample, take p to be an inhabitant of Chicago, 
Illinois, q an inhabitant of Kankakee, Illinois, and r 

an inhabitant of Cullom, Illinois. Then p R q because 
Chicago is less than 100 kilometers from Kankakee, 
and q R r because Kankakee is less than 100 kilometers 
from Cullom, but p & r because Chicago is not less than 
100 kilometers from Cullom. 


Proof: Suppose R is any reflexive relation on a set A. 
[We must show that R! is reflexive. To show this, we must 
show that for every x in A, x R`! x.] Given any element x 
in A, since R is reflexive, x R x, and by definition of 
relation, this means that (x, x) € R. It follows, by 
definition of the inverse of a relation, that (x, x) € R, 
and so, by definition of relation, x R! x [as was to be 
shown]. 


a. RN S is reflexive: Suppose R and S are reflexive. 
[To show that R N S is reflexive, we must show that 
Vx € A, (x, x) E RAN S.] So suppose x € A. Since 
R is reflexive, (x, x) E R, and since S is reflexive, 
(x, x) E S. Thus, by definition of intersection, 
(x, x) E R N S [as was to be shown]. 


Hint: The answer is yes. 


Yes. To prove this we must show that for all x and y in 
A, if (x, y) E R U S then (y, x) E R U S. So suppose 

(x, y) is a particular but arbitrarily chosen element in 
RUS. [We must show that (y, x) E R U S.] By definition 
of union, (x, y) E R or (x, y) E S. In case (x, y) E R, 
then (y, x) E R because R is symmetric, and hence 

O, x) E R U S by definition of union. In case (x, y) E S 
then (y, x) E S because S is symmetric, and hence 

O, x) E R U S by definition of union. Thus, in both 
cases, (y, x) E R U S [as was to be shown]. 


R; is not irreflexive because (0, 0) E R4. R; is not 
asymmetric because (0, 1) E€ R; and (1, 0) E R4. R, is 
not intransitive because (0, 1) E R} and (1, 0) E R4 and 
(0, 0) ER. 


R; is irreflexive because no element of A is related by R3 
to itself. R3 is not asymmetric because (2, 3) E R3 and 
(3, 2) E R3. R3 is intransitive. To see why, observe that 
R; consists only of (2, 3) and (3, 2). Now (2, 3) E R3 
and (3, 2) E R; but (2, 2) € R3. Also (3, 2) E R3 and 
(2, 3) E R; but (3, 3) É R3. 


8.3  SOLUTIONSAND HINTS TO SELECTED EXERCISES 


A-84 


48. R; is irreflexive because no element of A is related 


51. 


54. 


by Re to itself. Rg is asymmetric because Re consists 
only of (0, 1) and (0, 2) and neither (1, 0) nor (2, 0) is 
in Re. Re is not intransitive. Let x = y = z = 0. Then 
(x, y) E Re and (y, z) E Re and (x, z) E Re. 


R' = R U {(0, 0), (0, 3), (1, 0), (3, 1), G, 2), (3, 3), 
(0, 2), CL, 2)} 
= {(0, 0), ©, 1), 0, 2), 0, 3), CL, 0), CL, D, A, 2), 
(1, 3), (2, 2), 3, 0), 3, 1), G, 2), G, 3)} 
Algorithm—Test for Reflexivity 
[The input for this algorithm is a binary relation R defined 
ona set A, that is represented as the one-dimensional 
array a{\], a[2],..., a[n]. To test whether R is reflexive, a 
variable called answer is initially set equal to “yes,” and 
each element a[i] of A is examined in turn to see whether 
it is related by R to itself. If any element is not related to 
itself by R, then answer is set equal to “no,” the while loop 
is not repeated, and processing terminates.] 
Input: n [a positive integer], a[1], a[2],..., a[n] 
[a one-dimensional array representing a set A], 
R [a subset of A X A] 
Algorithm Body: 
i:= 1, answer := “yes” 
while (answer = “yes” andi =n) 
if (a[i], a[i]) É R then answer := “no” 
i:=i+1 
end while 
Output: answer [a string] 


SECTION 8.3 


1. 


2. 


3. 


Tl. 


a. cRc b. bRa,cRb,eRd c aRc 
d. cRc,bRa,cRb,eRd,aRc,cRa 


a. R= {(0, 0), (0, 2), (2, 0), (2, 2), (1, 1), 83, 3), 
(3, 4), (4, 3), 4, 9} 

[0] = {0, 4}, [1] = {1, 3}, [2] = {2}, [3] = (1, 3} 

There are three distinct equivalence classes: 

[0] = {0, 4} = [4], [1] = {1, 3} = [3], [2] = {2} 


[1] = {1, 5, 9, 13, 17}, [2] = {2, 6, 10, 14, 18}, 
[3] = {3, 7, 11, 15, 19}, [4] = {4, 8, 12, 16, 20}, 
[5] = {5, 9, 13, 17, 1} = [1] 


There are four distinct equivalence classes: [1], [2], [3], [4] 
{(1, 3), (3, 9}, {2, 4), (-4, —8), (3, 6)}, {C, 5)} 
{OD}, {fa}, {b}, {ch}, (a, b}, {a, c}, {b, ch, (fa, b, ch 


[0] = {x E A| 4|@?-0)} = {x EA] 4] 27} 
= {—4, —2, 0, 2, 4} 

[1] = {x EA] 4] @?-19} = {x EA] 4|@’?-1} 
= {-3, -1, 1,3} 
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13. 


15. 


16. a. 


17. 


18. 


19. 


20. 


{aaaa, aaab, aaba, aabb}, {abaa, abab, abba, abbb}, 
{baaa, baab, baba, babb}, {bbaa, bbab, bbba, bbbb} 


a. True. 17—2 = 15 and 5|15. 


[7] = [4] = [19], -4 = [17], [~6] = [27] 


a. Proof: Suppose that m and n are integers such that 
m = n (mod 3). [We must show that m mod 3 = n mod 3.] 
By definition of congruence, 3 | (m — n), and so, by 
definition of divisibility, m — n = 3a for some integer 
a. Letr = m mod 3. Then m = 3b +r for some in- 
teger b. Since m—n = 3a, it follows by substitution 
thatm —n = (3b + r)—n = 3a, or, equivalently, 
n= 3(b-a)t+r. 
Now b — ais an integer and 0 = r < 3. So, by defi- 
nition of mod, n mod 3 = r, which equals m mod 3. 
Suppose that m and n are integers such that 
m mod 3 = n mod 3. [We must show that m =n 
(mod 3).] Let r = m mod 3 = n mod 3. Then, by 
definition of mod, m = 3p +r and n = 3q+r 
for some integers p and q. By substitution, 
m—n= (3p +r)— (34 +r) = 3(p — q). Since p- q 
is an integer, it follows that 3| (m — n), and so, by 
definition of congruence, m = n (mod 3). 


a. One possible answer: Let A = {1, 2} and 
B = {2,3}. Then A + B, so A and B are distinct. 
But A and B are not disjoint since 2 E A N B. 


a. (1) Proof: R is reflexive because it is true that for 
each student x at a college, x has the same major (or 
double major) as x. 


R is symmetric because it is true that for all students 
x and y at a college, if x has the same major (or 
double major) as y, then y has the same major (or 
double major) as x. 


R is transitive because it is true that for all students 
x, y, and z at a college, if x has the same major (or 
double major) as y and y has the same major (or dou- 
ble major) as z, then x has the same major (or double 
major) as z. R is an equivalence relation because it is 
reflexive, symmetric, and transitive. 


(2) There is one equivalence class for each major 
and double major at the college. Each class consists 
of all students with that major (or double major). 


(1) The solution to exercise 12 in Section 8.2 proved 
that E is reflexive, symmetric, and transitive. Thus 
E is an equivalence relation. 

(2) Observe that for any integer a, the equivalence class 

of ais 


la] = {x€Z|xEa} 


by definition of equivalence class 
= {x © Z|x-—ais divisible by 4} 
by definition of E 
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25 


26 


28 
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= {x €Z|x—a = 4k for some integer k} 
by definition of divisibility 

= {x © Z|x = 4k+a for some integer k} 
by algebra. 


Now when any integer a is divided by 4, the only pos- 
sible remainders are 0, 1, 2, and 3 and no integer has 
two distinct remainders when it is divided by 4. Thus 
every integer is contained in exactly one of the follow- 
ing four equivalence classes: 

{x E Z| x = 4k for some integer k} 

{x E Z| x = 4k + 1 for some integer k} 

{x E Z| x = 4k +2 for some integer k} 


{x E Z| x = 4k +3 for some integer k} 


Hint: Use facts about even and odd integers from Sec- 
tion 4.2 to show that m R n, if, and only if, m and n are 
both even or m and n are both odd, or, in other words, 
if, and only if, both m and n have the same parity. Use 
that result to show that R is an equivalence relation 
with two distinct equivalence classes: the set of all even 
integers and the set of all odd integers. 


(1) Proof: A is reflexive because each real number has 
the same absolute value as itself. 


A is symmetric because for all real numbers x and y, if 
|x| = |y| then |y| = |x]. 

A is transitive because for all real numbers x, y, and z, if 
|x| = [y| and [y| = |z| then |x| = |z]. 

A is an equivalence relation because it is reflexive, sym- 
metric, and transitive. 


(2) The distinct classes are all sets of the form {x, —x}, 
where x is a real number. 


Hints: (1) D is reflexive, symmetric, and transitive. The 
proofs are very similar to the proofs in exercise 17. 


(2) There are two distinct equivalence classes. Note that 
mn = (m—n)(m + n) for all integers m and n. In 
addition, 3|(m—n) or 3|(m+n) > either m—n = 3r or 


m+n = 3r, for some integer r. 


(1) Proof: I is reflexive because the difference between 
each real number and itself is 0, which is an integer. 


I is symmetric because for all real numbers x and y, if 
x —y is an integer, then y— x = (—1)(x—y), which is 
also an integer. 

I is transitive because for all real numbers x, y, and 

z, if x— y is an integer and y — z is an integer, then 
x—z=(x—y)+(y—z) is the sum of two integers and 
thus is an integer. 


Iis an equivalence relation because it is reflexive, sym- 
metric, and transitive. 


29. 


32. 


34 


36 


38. 


(2) There is one class for each real number x with 

0 =x < 1. The distinct classes are all sets of the form 
{y ©R|y = n+x, for some integer n}, where x is a real 
number such thatO =x < 1. 


(1) Proof: P is reflexive because each ordered pair of 
real numbers has the same first element as itself. 


P is symmetric for the following reason: Suppose (w, x) 
and (y, z) are ordered pairs of real numbers such that 
(w, x) P (y, z). Then, by definition of P, w = y. Now by 
the symmetric property of equality, this implies that 

y = w, and so, by definition of P, (y, z) P (w, x). 


P is transitive for the following reason: Suppose (u, v), 
(w, x), and (y, z) are ordered pairs of real numbers such 
that (u, v) P (w, x) and (w, x) P (y, z). Then, by definition 
of P, u = w and w = y. It follows from the transitive 
property of equality that u = y. Hence, by definition of 
P, (u, v) P (y, 2). 

P is an equivalence relation because it is reflexive, sym- 
metric, and transitive. 


(2) There is one equivalence class for each real number. 
The distinct equivalence classes are all sets of ordered 
pairs {(x, y) E R X R | x = a}, for each real number 

a. Equivalently, the equivalence classes consist of all 
vertical lines in the Cartesian plane. 


Solution: There is one equivalence class for each real 
number ż such that 0 = tf < m. One line in each class 
goes through the origin, and that line makes an angle 
of t with the positive horizontal axis. 

A 


line L 


Alternatively, there is one equivalence class for every 
possible slope: all real numbers plus “undefined.” 


No. If points p, q, and r all lie on a straight line with q 
in the middle, and if p is c units from q and q is c units 
from y, then p is more than c units from r. 


Proof: Suppose R is an equivalence relation on a set A 
and a € A. Because R is an equivalence relation, R is 
reflexive, and because R is reflexive, each element of A 
is related to itself by R. In particular, a R a. Hence, by 
definition of equivalence class, a € [a]. 


Proof: Suppose R is an equivalence relation on a set A 
and a, b, and c are elements of A with b R c and c € [a]. 
Since c € [a], then c R a by definition of equivalence 
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40. 


41. 
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class. Now R is transitive because R is an equivalence 
relation. Thus, since b R c and c Ra, then b R a. It fol- 
lows that b € [a] by definition of equivalence class. 


Proof: Suppose a, b, and x are in A, a R b, and x € [a]. 
By definition of equivalence class, x R a. So x R a and 
a R b, and thus, by transitivity, x R b. Hence x € [b]. 


Hint: To show that [a] = [b], show that [a] C [b] and 
[b] C [a]. To show that [a] C [b], show that for every x 
in A, if x E [a] then x € [b]. 


42. c. One possible answer: (2, 6), (—2, —6), (3, 9), 


43. 


(=3, =9). 
a. Suppose that (a, b), (a', b’), (c, d), and (c’, d’) are 
any elements of A such that 
(a, b)] = [(a’, b’)] and [(¢, d)] = Ke’, d’). 
By definition of R, 
ab’ = ba’ (*) and cd’ = dc' (**). 
We must show that 
Ia, b) + Ke, DI = Ia’, bY + Ke’, d’). 
By definition of the addition on A, this equation is 
true if, and only if, 
[(ad + bc, bd)| = [(a'd' + b'c', b'd')]. 
And, by definition of the relation, this equation is 
true if, and only if, 
(ad + bce)b'd' = bd(a'd' + b’c'). 
After multiplying out, this becomes 
adb'd' + bcb'd' = bda'd' + bdb'c', 
and regrouping, turns it into 
(ab')(dd') + (cd')(bb') = (ba')(dd') + (dc')(bb’). 
Substituting the values from (*) and (**) shows that 
this last equation is true. 


c. Suppose that (a, b) is any element of A. We must 
show that 


[(a, b)] + [0, D] = [(a, b)]. 
By definition of the addition on A, this equation is 
true if, and only if, 

[(a-1+b-0, b-1)] = [(a, b)]. 


And this last equation is true because a: 1 +b-:0 =a 
and b-1 =b. 

Suppose that (a, b) is any element of A. We must 
show that 


[(a, b)] + [(~a, b)] 
= [(~a, b)] + (a, b)] = [(0, DI. 
By definition of the addition on A, this equation is 


true if, and only if, 
[(ab + b(—a), bb)] = [(0, 1)], 
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or, equivalently, 7. a. 128 = 2 (mod 7) because 128 — 2 = 126 = 7-18, and 
[(0, bb)] = [(0, 1)]. 61 = 5 (mod 7) because 61 — 5 = 56 = 7°8 

By definition of the relation, this last equation is b. 128+ 61 = (2 +5) (mod 7) because 128 + 61 = 189, 

true if, and only if, 0-1 = bb-0, which is true. 2+5 =7,and 189-7 = 182 = 7-26 


c. 128—61 = (2—5) (mod 7) because 128 — 61 = 67, 


was XZ. 
44. a. Let (a, b) be any element of Z Z . We must 2—5 = —3, and 67 — (—3) = 70 = 7-10 


show that (a, b) R (a, b). By definition of R, this 
relationship holds if, and only if, a+ b = b + a. But d. 128-61 = (2:5) (mod 7) because 128-61 = 7808, 
this equation is true by the commutative law of ad- 2: a 10, and 7808 — (10) = 7798 =7: sue 
dition for real numbers. Hence R is reflexive. e. 128° = 2° (mod 7) because 128” = 16384, 2” = 4, 

c. Hint: You will need to show that for any posi- and 16384 — 4 = 16380 = 7:2340. 
tive integers a, b, c, and d, if a +d = c + b and 
c+f=d+e,thena+f=b+e. 

d. One possible answer: (1, 1), (2, 2), (3, 3), (4, 4), 
(5, 5) 

g- Observe that for any positive integers a and b, the 
equivalence class of (a, b) consists of all ordered 


9. a. Proof: Suppose a, b, c, d, and n are integers with 
n > 1, a = c (mod n), and b = d (mod n). By 
Theorem 8.4.1, a — c = nr and b — d = ns for some 
integers r and s. Then 


(a+b)—(c+d)= (a-—c)+(b—d) = nr+ns 


pairs in Z* X Z* for which the difference between =n(r+s). 
the first and second coordinates equals a — b. Thus Now r + s is an integer, and so, by Theorem 8.4.1, 
there is one equivalence class for each integer: a+b = (c + d) (mod n). 


positive, negative, and zero. Each positive integer n 


12. a. Proof (by mathematical induction): Let the property 


corresponds to the class of (n + 1, 1); each negative a 
P(n) be the congruence 10° = 1 (mod 9). 


integer —n corresponds to the class of (1, n + 1); 


and zero corresponds to the class (1, 1). Show that P(0) is true: 
When n = 0, the left-hand side of the congruence is 
47. c. “Ways and Means 10° = 1 and the right-hand side is also 1. 
SECTION 8.4 Show that for every integer k = 0, if P(k) is true, 
. then P(k +1) is true. 
1. a. ZKHUH VKDOO ZH PHHW Let k be any integer with k = 0, and suppose P(k) is 
b. IN THE CAFETERIA true. That is, suppose 10° =1 (mod 9). (*) [This is 


3 The relation 3 | (25 —19) i b the inductive hypothesis.] By Theorem 8.4.1, 10 = 1 
. a e relation 3 | ( ) is true because (mod 9) (**) because 10— 1 = 9 = 9- 1. And by 


25— 19 = 6 and 3 | 6 (since 6 = 3-2). Theorem 8.4.3, we can multiply the left- and right- 
b. By definition of congruence modulo n, to show that hand sides of (*) and (**) to obtain 10°10 = 1-1 


25 = 19 (mod 3), one must show that 3/(25 — 19). (mod 9), or, equivalently, 10°*! = 1 (mod 9). Hence 
This was verified in part (a). P(k +1) is true. 
c. To show that 25 = 19 + 3k for some integer k, one 


i Alternative Proof: Note that 10 = 1 (mod 9) 
solves the equation for k and checks that the result 


because 10 — 1 = 9 and 9 | 9. Thus by 


is an integer. In this case, k = (25 — 19)/3 = 2, Theorem 8.4.3(4), 10” = 1” = 1 (mod 9). 
which is an integer. Thus 25 = 19 +2-3. i 
d. When 25 is divided by 3, the remainder is 1 be- 14. 14 mod 55 = 14 
cause 25 = 3-8 + 1. When 19 is divided by 3, the 14° mod 55 = 196 mod 55 = 31 
remainder is also 1 because 19 = 3-6 + 1. Thus 25 14* mod 55 = (14? mod 55} mod 55 = 


and 19 have the same remainder when divided by 3. 


31° mod 55 = 26 
e. By definition, 25 mod 3 is the remainder obtained á 


8 — pat 2 = 

when 25 is divided by 3, and 19 mod 3 is the 14° mod 55 = (14° mod 55) mod 55 = 
remainder obtained when 19 is divided by 3. In part 26° mod 55 = 16 

(d) these two numbers were shown to be equal. 14'° mod 55 = (14° mod 55)? mod 55 = 
6. Hints: (1) Use the quotient-remainder theorem and 16° mod 55 = 36 


Theorem 8.4.1 to show that given any integer a, a is in 
one of the classes [0], [1], [2],... [n — 1]. (2) Use the 
quotient-remainder theorem (Theorem 4.5.1) to prove 
that if0 =<a<n,0=b<n, anda = b (mod n), then 
a=b. 


15. 4” mod 55 = 14!6+8+?+! mod 55 
= {(14!° mod 55)(148 mod 55)(14° mod 55) 
(14! mod 55)} mod 55 
= (36-16-3114) mod 55 = 249984 mod 55 = 9 
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16. 


19. 


22. 


25. 


Note that 307 = 256+32+16+2+1. 

675! mod 713 = 675 

675° mod 713 = 18 

675‘ mod 713 = 18° mod 713 = 324 

675° mod 713 = 324° mod 713 = 165 

675'° mod 713 = 165° mod 713 = 131 

675° mod 713 = 131° mod 713 = 49 

675™ mod 713 = 49? mod 713 = 262 

675" mod 713 = 262? mod 713 = 196 

675%% mod 713 = 196° mod 713 = 627 

Thus 

675°” mod 713 = 67570174 16+2+1 mod 713 
= (675° -675°?-675'°-675°-675') mod 713 
= (627:-49+ 131+ 18-675) mod 713 = 3. 


The letters in HELLO translate numercially into 08, 05, 
12, 12, and 15. By Example 8.4.9, the H is encrypted as 
17. To encrypt E, we compute 5° mod 55 = 15. To en- 
crypt L, we compute 12° mod 55 = 23. And to encrypt O, 
we compute 15° mod 55 = 20. Thus the ciphertext is 17 
15 23 23 20. (In practice, individual letters of the alphabet 
are grouped together in blocks during encryption so that 
deciphering cannot be accomplished through knowledge 
of frequency patterns of letters or words.) 


By Example 8.4.10, the decryption key is 27. Thus the 
residues modulo 55 for 1377, 2077, and 97’ must be found 
and then translated into letters of the alphabet. 

Because 27 = 16 +8 +2 + 1, we first perform the fol- 
lowing computations: 


20' = 20(mod 55) 

20° = 15(mod 55) 

20* = 15° = 5 (mod 55) 
208 = 25° = 5(mod 55) 
20!° = 25° = 20 (mod 55) 


13! = 13(mod 55) 

13° = 4(mod 55) 

134 = 4° = 16(mod 55) 
13° = 16° = 36 (mod 55) 
13! = 36° = 31 (mod 55) 


9! = 9 (mod 55) 

9? = 26 (mod 55) 

94 = 26° = 16 (mod 55) 

98 = 16° = 36 (mod 55) 

9!6 = 36° = 31 (mod 55) 
Then we compute 


13” mod 55 = (31-36-413) mod 55 = 7, 

20” mod 55 = (20-25-15-20) mod 55 = 15, 

9°’ mod 55 = (31-36-26-9) mod 55 = 4. 

Finally, because 7, 15, and 4 translate into letters as G, 
O, and D, we see that the message is GOOD. 


Hint: By Theorem 5.2.2, using a in place of r and n — 1 


P = a’—1 
in place of n, we have 1 +a +a? +- +a"! = ai 
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26. 


28. 


31. 
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Multiplying both sides by a — 1 gives 
4 DA +a+a +: 


a'-l=(a Ha’). 


Step 1: 6664 = 765-8 + 544, and so 544 = 
6664 — 765:8 
Step 2: 765 = 544-1+ 221, and so 221 = 765 — 544 
Step 3: 544 = 221-2 + 102, and so 102 = 544—221-2 
Step 4: 221 = 102-2 + 17, and so 17 = 221 — 102-2 
Step 5: 102 = 17-6+0 
Thus gcd(6664, 765) = 17 (which is the remainder 
obtained just before the final division). Substitute back 
through steps 4-1 to express 17 as a linear combination 
of 6664 and 765: 
17 = 221 — 102-2 

= 221 — (544 — 221-2) = 221-5 — 544-2 

= (765 — 544)-5 — 544-2 = 765-5 — 544-7 

= 765:5 — (6664 — 765-8)-7 = (—7)-6664 + 61-765. 


(When you have finished this final step, it is wise to 
verify that you have not made a mistake by checking 
that the final expression really does equal the greatest 
common divisor.) 


a 330 | 156 18 12 6 
b 156 18 12 6 0 

0 

2 
s 1 0 1 —8 9 
t 0 1 =2 17 =19 
u 0 1 —8 9 —26 
v 1 —2 17 =19 55 
newu 1 —8 9 =26 
newv =2 17 =19 55 
sa+tb | 330 18 =6 6 6 


a. Step 1: 210 = 13-16+2, and so 2 = 210 — 16-13 
Step 2: 13 = 2-6 + 1, and so 1 = 13— 2-6 
Step 3: 6 = 1-6 +0, and so gcd(210, 13) = 1 
Substitute back through steps 2 and 1: 

1 =13-—2-6 

= 13—(210— 16-13)-6 = (—6)-210+ 97-13 
Thus 210-(—6) = 1 (mod 13), and so —6 is an 
inverse for 210 modulo 13. 

b. Compute 13 — 6 = 7. Note that 7 = —6 (mod 13) 
because 7 — (—6) = 13 = 13-1. Thus, by Theorem 
8.4.3(3), 210-7 = 210-(—6) (mod 13). By part (a), —6 
is an inverse for 210 modulo 13, and so 210-(—6) = 1 
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(mod 13). It follows, by the symmetric and transitive L: 12 = 12 (mod 713) 
properties of congruence, that 210-7 = 1 (mod 13), 12° = 144 (mod 713) 
and so 7 is a positive inverse for 210 modulo 13. . = 144? = 59 (mod 713) 


c. This problem can be solved using either the result = 59° = 629 (mod 713) 


of part (a) or that of part (b). By part (b) 210-7 = 1 =a 629 = 639 (mod 713) 

(mod 13). Multiply both sides by 8 and apply Theo- 12°? = 639? = 485 (mod 713) 

rem 8.4.3(3) to obtain 210-56 = 8 (mod 13). Thus Thus the ciphertext is 

a positive solution for 210x = 8 (mod 13) is x = 56. 43 

Note that the least positive residue corresponding eee 

to this solution is also a solution. By Theorem 8.4.1, ASDO ANAA a = AS: 


56 = 4 (mod 13) because 56 = 13-4 + 4, and so, by P: 16 = 16 (mod 713) 
Theorem 8.4.3(3), 210-56 = 210-4 = 9 (mod 13). 16° = 256 (mod 713) 

This shows that 4 is also a solution for the congru- 16! = 256° = 653 (mod 713) 
ence, and because 0 = 4 < 13, 4 is the least positive 168 = n = 35 (mod 713) 


solution for the congruence. 16!° = 35? = 512 (mod 713) 


16° = n- = 473 (mod 713) 
Thus the ciphertext is 
16° mod 713 
= (473-35-256-16)mod713 = 128. 


Therefore, the encrypted message is 233 129 048 128. 
(Again, note that in practice, individual letters of the 
alphabet are grouped together in blocks during encryp- 
tion so that deciphering cannot be accomplished through 
knowledge of frequency patterns of letters or words. We 
kept them separate so that the numbers in the computa- 
tions would be smaller and easier to work with.) 


33. Hint: If as + bt = 1 and c = au = by, then 
c = asc + btc = as(bv) + bt(au). 


35. Proof: Let a be any integer and let n be any positive in- 
teger, and suppose s and ¢ are any inverses for a modulo 
n. Thus as = 1 (mod n) and at = 1 (mod n). Note that 
ast = (as):t = (at): s By Theorem 8.4.3(3), (as):t = t 
(mod n) and (at)-s = s (mod n). Thus, by symmetry 
and transitivity of congruence modulo n, s = t (mod n). 
Because s and tf were chosen arbitrarily, we conclude 
that any two inverses for a are congruent modulo n. 


36. The numeric equivalents of H, E, L, and P are 08, 
05, 12, and 16. To encrypt these letters, the following 
quantities must be computed: 8° mod 713, 5# mod 713, 
12° mod 713, and 16” mod 713. We use the fact that 
43 = 32+8+2+1. 


H: 8 =8 (mod 713) 
8° = 64 (mod 713) 


39. By exercise 38, the decryption key, d, is 307. 
Hence, to decrypt the message, the following quan- 
tities must be computed: 675°% mod 713, 89°% 
mod 713, and 48°°' mod 713. We use the fact that 
307 = 256+32+16+2+1 

675 = 675 (mod 713) 
675° = 18 (mod 713) 
8° = 64° = 531 (mod 713) 6754 = 18” = 324 (mod 713) 
8° = 531° = 326 (mod 713) 6758 = 3242 = 165 (mod 713) 
16 = 326° = 39 (mod 713) 675'° = 165° = 131 (mod 713) 

832 = 39? = 95 (mod 713) 675°* = 1317 = 49 (mod 713) 

675° = 49° = 262 (mod 713) 

67518 = 262? = 196 (mod 713) 

675% = 196° = 627 (mod 713) 


Thus the ciphertext is 
8° mod 713 
= (95-326-64-8) mod 713 = 233. 


5 = 5 (mod 713) 

5? = 25 (mod 713) 

54 = 625 (mod 713) 

58 = 625° = 614 (mod 713) 
5! = 614° = 532 (mod 713) 

a- = 532° = 676 (mod 713) 


Thus the ciphertext is 
8° mod 713 
= (676:614-25-5) mod 713 = 129. 


89 = 89 (mod 713) 

89° = 78 (mod 713) 

89 = 78° = 380 (mod 713) 

898 = 380° = 374 (mod 713) 

89!6 = 3747 = 128 (mod 713) 

89°? = 128° = 698 (mod 713) 
89% = 6987 = 225 (mod 713) 

918 = - = 2 (mod 713) 

89°56 = 2? = 4 (mod 713) 
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48 = 48 (mod 713) 

48” = 165 (mod 713) 

48* = 131 (mod 713) 

48° = 49 (mod 713) 

48!6 = 262 (mod 713) 

48° = 196 (mod 713) 

48% = 627 (mod 713) 

48'8 = 627° = 266 (mod 713) 

48° = 266° = 169 (mod 713) 
Thus the decryption for 675 is 

675°% mod 713. = (6757017164241) mod 713 
= (627-49-131-18-675) mod 713 = 3, 
which corresponds to the letter C. 
The decryption for 89 is 
89°07 mod 713 = (8970+ 324 16*2*1) mod 713 
= (4-698: 128-78-89) mod 713 = 15, 
which corresponds to the letter O. 
The decryption for 48 is 
48°" mod 713 = (487°6732416*241) mod 713 
= (169-196-262: 165-48) mod 713 = 12, 

which corresponds to the letter L. 


Thus the decrypted message is COOL. 


41. a. Hint: For the inductive step, assume p|qiq2°** qs+1 
and let a = qıqx ` 'qs. Then p|aq,+1, and either 
P = 4s+1 Or Euclid’s lemma and the inductive 
hypothesis can be applied. 


42. a. Whena = 15 and p = 7, a’! = 15% = 


11390625 = 1 (mod 7) because 11390625 — 
1 = 7: 1627232. 


SECTION 8.5 


1. a. 


R, is not antisymmetric: 1 R, 3 
and 3 R; | and 1 # 3. 


2 


) O 


8:5 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-90 


R, is antisymmetric: There are 
no cases where a R b and 
bRaanda#b. 


1 
2 
è 
2. Ris not antisymmetric. Let x and y be any two distinct 
people of the same age. Then x R y and y R x but x # y. 
5. Risa partial order relation. 


Proof: 


R is reflexive: Suppose (a, b) E R X R. Then 

(a, b) R (a, b) because a = a and b Sb. 

R is antisymmetric: Suppose (a, b) and (c, d) are 
ordered pairs of real numbers such that (a, b) R (c, d) 
and (c, d) R (a, b). Then 


eithera<c or botha = candb <d 


and 
eitherc<a or bothc=aandd=b. 
Thus 
axcandc=a 
and so 
a=c. 
Consequently, 
bsd and dsb 
and so 


Hence (a, b) = (c, d). 


R is transitive: Suppose (a, b), (c, d), and (e, f) are or- 
dered pairs of real numbers such that (a, b) R (c, d) and 
(c, d) R (e, f). Then 
eithera<c or botha = candb =< d 
and 
eitherc<e or bothc=eandd<f. 
It follows that one of the following cases must occur. 


Case 1 (a < c and c < e): Then by transitivity of <, 

a < e, and so (a, b) R (e, f) by definition of R. 

Case 2 (a < c and c = e): Then by substitution, a < e, 
and so (a, b) R (e, f) by definition of R. 


Case 3 (a = c and c < e): Then by substitution, a < e, 
and so (a, b) R (e, f) by definition of R. 
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Case 4 (a = c and c = e): Then by definition of R, 17. a. {0, 1} 18. 91,1) 
b < dand d =£ f, and so by transitivity of <S, b =f. 
Hence a = e and b = f, and so (a, b) R (e, f) by defini- 


tion of R. $ (1,0) 
In each case, (a, b) R (e, f). Therefore, R is transitive. {0} {1} 
Since R is reflexive, antisymmetric, and transitive, R is $(0, 1) 


a partial order relation. 


8. Ris nota partial order relation because R is not anti- 400.0) 


symmetric. Ø 
Counterexample: 1 R 3 (because 1 + 3 is even) and 3 R 


Geena 3 + lis'even but L #3. 21. a. Proof: [We must show that for all a and b in A, a | b 


or b|a.] Let a and b be particular but arbitrarily 


10. No. Counterexample: Define relations R and S chosen elements of A. By definition of A, there are 
on the set {1, 2} as follows: R = {(1, 2)} and S = nonnegative integers r and s such that a = 2’ and 
{(2, D}. Then both R and S are antisymmetric, but b = 2’. Now either r= s or s < r. If r < s, then 
R U S = {(1, 2), (2, 1)} is not antisymmetric because b= = 27-98 = ge, 


ers AS and (2,1) ERUS but L A2, where s — r = 0. It follows, by definition of divisibil- 


11. a. True, by (1). ity, that a | b. By a similar argument, if s < r, then 
b. False. By (1), bba < bbab. b|a. Hence either a | b or b | a [as was to be shown]. 

e @ e @ @ 

1B. Ry = {(a, a), (b, D)}, Ro = {(a, a), (b, b), (a, Dh, bi 2 2 2 ne 
R, = {(a, a), (b, b), (b, a} 22. greatest element: none; least element: 1; maximal ele- 


14. a. R, = {(a, a), (b, b), (c, ©} ments: 15, 20; minimal element: 1 


R, = {(a, a), (b, b), (c, £), (b, a)} 24. greatest element: {0, 1}; least element: @; maximal ele- 
R; = {(a, a), (b, b), (c, c), (c, @} ment: {0, 1}; minimal element: Ø 
R, = {(a, a), (b, b), (c, c), (b, a), (c, a} 26. greatest element: (1, 1); least element: (0, 0); maximal 
R; = {(a, a), (b, b), (c, ©, (c, b), (c, aà} element: (1, 1); minimal element: (0, 0) 
Rs = {(a, a), (b, b), (c, c), (b, c), (b, a)} 30. a. No greatest element, no least element 
R; = {(a, a), (b, b), (c, ©, (c, b), (b, a), (c, a} b. Least element is 0, greatest element is 1 
Rg = {(a, a), (b, b), (c, c), (b, ©), (b, a), (c, a)} 31. R is a total order relation because it is reflexive, anti- 
Ry = {(a, a), (b, b), (c, ©), (b, ©} symmetric, and transitive (so it is a partial order) and 
Rz bb SED because [b, a, c, d] is a chain that contains every ele- 
10 = {(a, a), ©, b), (c, ©, (c, b)} ment of A: b Rc, c Ra, andaR d. 
15. Hint: R is the identity relation on A: x R x for each 34. Hint: Let R' be the restriction of R to B and show that 


xEA, andxKyifx# y. R' is reflexive, antisymmetric, and transitive. In each 
16. a. 20 case, this follows almost immediately from the fact that 
R is reflexive, antisymmetric, and transitive. 


35. One possible solution: Ø C {w} C {w, x} C {w, x,y} C 


4 10 15 Uaa 
38. Proof: Suppose A is a partially ordered set with respect 
to a relation <. By definition of total order, A is totally 
j A ordered if, and only if, any two elements of A are com- 


parable. By definition of chain, this is true if, and only 
if, A is a chain. 


39. Proof (by mathematical induction): Let A be a set that 
1 is totally ordered with respect to a relation <, and let 
the property P(n) be the sentence “Every subset of A 
with n elements has both a least element and a greatest 
element.” 
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Show that P(1) is true: 


If A = ©, then P(1) is true by default. So assume that 
A has at least one element, and suppose S = {a,} isa 


44. One such total order is 1, 5, 2, 15, 10, 4, 20. 
46. One such total order is (0, 0), (1, 0), (0, 1), Cl, D. 
50. a. One possible answer: 1, 6, 10, 9, 5, 7, 2, 4, 8, 3 


subset of A with one element. Because ~ is reflexive, 
a, Xa. So, by definition of least element and greatest 
element, a, is both a least element and a greatest ele- 
ment of S, and thus the property is true for n = 1. 


Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 


51. 


b. Critical path: 1, 2, 5, 8, 9 


SECTION 9.1 


2. 3/4, 1/2, 1/2 
Let k be any integer with k = 1, and suppose that any 
subset of A with k elements has both a least element 3. (1, 20, 3, 4%, 54, 6%, 74, 80, 9%, 104, 1V, 2V, 
and a greatest element. [Inductive hypothesis.] We must 3Y, 4Y, 5V, 6Y, 79, 8V, 9V, 10V}, probability = 
show that any subset of A with k + 1 elements has 20/52 = 38.5% 
both a least element and a greatest element. If A has 
fewer than k + 1 elements, then the statement is true by 5. {10%, J%, eee AS10919, y K$, ae ng 
default. So assume that A has at least k + 1 elements and TY, QY, BY, AY, 108, J&, Q4, K4, AS}, probability = 
that S = {a,, do,..., ae41} is a subset of A with k+ 1 20/52 = 5/13 = 38.5% 
elements. By inductive hypothesis, S — {a;,+,} has both 7. {26, 35, 44, 53, 62}, probability = 3/8 = 37.5% 
a least element s and a greatest element t. Now because 
A is totally ordered, a,4., and s are comparable. If 9. {11, 12, 13, 14, 15, 21, 22, 23, 24, 31, 32, 33, 41, 42, 51}, 
ag+1 Z 8, then, by transitivity of <, a,., is the least probability = 15/36 = 413% 
element of S; otherwise, s remains the least element of 1. a. (HHH, HHT, HTH, HTT, THH, THT. TTH, TTT} 


S. And if t < a,+,, then, by transitivity of <, a,+, is the 
greatest element of S; otherwise, t remains the greatest 
element of S. Thus S has both a greatest element and a 
least element fas was to be shown]. 


40. a. Proof by contradiction: Suppose not. Suppose A is 


a nonempty, finite set that is partially ordered with 

respect to a relation <, and suppose no element of A 

is minimal. Construct a sequence of elements x4, X2, 

X3,... of A as follows: 

1. Pick any element of A and call it x}. 

2. For each i = 2, 3, 4,..., pick x; to be an element 
of A for which x; < x;~, and x; # x;—1. [Such an 
element must exist because otherwise x;-, would 
be minimal, and we are supposing that no element 
of A is minimal.] Now x; # x; for any i # j. 

[For if x; = Xp where i < j, then on the one hand, 
XjXXj-1 N+ X1 xX; and so Xj Xxj+). On 

the other hand, since x; = x; and x; > x; +, then 

Xj > Xj +1- Hence by antisymmetry, x; = Xi+ı, and 
s0 X; = xX;+; because x; = x; But this contradicts 
the definition of the sequence x1, X2, X3,...] 

Thus x), X2, x3, . . . is an infinite sequence of dis- 
tinct elements, and consequently {x1, X2, x3,...} 
is an infinite subset of the finite set A, which is 
impossible. Hence the supposition is false and we 
conclude that any partially ordered subset of a 
finite set has a minimal element. 


12. 


13. 


14. 
16. 


18. 


21. 


b. (i) {HTT, THT, TTH}, probability = 3/8 = 37.5% 


a. {BBB, BBG, BGB, BGG, GBB, GBG, GGB, GGG} 
b. (i) {GBB, BGB, BBG}, probability = 3/8 = 37.5% 


a. {CCC, CCW, CWC, CWW, WCC, WCW, WWC, 
WWW} 
b. (i) {CWW, WCW, WWC}, probability = 3/8 = 
37.5% 


a. probability = 3/8 = 37.5% 


a. {RRR, RRB, RRY, RBR, RBB, RBY, RYR, RYB, RYY, 
BRR, BRB, BRY, BBR, BBB, BBY, BYR, BYB, BYY, 
YRR, YRB, YRY, YBR, YBB, YBY, YYR, YYB, YYY} 

b. {RBY, RYB, YBR, BRY, BYR, YRB}, probability = 
6/27 = 2/9 = 22.2% 

c. {RRB, RBR, BRR, RRY, RYR, YRR, BBR, BRB, RBB, 
BBY, BYB, YBB, YYR, YRY, RYY, YYB, YBY, BYY}, 
probability = 18/27 = 2/3 = 664% 

a. {B,B), B,B>, B,W, B2B, ByB>, BW, WB,, WB>, WW} 

. {B,B), B,B>, ByB,, ByB5}, probability = 4/9 = 44.4% 

c. {B,W, BW, WB,, WB>}, probability = 4/9 = 44.4% 


a 1011 12 13 14 15 1617 18... 96 9798 99 


t t to of t 
3-4 35 36 3:32 3:33 


The above diagram shows that there are as many 


42. $ d positive two-digit integers that are multiples of 3 as 
there are integers from 4 to 33 inclusive. By Theo- 
rem 9.1.1, there are 33 — 4 + 1, or 30, such integers. 

a b 
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b. There are 99— 10+ 1 = 90 positive two-digit 
integers in all, and by part (a), 30 of these are 
multiples of 3. So the probability that a randomly 
chosen positive two-digit integer is a multiple of 
3 is 30/90 = 1/3 = 334%. 

c. Of the integers from 10 through 99 that are mul- 
tiples of 4, the smallest is 12 (=4-3) and the largest 
is 96 (=4-24). Thus there are 24— 3 + 1 = 22 
two-digit integers that are multiples of 4. Hence the 
probability that a randomly chosen two-digit integer 
is a multiple of 4 is 22/90 = 365%. 


m=3+1_ m=2 
n = A 


23. c. Probability = 


d. Because eal = 19, the probability is Matt = 3, 


24. a. (i) Ifnis even, there are [5 = 5 elements in the 


sub-array. 
(ii) If n is odd, there are [z] = wat elements in the 
sub-array. 
b. There are n elements in the array, so 
(i) The probability that an element is in the given 
n 


p + 2 1 
sub-array when n is even is ^ = 7> 
n 


(ii) The probability that an element is in the given 
n=1 
=) g f 2 _n-l 
sub-array when n is odd is —— = “5, 
n 


26. Let k be the 27th element in the array. By Theorem 9.1.1, 
k—42+1 = 27, and sok = 42 +27 — 1 = 68. Thus the 
27th element in the array is A[68]. 


28. Let m be the smallest of the integers. By Theorem 9.1.1, 
279 —m+ 1 = 56, and so m = 279 — 56 + 1 = 224. 
Thus the smallest of the integers is 224. 


31.123 456789... 999 1000 1001 
q q f { 
31 32 33 3:333 


Thus there are 333 multiples of 3 between 1 and 1001. 


32. a. 

M Tu W ThF Sa Su M Tu W Th F Sa Su... F Sa Su M 

123456 7 8 9 1011 1213 14 362 363 364 365 

t t t 
7-1 7-2 7°52 

Sundays occur on the 7th day of the year, the 14th 
day of the year, and in fact on all days that are 
multiples of 7. There are 52 multiples of 7 between 
1 and 365, and so there are 52 Sundays in the year. 
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SECTION 9.2 


1. Game 4 Game 5 Game 6 Game 7 
A (A wins) 
Sun A (A wins) 
art: i 
Ahas A (A wins) 


A (A wins) 


B (B wins) 


There are five ways to complete the series: A, B—A, 
B-B-A, B-B-B-A, and B-B-B-B. 


3. Four ways: A-A—A—A, B-A-A-A-—A, B-B—-A-A-A-—A, 
and B—B—B—A—A—A-—A 


4. Two ways: A~-B—A—B-—A-—B-A and B~A—B—A-—B-A-B 


6. a. Step 1: Step 2: Step 3: 
Choose urn. 


Choose ball 1. Choose ball 2. 


b. There are 12 equally likely outcomes of the 
experiment. 
c. 2/12 = 1/6 = 163% d. 8/12 = 2/3 = 663% 


8. By the multiplication rule, the answer is 3-2-2 = 12. 


9. a. In going from city A to city B, one may take any of 
the 3 roads. In going from city B to city C, one may 
take any of the 5 roads. So, by the multiplication 
tule, there are 3-5 = 15 ways to travel from city A 
to city C via city B. 

b. A round-trip journey can be thought of as a four- 
step operation: 
Step 1: Go from A to B. 
Step 2: Go from B to C. 
Step 3: Go from C to B. 
Step 4: Go from B to A. 


Since there are 3 ways to perform step 1, 5 ways to 
perform step 2, 5 ways to perform step 3, and 


11. a. 


12. a. 


13. a. 


3 ways to perform step 4, by the multiplication rule, 
there are 3-5-5-3 = 225 round-trip routes. 

In this case the steps for making a round-trip journey 
are the same as in part (b), but since no route segment 
may be repeated, there are only 4 ways to perform 
step 3 and only 2 ways to perform step 4. So, by the 
multiplication rule, there are 3-5-4-2 = 120 round- 
trip routes in which no road is traversed twice. 


Imagine constructing a bit string of length 8 as an 
eight-step process: 


Step 1: Choose either a 0 or a 1 for the left-most 
position, 

Step 2: Choose either a 0 or a 1 for the next position 
to the right. 


Step 8: Choose either a 0 or a 1 for the right-most 
position. 


Since there are 2 ways to perform each step, the 
total number of ways to accomplish the entire op- 
eration, which is the number of different bit strings 
of length 8, is 2-2-2-2-2-2-2-2 = 28 = 256. 
Imagine that there are three 0’s in the three left- 
most positions, and imagine filling in the remaining 
5 positions as a five-step process, where step i is to 
fill in the (i+ 3)rd position. Since there are 2 ways 
to perform each of the 5 steps, there are 2 ways to 
perform the entire operation. So there are 2°, or 32, 
8-bit strings that begin with three 0’s. 


Think of creating a hexadecimal number that satis- 
fies the given requirements as a five-step process. 


Choose the left-most hexadecimal 
digits. It can be any of the 9 hexa- 
decimal digits from 3 through B. 


Step 1: 


Steps 2—4: Choose the three hexadecimal digits 
for the middle three positions. Each 
can be any of the 16 hexadecimal 
digits. 

Step 5: Choose the right-most hexadecimal 
digit. It can be any of the 11 hexa- 


decimal digits from 5 through F. 


There are 9 ways to perform step 1, 16 ways to 
perform each of steps 2 through 4, and 11 ways to 
perform step 5. Thus, the total number of specified 
hexadecimal numbers is 9-16-16: 16-11 = 405,504. 


In each of the four tosses there are two possible 
results: Either a head (H) or a tail (T) is obtained. 
Thus, by the multiplication rule, the number of 
outcomes is 2-2-2-2 = 24 = 16. 


. There are six outcomes with two heads: HHTT, 


HTHT, HTTH, THHT, THTH, TTHH. Thus the 
probability of obtaining exactly two heads is 


6/16 = 3/8. 


9,2. SOLUTIONS AND HINTS TO SELECTED EXERCISES 


14. a. 


16. a. 


e. 


A-94 


Think of creating license plates that satisfy the given 

conditions as the following seven-step process: In 

steps 1—4 choose the letters to put in positions 1—4, 

and in steps 5-7, choose the digits to put in positions 

5-7. Since there are 26 letters and 10 digits and since 

repetition is allowed, there are 26 ways to perform 

each of steps 1—4 and 10 ways to perform each of 

steps 5-7. Thus the number of license plates is 
26-26-26-26-10-10-10 = 456,976,000. 

In this case there is only one way to perform step 1 

(because the first letter must be an A) and only one 

way to perform step 7 (because the last digit must 

be a 0). Therefore, the number of license plates is 

26-26-26-10-10 = 1,757,600. 

In this case there are 26 ways to perform step 1, 

25 ways to perform step 2, 24 ways to perform step 

5, and 8 ways to perform step 6, so the number of li- 

cense plates is 26-25-24-23-10-9-8 = 258,336,000. 


Two solutions: 


(i) By the multiplication rule, the number of inte- 
gers from 10 through 99 


the number of || the number of 
= | ways to pick || ways to pick 
the first digit |} the second digit 


= 9-10 = 90 


(ii) By Theorem 9.1.1, the number of integers from 
10 through 99 = 90 — 10+ 1 = 90. 

Because odd integers end in 1, 3, 5, 7, or 9, the 

number of odd integers from 10 through 99 


the number of 
ways to pick 
the second digit 


the number of 
= | ways to pick 
the first digit 
=9:5=45. 
An alternative solution uses the listing method 
shown in the solution for Example 9.1.4. 


the number of integers 
with distinct digits 


the number of || the number of 
= | ways to pick || ways to pick 
the first digit |} the second digit 


=9-9=81 


the number of odd integers 
with distinct digits 


the number of 
ways to pick 
the first digit 


the number of 
= | ways to pick 

the second digit 
= 5-8 = 40 because the first digit 
can’t equal 0, nor can it 
equal the second digit 


81/90 = 9/10, 40/90 = 4/9 
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18. 


19. 


20. 


21. 


22. 


a. Let step 1 be to choose either the number 2 or one 
of the letters corresponding to the number 2 on the 
keypad, let step 2 be to choose either the number 1 
or one of the letters corresponding to the number 
1 on the keypad, and let steps 3 and 4 be to choose 
either the number 3 or one of the letters correspond- 
ing to the number 3 on the keypad. There are 4 ways 
to perform step 1, 3 ways to perform step 2, and 
4 ways to perform each of steps 3 and 4. So by the 
multiplication rule, there are 4-3-4-4 = 192 ways 
to perform the entire operation. Thus there are 192 
different PINs that are keyed the same as 2133. Note 
that on a computer keyboard, these PINs would not 
be keyed the same way. 


Step 1: Step 2: Step 3: 
Choose the Choose the Choose the 
secretary. treasurer. president. 

Bob 
Cyd 


There are 14 different paths from “root” to “leaf” of 
this possibility tree, and so there are 14 ways the of- 
ficers can be chosen. Because 14 = 2-7, reordering the 
steps will not make it possible to use the multiplication 
rule alone to solve this problem. 


a. The number of ways to perform step 4 is not con- 
stant; it depends on how the previous steps were 
performed. For instance, if 3 digits had been chosen 
in steps 1-3, then there would be 10 — 3 = 7 ways 
to perform step 4, but if 3 letters had been chosen in 
steps 1-3, then there would be 10 ways to perform 
step 4. 


Hint: 


a. The answer is 2”. b. The answer is n”. 


a. The answer is 4-4-4 = 4° = 64. Imagine creating 
a function from a 3-element set to a 4-element set 
as a three-step process: Step 1 is to send the first 
element of the 3-element set to an element of the 
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4-element set (there are four ways to perform this 
step); step 2 is to send the second element of the 
3-element set to an element of the 4-element set 
(there are also four ways to perform this step); and 
step 3 is to send the third element of the 3-element 
set to an element of the 4-element set (there are four 
ways to perform this step). Thus the entire process 
can be performed in 4-4-4 different ways. 


24. The outer loop is iterated 30 times, and during each 


27. 


29. 


31. 


32. 


34. 


35. 


37. 


38. 


39. 


iteration of the outer loop there are 15 iterations of the 
inner loop. Hence, by the multiplication rule, the total 
number of iterations of the inner loop is 30-15 = 450. 


The outer loop is iterated 50 — 5 + 1 = 46 times, 

and during each iteration of the outer loop there are 

20 — 10+ 1 = 11 iterations of the inner loop. Hence, by 
the multiplication rule, the total number of iterations of 
the inner loop is 46-11 = 506. 


Hint: An efficient solution is to add leading zeros as 
needed to make each number five digits long. For 
instance, write 1 as 00001. Then, instead of choosing 
digits for the positions, choose positions for the digits. 
The answer is 720. 


a 
»p. 
b. A divisor is a product of any one of the a+ 1 


a. There are a+ 1 divisors: 1, p, Ds ee 


numbers listed in part (a) times any one of the b+ 1 
numbers 1, q, f, ES g. So, by the multiplication 
rule, there are (a+ 1)(b + 1) divisors in all. 


a. Since the nine letters of the word ALGORITHM are 
all distinct, there are as many arrangements of these 
letters in a row as there are permutations of a set with 
nine elements: 9! = 362,880. 

b. In this case there are effectively eight symbols to 
be permuted (because [AL] may be regarded as a 
single symbol). So the number of arrangements is 
8! = 40,320. 


The same reasoning as in Example 9.2.9 gives an 
answer of 4! = 24. 


WX, WY, WZ, XW, XY, XZ, YW, YX, YZ, ZW, ZX, ZY 
6! 6:5°4°3-2-7 


a GD en A A 
a. P6,3)= 422 o 
a. PO, gyn BE Woy 


6l 
8:7:6:5:4- 3 
x 


E 


P(8,5) = = 6,720 
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41. Proof: Let n be an integer and n = 2. Then 3. a. The set of integers from 1 through 999 with no re- 


P(n+ 1, 2) — P(n, 2) 
_, (n+1)! n! (n+ 1)! n! 
[n+1)—-2]! (n-2)! (n=1)! (n-2)! 
_@+)nG@-! mn- a2! 
m1)! 2)! 


peated digit can be broken into three disjoint subsets: 
those from 1 through 9, those from 1 through 99, and 
those from 100 through 999. Now constructing an 
integer from 100 through 999 with no repeated digit 
can be thought of as a three-step process. 


Step 1: Choose a digit for the left-most position 


: 5 n-(n—1)! (where there are 9 choices because 0 cannot be 
=n +n-(n—n) = 2n=2- (n—D! chosen). 

nl Step 2: Choose a digit for the middle position 
=2: a= = 2P(n, 1). (where there are also 9 choices because the digit in 


This is what was to be proved. 


45. Hint: Let P(n) be the sentence, “There are n! permuta- 


47. a. 


tions of a set with n elements.” In the inductive step, as- 
sume P(k) is true, and let X be a set with k + 1 elements. 
Use a 2-step operation to create a permutation of the 
elements of X, where step 1 is to choose the element to 
write first, and step 2 is to write the remaining elements 
of X in some order. 


2 3 


Woo me VEe 
=. N NEN 


NEU e eW 


1 
l 
1 
1 
l 
1 
1 


Wen Ven Ne 
me e Ww New YK 
We nee Ne 
Seep WeN een 
euv Few VEU 


2 


SECTION 9.3 


1. 


a. Think of creating a bit string with n bits as an n- 
step process where a general step k is to place either 
a0 ora 1 in the kth position. Since there are two 
ways to do this for each position, by the multiplica- 
tion rule, the number of bit strings of length k is 
2* Now the set of all bit strings consisting of from 
1 through 4 bits can be broken into four disjoint 
subsets: 


bit strings 
consisting 


bit strings 
consisting 
of 2 bits 


bit strings 
consisting 
of 3 bits 


bit strings 
consisting 


of 1 bit of 4 bits 


There are 
24 of these. 


There are 
23 of these. 


There are 
2? of these. 


There are 
2 of these. 
Applying the addition rule to the figure shows that 
there are 2 +2? +2? +24 = 30 bit strings consisting 
of from one through four bits. 
b. By reasoning similar to that of part (a), there are 
25+ 26 +2’ +28 = 480 bit strings of from five 
through eight bits. 


the left-most position cannot be reused but 0 can be 
used). 


Step 3: Choose a digit for the right-most position 
(where there are 8 choices because neither of the 
other two digits can be reused). 


Thus there are 9-9-8 integers from 100 through 
999 with no repeated digit. Similar reasoning 
shows that there are 9-9 integers from 10 through 
99 with no repeated digit. Finally, there are clearly 
9 integers from 1 through 9 with no repeated digit. 
Hence, by the addition rule, the number of inte- 
gers from 1 through 999 with no repeated digit is 
9+9-9+9-9-8 = 738. 


b number of integers from 1 through 999 
with at least one repeated digit 


total number of 
integers from 
1 through 999 


= 999 — 738 = 261 
c. The probability that an integer chosen at random 
has at least one repeated digit is 261/999 = 26.1%. 


number of integers 
—| from 1 through 999 
with no repeated digits 


Use the multiplication rule to count the elements in 
each of the three sets containing 1, 2, and 3 letters, re- 
spectively. Then, because these sets are disjoint, use the 
addition rule to compute the total number of elements 
in the three sets taken together. 


Set of Arrangements (without repetition) 
of No More Than 3 Letters of NETWORK 


arrangements 


arrangements 
of no more 
than 2 letters 


arrangements 
of no more 
than 3 letters 


of no more 
than 1 letter 


f f f 
| | | 
There are There are There are 

7 of these. 7-6 of these. 7-6-5 of these. 


Applying the addition rule to the figure above shows that 
there are 7+7:6+7:6:5 = 259 arrangements of three 
letters of the word NETWORK if repetition of letters is 
not permitted. 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


6. In this exercise the 26 letters in the alphabet plus the 
10 digits give a total of 36 symbols that can be used on 
a license plate. 


C. 


Imagine constructing a license plate with 4 sym- 
bols as a four-step process: step 1 is to fill in the 
first symbol, step 2 is to fill in the second symbol, 
step 3 is to fill in the third symbol, and step 4 is 

to fill in the fourth symbol. Because any one of 
the 36 symbols can be used in each step, by the 
multiplication rule, the number of license plates 
that use four symbols is 36". Similarly, the number 
that use 5 symbols is 36°, and the number that use 
six symbols is 36°. Thus because license plates 
can have anywhere from 4 to 6 symbols, the total 
number of plates with repeated symbols allowed is 


361 + 36° + 36° = 2,238,928,128. 


- When repetition is not allowed, the number of 


license plates that use four symbols is 36:35-34:33. 
The reason is that in the second step the symbol 
used in the first step cannot be used, so there are 
only 35 choices for the second step. In the third 
step, neither of the symbols used in the first two 
steps can be used, and so there are only 34 choices 
for the third step. And in the fourth step, none of 
the symbols used in the first three steps can be 
used, and so there are only 33 choices for the fourth 
step. Similarly, the number of license plates that use 
5 symbols is 36-35-34-33-32, and the number that 
use six symbols is 36-35-34-33-+32-31. Thus the 
total number of license plates is 


36-35 -34-33 + 36:35-34-33-32 

+ 36:35-34-33-32-31 = 1,449,063,000. 
Consider two sets: the set of plates with repeti- 
tion not allowed and the set of plates that have a 
repeated symbol. Note that these two sets have no 
elements in common, and that since every license 
plate either has a repeated symbol or does not have 
a repeated symbol, every license plate considered 
in part (a) is in one of the two sets. In other words, 
the set of all license plates with repetition allowed 
is composed of two disjoint subsets: the set of plates 
with repetition not allowed and the set of plates that 
have a repeated symbol. Thus, by the difference 
rule, the number of license plates with a repeated 
symbol is the difference between the number of 
plates with repetition allowed minus the number of 
plates with repetition not allowed: 


2,238,928, 128 — 1,449,063,000 = 789,865,128. 


- The probability that a license plate chosen at 


random has at least one repeated symbol is 
789,865,128 _ 
2,238,928,128 — 35.3%. 


- The 26 letters in the alphabet plus the 10 digits plus 


the 14 special characters give a total of 50 symbols 
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8. a. 


11. a. 


13. a 


that can be used. By the multiplication rule, the 
number of passwords with 3, 4, and 5 symbols is 
50°, 50+, and 50°. Since the sets consisting of these 
passwords are disjoint, by the addition rule, the 
number of passwords is 


50° + 50* + 50° = 318,875,000. 


Hint: One approach is to divide the license plates 
into four groups depending on the number of digits 
and letters they contain. Another approach is to 
consider creating a license plate as a two-step pro- 
cess: step 1: either choose one digit or do not choose 
a digit; and step 2: choose 4 or 5 letters. 


- Each column of the table below corresponds to a 


pair of values of i and j for which the inner loop 
will be iterated. 


i} i) 2--s 3 | 4 : 


JULyiy2) rt) 2;3),i)2),3)4 


oe 


1 2 3 4 


Since there are 1 +2 +3 +4 = 10 columns, the in- 
ner loop will be iterated ten times. 


The answer is the number of permutations of the 
five letters in QUICK, which equals 5! = 120. 
Because QU (in order) is to be considered as a sin- 
gle unit, the answer is the number of permutations 
of the four symbols [QU |, I, C, K. This is 4! = 24. 
By part (b), there are 4! arrangements of | QU |, I, C, 


K. Similarly, there are 4! arrangements of | UQ |, 
I, C, K. Therefore, by the addition rule, there are 


4! +4! = 48 arrangements in all. 


the number of ways to place eight people 
in arow keeping A and B together 


_ | the number of ways to arrange 
[AB] CD E F GH 


BA CDEFGH 
7! = 5,040 + 5,040 = 10,080 


es b number of ways to arrange | 


=7! 


the number of ways to arrange the eight 
people in a row keeping A and B apart 


the total number of ways 
= | to place the eight 
people in a row 


the number of ways to place 
— | the eight people in a row 
keeping A and B together 
= 8! — 10,080 = 40,320 
= 30,240 


10,080 


14. the number of variable names 


_ | the number of numeric the number of string 
variable names variable names 


= (26 + 26:36) + (26 + 26-36) = 1,924 


15. Hint: In exercise 14 note that 
1 
26 + 26-36 = 26%, 36". 
k=0 
Generalize this idea here. Use Theorem 5.2.3 to evalu- 
ate the expression you obtain. 
16. a. 10-9-8-7-6-5-4 = 604,800 
ke number of phone numbers | 


with at least one repeated digit 


__ | the total number 
of phone numbers 


_ | the number of phone numbers 
with no repeated digits 


= 10’ — 604,800 = 9,395,200 


9,395,200 
ce ——_— = 93.95% 
10 
18. a. Proof: Let A and B be mutually disjoint events 
in a sample space S. By the addition rule, 
MA U B) = MA) + N(B). Therefore, by the equally 
likely probability formula, 


N(AU B) N(A)+N(B) 


P(A U B) = 1S TT 
-7O NO L PA) + PE) 
N(S) N(S) l í 


19. Hint: Justify the following answer: 39-38-38. 


20. a. Use strings of five digits to represent integers from 1 
to 100,000 that contain the digit 6 exactly once. For 
example, use 00306 to represent 306. Strings of six 
digits are not needed because 100,000 does not con- 
tain a 6. Imagine constructing a five-digit string that 
contains exactly one 6 as a five-step operation to fill 


Step 1: Choose one of the five positions for the 6. 


Step 2: Choose a digit for the left-most remaining 
position. 

Step 3: Choose a digit for the next remaining posi- 
tion to the right. 


Step 4: Choose a digit for the next remaining posi- 
tion to the right. 


Step 5: Choose a digit for the right-most 
position. 


9.3 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-98 


Since there are 5 choices for step 1 (any one of 
the five positions) and 9 choices for each of steps 
2-5 (any digit except 6), by the multiplication 
rule, the number of ways to perform this op- 
eration is 5-9:9-9-9 = 32,805. Hence there are 
32,805 integers from 1 to 100,000 that contain 
the digit 6 exactly once. 


21. Hint: The answer is 2/3. 


23. a. Let A = the set of integers that are multiples of 4 and 
B = the set of integers that are multiples of 7. Then 
A N B = the set of integers that are multiples of 28. 


Now MA) = 250 since 12345678...999 1000, 
T Ț J 
4-1 4-2... 4-250 
or, equivalently, since 1,000 = 4-250. 


Also M(B) = 142 since 1234567...14...994 995... 1000 
t ¢ ¢ 
T FQ Pa 
or, equivalently, since 1,000 = 7-142 +6. 


And MA N B) = 35 since 123...28...56... 980... 1000, 
t 4 t 
281 28+2.....28°35 

or, equivalently, since 1,000 = 28-35 + 20. 


So MA U B) = 250 + 142 — 35 = 357. 


25. a. LengthO:A 
Length 1: 0, 1 
Length 2: 00, 01, 10, 11 
Length 3: 000, 001, 010, 011, 100, 101, 110 


Length 4: 0000, 0001, 0010, 0011, 0100, 0101, 0110, 
1000, 1001, 1010, 1011, 1100, 1101 

b. By part (a), dọ = 1, dı = 2, d, = 4, d; = 7, and 
dy = 13. 

c. Let k be an integer with k = 3. Any string of length 
k that does not contain the bit pattern 111 starts 
either with a 0 or with a 1. If it starts with a 0, this 
can be followed by any string of k — 1 bits that does 
not contain the pattern 111. There are d,_, of these. 
If the string starts with a 1, then the first two bits 
are 10 or 11. If the first two bits are 10, then these 
can be followed by any string of k — 2 bits that 
does not contain the pattern 111. There are d,_» of 
these. If the string starts with a 11, then the third bit 
must be 0 (because the string does not contain 111), 
and these three bits can be followed by any string 
of k — 3 bits that does not contain the pattern 111. 
There are d,_3 of these. Therefore, for every integer 
k = 3, dy = dy + dk-3. 

d. By parts (b) and (c), d; = dyt+d3;+d) = 13+7+4 

= 24. This is the number of bit strings of length five 
that do not contain the pattern 111. 
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26. 


28. 


29. 


31. 


C. 


C. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Hint: Sk 7 2Sk—1 + 2Sk—2 
Hint: For every integer n = 0, 
V3 +2 V3-2 
5, = d +Vv37' +4 d- v3. 
os 2V3 


a3 = 3 (The three permutations that do not move 
more than one place from their “natural” positions 
are 213, 132, and 123.) 

11001010, = 2+2? +2% +27 = 202 

00111000, = 27+24+2° = 56 

01101011, = 1+2+27+2°+2° = 107 
11101110, = 2+27+2°+2°+2°+27 = 238 

So the answer is 202.56.107.238. 


. The network ID for a Class A network consists of 


8 bits and begins with 0. If all possible combina- 
tions of eight 0’s and I’s that start with a 0 were 
allowed, there would be 2 choices (0 or 1) for each 
of the 7 positions from the second through the 
eighth. This would give 2’ = 128 possible ID’s. But 
because neither 00000000 nor 01111111 is allowed, 
the total is reduced by 2, so there are 126 possible 
Class A networks. 

Let w.x.y.z be the dotted decimal form of the IP ad- 
dress for a computer in a Class A network. Because 
the network IDs for a Class A network go from 
00000001 (= 1) through 01111110 (=126), w can be 
any integer from 1 through 126. In addition, each of 
x, y, and z can be any integer from 0 (=00000000) 
through 255 (=1111111)), except that x, y, and z 
cannot all be 0 simultaneously and cannot all be 
255 simultaneously. 


- Twenty-four positions are allocated for the host ID in 


a Class A network. If each could be either 0 or 1, 
there would be 2” = 16,777,216 possible host 
IDs. But neither all 0’s nor all 1’s is allowed, which 
reduces the total by 2. Thus there are 16,777,214 
possible host IDs in a Class A network. 

Observe that 140 = 128+ 8 +4 = 10001100), 
which begins with 10. Thus the IP address comes 
from a Class B network. An alternative solution 
uses the result of Example 9.3.5: Network IDs for 
Class B networks range from 128 through 191. 
Thus, since 128 = 140 = 191, the given IP address 
is from a Class B network. 


There are 12 possible birth months for A, 12 for B, 
12 for C, and 12 for D, so the total is 12 = 20,736. 
If no two people share the same birth month, there 
are 12 possible birth months for A, 11 for B, 10 for C, 
and 9 for D. Thus the total is 12-11-10-9 = 11,880. 
If at least two people share the same birth month, the 
total number of ways birth months could be associ- 
ated with A, B, C, and D is 20,736 — 11,880 = 8,856. 
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d. The probability that at least two of the four people 
share the same birth month is pie = 42.7%. 


e. When there are five people, the probability 
that at least two share the same birth month is 
12 = 12-11-10-9°8 = 61.8%, and when there are 
more than five people, the probability is even 
greater. Thus, since the probability for four people 
is less than 50%, the group must contain five or 
more people for the probability to be at least 50% 
that two or more share the same birth month. 


32. Hint: Analyze the solution to exercise 31. 


33. 


a. The number of students who checked at 
least one of the statements is N(H) + M(C) + 
N(D)- NAN C)—-NAN D)-NCN D)+ 
NAO CN D) = 28+26+4+ 14-—8-4 
34+2=55. 

b. By the difference rule, the number of students who 
checked none of the statements is the total number 
of students minus the number who checked at least 
one statement. This is 100 — 55 = 45. 

d. The number of students who checked #1 and #2 but 
not #3 is MH O D)—-MHM CN D)=8-2=6. 


35. Let 


37. 


M = the set of married people in the sample, 


Y = the set of people between 20 and 30 in the 
sample, and 


F = the set of females in the sample. 


Then the number of people in the set M U Y U F is less 
than or equal to the size of the sample. And so 


1,200 = MM UYU F) 
= NM) + NY) +NF)-NM NY) 
-NMN FY-NYONF)+MMNYNF) 
= 675 + 682 + 684 — 195 — 467 — 318 + 165 
= 1,226. 


This is impossible since 1,200 < 1,226, so the polltaker’s 
figures are inconsistent. They could not have occurred as 
a result of an actual sample survey. 


Let A be the set of all positive integers less than 1,000 
that are not multiples of 2, and let B be the set of all 
positive integers less than 1,000 that are not multiples 
of 5. Since the only prime factors of 1,000 are 2 and 5, 
the number of positive integers that have no common 
factors with 1,000 is N(A N B). Let the universe U be 
the set of all positive integers less than 1,000. Then A‘ 
is the set of positive integers less than 1,000 that are 
multiples of 2, B° is the set of positive integers less than 
1,000 that are multiples of 5, and A N B“ is the set of 
positive integers less than 1,000 that are multiples of 
10. By one of the procedures discussed in Section 9.1 
or 9.2, it is easily found that M(A‘) = 499, N(BS) = 199, 


40. 


42. 


43. 


48. 


49. 


and N(A‘ N B°) = 99. Thus, by the inclusion/exclusion 
rule, 


NAS U B®) = N(A®) + N(B‘) — MAS A BS) 
= 499 + 199 — 99 = 599. 
But by De Morgans law, N(A° U B®) = N((A N BY‘), 
and so 


N((A N BY‘) = 599. 0) 


Now since (A N B) = U-— (A N B), by the difference 
rule we have 

N(A N B) = NU) -NA N B). C 
Equating the right-hand sides of (*) and (**) gives 
N(U)— NA N B) = 599. And because N(U) = 999, we 
conclude that 999 — N(A N B) = 599, or, equivalently, 
N(A N B) = 999 — 599 = 400. So there are 400 positive 
integers less than 1,000 that have no common factor with 
1,000. 


Hint: Let A and B be the sets of all positive integers less 
than or equal to n that are divisible by p and q, respec- 
tively. Then (n) = n — (NA U B)). 


c. Hint: If k = 6, any sequence of k games must begin 
with W, LW, or LLW, where L stands for “lose” and 
W stands for “win.” 


c. Hint: Divide the set of all derangements into two 
subsets: one subset consists of all derangements in 
which the number 1 changes places with another 
number, and the other subset consists of all de- 
rangements in which the number 1 goes to position 
i + 1 but i does not go to position 1. The answer is 
dy = (k — Vd + (k 


1)d,—. Can you justify it? 


Hint: Use the associative law for sets from Theorem 
6.2.2 and the generalized distributive law for sets from 
exercise 40, Section 6.2. 


Hint: Use the solution method described in Section 5.8. 
The answer is 5; = 25,1 + 384-2 for every integer k = 4. 


SECTION 9.4 


1. 


a. No. For instance, the aces of the four different suits 
could be selected. 

b. Yes. Let x1, x2, X3, X4, X5 be the five cards. Consider 
the function S that sends each card to its suit. 


5 cards (pigeons) 4 suits (pigeonholes) 


S, 
eclub 
S(x;) = the suit 


of x; e diamond 


e heart 


e spade 


9.4 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


3. 


5. 


7. 
9. 


A-100 


By the pigeonhole principle, S is not one-to-one: 
S(x) = S(x;) for some two cards x; and x;. Hence at 
least two cards have the same suit. 


Yes. Denote the residents by x, X2, ..., X500. Consider 
the function B from residents to birthdays that sends 
each resident to his or her birthday: 


500 residents (pigeons) 366 birthdays (pigeonholes) 


B 
— 


B(x;) = the birthday 
of x; 


By the pigeonhole principle, B is not one-to-one: 
B(x;) = B(x;) for some two residents x; and x;. Hence at 
least two residents have the same birthday. 


a. Yes. There are only three possible remainders that 
can be obtained when an integer is divided by 3: 0, 
1, and 2. Thus, by the pigeonhole principle, if four 
integers are each divided by 3, then at least two of 
them must have the same remainder. 


More formally, call the integers n4, n2, n3, and n4, 
and consider the function R that sends each integer 
to the remainder obtained when that integer is 
divided by 3: 


4 integers (pigeons) 3 remainders (pigeonholes) 


R 
a 


R(n;) = the remainder 
obtained when n; 
is divided by 3 


By the pigeonhole principle, R is not one-to-one: 
R(n;) = R(n;) for some two integers n; and n;. Hence 
at least two integers must have the same remainder. 

b. No. For instance, {0, 1, 2} is a set of three integers 
no two of which have the same remainder when 
divided by 3. 


Hint: Look at Example 9.4.3. 


a. Yes. 


Solution 1: Only six of the numbers from 1 to 12 
are even (namely, 2, 4, 6, 8, 10, 12), so at most six 
even numbers can be chosen from between 1 and 
12 inclusive. Hence if seven numbers are chosen, at 
least one must be odd. 
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Solution 2: Partition the set of all integers from 1 
through 12 into six subsets (the pigeonholes), each 
consisting of an odd and an even number: {1, 2}, 

{3, 4}, {5, 6}, {7, 8}, (9, 10}, {11, 12}. If seven inte- 
gers (the pigeons) are chosen from among 1 through 
12, then, by the pigeonhole principle, at least two 
must be from the same subset. But each subset con- 
tains one odd and one even number. Hence at least 
one of the seven numbers is odd. 


Solution 3: (a formal version of Solution 2): Let 

S = {x1, X2, X3, X4, X5, X6, X7) be a set of seven num- 
bers chosen from the set T = {1, 2, 3, 4, 5, 6, 7, 8, 

9, 10, 11, 12), and let P be the following partition of 
T: {1, 2}, {3, 4}, {5, 6}, {7, 8}, {9, 10}, and {11, 12}. 
Since each element of S lies in exactly one subset of 
the partition, we can define a function F from 5 to P 
by letting F(x) be the subset that contains x;. 


P (pigeonholes) 


S (pigeons) 


E 
— 


F(x;) = the subset 
that 
contains x; 


Since S has 7 elements and P has 6 elements, by the 
pigeonhole principle, F is not one-to-one. Thus two 
distinct numbers of the seven are sent to the same 
subset, which implies that these two numbers are 
the two distinct elements of the subset. Therefore, 
since each pair consists of one odd and one even 
integer, one of the seven numbers is odd. 

b. No. For instance, none of the 10 numbers 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19 is even. 


10. Yes. There are n even integers in the set {1, 2, 3,..., 2n}, 


12. 


14 


namely, 2(= 2:1), 4(= 2:2), 6(= 2-3),..., 2n(= 2:n). 
So the maximum number of even integers that can be 


chosen is n. Thus if n + 1 integers are chosen, at least one 
of them must be odd. 


The answer is 27. There are only 26 black cards in a 
standard 52-card deck, so at most 26 black cards can be 
chosen. Hence if 27 are taken, at least one must be red. 


There are 61 integers from 0 through 60. Of these, 31 are 
even (0 = 2:0,2 = 2-1,4 = 2-2,...,60 = 2-30) and 
so 30 are odd. Hence if 32 integers are chosen, at least 
one must be odd, and if 31 integers are chosen, at least 
one must be even. 
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24 


25. 


26. 


29. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


The answer is 8. (There are only seven possible remain- 
ders for division by 7: 0, 1, 2, 3, 4, 5, 6. Hence if 8 are 
chosen, at least two must be the same.) 


a. The answer is 20,483 because the possible remain- 
ders are 0, 1, 2,..., 20482. 

b. The length of the repeating section of the decimal 
representation of 5/20483 is less than or equal to 
20,482. The reason is that 20,482 is the number of 
nonzero remainders that can be obtained when a 
number is divided by 20,483. Thus, in the long-di- 
vision process of dividing 5 by 20,483, either some 
remainder is 0 and the decimal expansion termi- 
nates, or only nonzero remainders are obtained and 
at some point within the first 20,482 successive 
divisions, a nonzero remainder is repeated. At that 
point the digits in the developing decimal expansion 
begin to repeat because the sequence of successive 
remainders repeats those previously obtained. 


This number is irrational because the decimal expan- 
sion neither terminates nor repeats. 


Let A be the set of the thirteen chosen numbers, and 

let B be the set of all prime numbers between 1 and 40. 
Note that B = {2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37}. 
For each x in A, let F(x) be the smallest prime number 
that divides x. Since A has 13 elements and B has 12 el- 
ements, by the pigeonhole principle F is not one-to-one. 
Thus F(x) = F(x) for some x, # x,in A. By defini- 
tion of F, this means that the smallest prime number 
that divides x, equals the smallest prime number that 
divides x2. Therefore, two numbers in A—namely, x, 
and x.—have a common divisor greater than 1. [Strictly 
speaking, only integers less than or equal to 20 can divide 
integers less than or equal to 40. So we could have made 
the set B even smaller.] 


Yes. This follows from the generalized pigeonhole prin- 
ciple with 30 pigeons, 12 pigeonholes, and k = 2, using 
the fact that 30 > 2-12. 


No. For instance, the birthdays of the 30 people could 
be distributed as follows: three birthdays in each of the 
six months January through June and two birthdays in 
each of the six months July through December. 


The answer is x = 3. There are 18 years from 17 through 
34. Now 40 > 18-2, so by the generalized pigeonhole 
principle, you can be sure that there are at least x = 3 
students of the same age. However, since 18-3 > 40, 
you cannot be sure of having more than three students 
with the same age. (For instance, three students could be 
each of the ages 17 through 20, and two could be each of 
the ages from 21 through 34.) So x cannot be taken to be 
greater than 3. 


9.5 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-102 


31. Hint: Use the same type of reasoning as in Exam- b. Unordered selections: {a, b, c, d}, {a, b, c, e}, 
ple 9.4.6. {a, b, d, e}, {a, c, d, e}, {b, c, d, e}. 
32. Hints: (1) The number of subsets of the six integers Hence, G) =5. 
is 2° = 64. (2) Since each integer is less than 13, the 
largest possible sum is 57. (Why? How is this sum 
obtained?) 3. P(7,2) = ( J 
33. Hint: The power set of A has 2° = 64 elements, and 
so there are 63 nonempty subsets of A. Let k be the 5. a. (6) = 6! = 6! =] 
smallest number in A. Then the sums over the ele- 0 016-0)! 161 
ments in the nonempty subsets of A lie in the range 6 6! 651 
from k through k+ 10+11+12+13+14=k+60. (*)- ae Ses 
How many numbers are in this range? i f aa 

35. Hint: Let X be the set consisting of the given 52 positive 6. a. the number of committees of 6 

integers, and let Y be the set containing the following 

elements: {00}, {50}, {01, 99}, {02, 98}, {03, 97},..., ($) = 

{48, 52}, {49, 51}. Define a function F from X to Y by the 6 (15 — 6)!6! 

rule F(x) = the set containing the last two digits of x. 7 5 

Use the pigeonhole principle to argue that F is not one- _ 38 14-13-¥2-11 10-9 = 5.005 

to-one, and show how the desired conclusion follows. of : 6 i 3 4 4 i 3 i 2 , 
36. Hint: Write the 101 integers as x), x2, x3,..., Xo), and b. | the number of 

represent each x; as ap 2" where a; is odd and k; > 0. nO that 

Now 1 < x; = 200, and so 1 = a; = 199 for every i. Use don toont A 

the fact that there are only 100 odd integers from 1 to and B together 

199 inclusive. 

37. b. Hint: For each k = 1, 2,...,n, let ap = xı + the number of the number of 
Xo +++++2,. If some a; is divisible by n, then the = oe with A committees with B 
problem is solved: the consecutive subsequence is and five others— and five others— 
Xis X2, . - - , X% If no a, is divisible by n, then aj, az, Tone gE them B tone otithem A 
a3, . . . , Ap satisfies the hypothesis of part (a). Hence 


the number of 
+ | committees with 
neither A nor B 


a; — a; is divisible by n for some integers i and j with 
j > i. Write a; — a; in terms of the x;’s to derive the 
given conclusion. 


38. Hint: Let ay, do,..., a+, be any sequence ofn’ +1 = ($) +4 a 4 (2) 
distinct real numbers, and suppose that this sequence 5 5 6 


contains neither a strictly increasing subsequence of 


= 1,287 + 1,287 + 1,716 = 4,2 
length n+ 1 nor a strictly decreasing subsequence of ae 28 AOZO 


length n + 1. Let S be the set of all ordered pairs of Alternative solution: 
integers (i, d), where 1 =i S n and 1 S d £ n. For each 
term a; in the sequence, let F(a}) = (ip, dy), where i, is 
the length of the longest increasing sequence starting 

at a,, and d; is the length of the longest decreasing se- 
quence starting at a,. Suppose that F is one-to-one and | 


the number of committees 
that don’t contain 
A and B together 


derive a contradiction. 


the total number 
of committees 


SECTION 9.5 


_ | the number of committees 
that contain both A and B 


1. a. 2-combinations: {x;, x2}, {x1, X3} {xX2; x3}. 15) /B 
3 6 4 
ii (3) = = 5,005 — 715 = 4,290 
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Tl. 


the number of the number of 
c. | committees with | + | committees with 
both A and B neither A nor B 


13). (13 
= = 715 + 1,716 = 2,431 


d. (i) 


the number of subsets 
of three men 
chosen from eight 


the number of subsets 
-| of three women 
chosen from seven 


8\/7 
= = 56°35 = 1,960 
the number of 


committees with 
at least one woman 


(ii) 


the total the number 
= | number of |— | of all-male 
committees committees 
15 8 
— = 5,005 — 28 
= 4,977 


the number of 
ways to choose 
two sophomores 


the number of 
ways to choose |- 
two freshmen 


the number of 
ways to choose 


the number of 
-| ways to choose |: 


two juniors two seniors 
_ (3\(4)\/3\/5 

2) \2)/\2/\2 
= 540 


. Hint: The answers are a. 1001, b. (i) 420, (ii) all 1001 


require proof, (iii) 175, c. 506, d. 561 
24\/16 24\/16 24\ / 16 
b. + + 
+ sll eee = 3,223,220 
6 0 3223, 


4 (because there are as many royal flushes as 
there are suits) 
ae i = 0.0000015 

(2) 2,598,960 ` 


c. (1) 13-(*) = 624 (because one can first choose 


the denomination of the four-of-a-kind and 
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then choose one additional card from the 48 
remaining) 
624 624 


6 = 2,598,960 = 0.00024 


Imagine constructing a straight (including a 
straight flush and a royal flush) as a six-step 
process: step 1 is to choose the lowest denomi- 
nation of any card of the five (which can be 
any one of A, 2,..., 10), step 2 is to choose a 
card of that denomination, step 3 is to choose 
a card of the next higher denomination, and so 
forth until all five cards have been selected. By 
the multiplication rule, the number of ways to 
perform this process is 


OOOO- em 


By parts (a) and (b), 40 of these numbers 
represent royal or straight flushes, so there are 
10,240 — 40 = 10,200 straights in all. 
10,200 _ 10,200 

(2) 2,598,960 


2) = 0.0039 


13. a. 2! = 1,024 


the number of 
d. | outcomes with 
at least one head 


the total the number of 
= | number of |— | outcomes with 
outcomes no heads 


1,024 — 1 = 1,023 


15. a. 50 b. 50 
c. To get an even sum, both numbers must be even or 
both must be odd. Hence 


the number of subsets 
of two integers chosen 
from 1 through 100 
whose sum is even 


the number of 
subsets of two odd 
integers chosen from 
the 50 odd integers 


the number of 
subsets of two even 
integers chosen from 
the 50 even integers 


50\ , (50 
= + = 2,450. 
(2)+(2) “240 


d. To obtain an odd sum, one of the numbers must 

be even and the other odd. Hence the answer is 

° J-C) = 2,500. Alternatively, note that the answer 
equals the total number of subsets of two integers 


17. a. 


19. a. 


chosen from 1 through 100 minus the number of 
such subsets for which the sum of the elements is 


even. Thus the answer is (2) — 2,450 = 2,500. 


Two points determine a line. Hence 


the number of subsets 
of two points 
chosen from ten 


: ($) = 45, 


the number of straight 
lines determined 
by the ten points 


10! 

Sanu 151,200 since there are 2 A’s, 1 B, 1 

21111131211! H, 3 L’s, 2 O’s, and 1 U 
8! ! 

QNNI2!21 5,040" &. 121113121 13,120 


23. The rook must move seven squares to the right and 
seven squares up, so 


the number 


the number of 


paths the rook | = 


of orderings 
of seven R’s 


where R stands 


can take for “right” and U 
and seven U’s | stands for “up” 
a2 = 3,432 
“ae 
24. b. Solution 1: One factor can be 1, and the other factor 


can be the product of all the primes. (This gives 1 
factorization.) One factor can be one of the primes, 
and the other factor can be the product of the other 
three. (This gives 6) = 4 factorizations.) One factor 
can be a product of two of the primes, and the other 
factor can be a product of the two other primes. 
The number (6) = 6 counts all possible sets of two 
primes chosen from the four primes, and each set 
of two primes corresponds to a factorization. Note, 
however, that the set {p,, p2} corresponds to the 
same factorization as the set {p3, p4}, namely, pı p2 
P3P4 (just written in a different order). In general, 
each choice of two primes corresponds to the same 
factorization as one other choice of two primes. 
Thus the number of factorizations in which each 


Y 
factor is a product of two primes is > = 3. 


(This gives 3 factorizations.) The foregoing cases 
account for all the possibilities, so the answer is 
4+3+1=8. 


Solution 2: Let S = {py, P2, P3, pa}. Let pi pop3 

p4 = P, and let fi ‘fa be any factorization of P. The 
product of the numbers in any subset A C S$ can be 
used for f}, with the product of the numbers in AÀ 
being f2. There are as many ways to write fı as there 
are subsets of S, namely, 2* = 16 (by Theorem 6.3.1). 
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25. a. 


A-104 


However, because f; -/: = fafi, and because two fac- 
torizations are considered the same regardless of the 

order in which the factors are written, the number of 

ways to write P as a product of two factors is half the 
number of subsets of S. So the answer is £ =8. 


There are four choices for where to send the first el- 
ement of the domain (any element of the co-domain 
may be chosen), three choices for where to send 

the second (since the function is one-to-one, the 
second element of the domain must go to a different 
element of the co-domain from the one to which the 
first element went), and two choices for where to 
send the third element (again since the function is 
one-to-one). Thus the answer is 4:3:2 = 24. 

none 

Hint: The answer is n(n— 1): (n—=m+ 1). 


Let the elements of the domain be called a, b, and c 
and the elements of the co-domain be called u and v. 
In order for a function from {a, b, c} to {u, v} to be 
onto, two elements of the domain must be sent to u and 
one to v, or two elements must be sent to v and one to 
u. There are as many ways to send two elements of the 
domain to u and one to v as there are ways to choose 
which elements of {a, b, c} to send to u, namely, 

G) = 3. Similarly, there are G) = 3 ways to send two 
elements of the domain to v and one to u. Therefore, 
there are 3 +3 = 6 onto functions from a set with 
three elements to a set with two elements. 

Hint: The answer is 6. 

Consider functions from a set with four elements 
to a set with two elements. Denote the set of four 
elements by X = {a, b, c, d} and the set of two 
elements by Y = {u, v}. Divide the set of all onto 
functions from X to Y into two categories. The first 
category consists of all those that send the three 
elements in {a, b, c} onto {u, v} and that send d to 
either u or v. The functions in this category can be 
defined by the following two-step process: 


Step 1: Construct an onto function from 
{a, b, c} to {u, v}. 


Step 2: Choose whether to send d to u or to v. 


By part (a), there are six ways to perform step 1, 
and, because there are two choices for where to 
send d, there are two ways to perform step 2. Thus, 
by the multiplication rule, there are 6-2 = 12 ways 
to define the functions in the first category. 


The second category consists of all the other onto 
functions from X to Y: those that send all three 
elements in {a, b, c} to either u or v and that send d 
to whichever of u or v is not the image of a, b, and 
c. Because there are only two choices for where 

to send the elements in {a, b, c}, and because d is 
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27. 


28. 


simply sent to wherever a, b, and c do not go, there 
are just two functions in the second category. 


Every onto function from X to Y either sends at least 
two elements of X to the image of d or it does not. 
If it does, then it is in the first category. If it does 
not, then it is in the second category. Therefore, all 
onto functions from X to Y are in one of the two 
categories and no function is in both categories. So 
the total number of onto functions is 12 +2 = 14. 


a. A relation on A is any subset of A X A, and A X A 
has 8° = 64 elements. So there are 2™ relations on A. 
c. Form a relation that is both reflexive and symmetric 
by a two-step process: (1) pick a set of elements 
of the form (a, a) (there are eight such elements, 
so 28 sets); (2) pick a set of pairs of elements of 
the form (a, b) and (b, a) where a + b (there are 
(64 — 8)/2 = 28 such pairs, so 2°8 such sets). The 
answer is therefore 2° x 278 = 2%, 


Hint: Use the difference rule and the generalization of 
the inclusion/exclusion rule for 4 sets. (See exercise 48 
in Section 9.3.) 


SECTION 9.6 


1. 


a =Q- a. 


b. The three elements of the set are 1, 2, and 3. The 
5-combinations are [1, 1, 1, 1, 1], [1, 1, 1, 1, 2], 1, 1, 
1, 1, 3], 12,2 (1, 1, 1, 2, 3], 
{1, 1, 1, 3, 3], i 21; [5-1;.2,-2;.3]; 

» U1, 2, 2, 2, 3], 

{1, 3, 3, 3, 3], 
3], [2, 2, 2, 3, 3], 
3], and [3, 3, 3, 3, 3]. 


[i 
2 
29) 
3,3 
2, 2: 
3, 


vo = J = 53,130 


b. Ifat least three are éclairs, then 17 additional 


pastries are selected from six kinds. The number of 
17+6 


e 
Note: In parts (a) and (b), it is assumed that the 
selections being counted are unordered. 

c. Let T be the set of selections of pastry that may 


selections is ( 


be any one of the six kinds, let E=; be the set of 
selections containing three or more éclairs, and let 
Es,» be the set of selections containing two or fewer 
éclairs. Then 


N E22) = MT) — NEz3) because T = Ezz U Ex; 
and Es N Ex; = © 
= 53,130 — 26,334 by parts (a) and (b) 


= 26,796. 
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10. 


11. 


16. 


APPENDIX B SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Thus there are 26,796 selections of pastry contain- 
ing at most two éclairs. 


The answer equals the number of 4-combinations with 
repetition allowed that can be formed from a set of n 
elements. It is 


4t+n—-1\_ [{[n+3 
4 4 
| (nt+3\n+2)(n+ Danni)! 
A\(n—T)! 
_ nant Int 2)(n + 3) 
24 ` 
As in Example 9.6.4, the answer is the same as the 
number of quadruples of integers (i, j, k, m) for which 
1sisjs=k=m <n. By exercise 5, this number is 


l + 3 n(n + 1)(n + 2)(n + 3) 
4) 24 3 


Think of the number 20 as divided into 20 individual 
units and the variables x), x2, and x3 as three catego- 
ries into which these units are placed. The number of 
units in category x; indicates the value of x;, in a solu- 
tion of the equation. By Theorem 9.6.1, the number of 
ways to select 20 objects from the three categories is 
eS i) 2 2) — 22:21 
20 20 


z = 231, so there are 231 non- 
negative integer solutions to the equation. 


The analysis for this exercise is the same as for exercise 
10 except that since each x; = 1, we can imagine taking 
3 of the 20 units, placing one in each category x), x2, and 
x3, and then distributing the remaining 17 units among 


the three categories. The number of ways to do this is 


(7 va o) = (2) = — = 171, so there are 171 positive 


integer solutions to the equation. 


a. Let Ly, be the set of selections that include at least 
seven cans of lemonade. In this case an additional 
eight cans can be selected from the five types of soft 
drinks, and so 


N(Lz7) = ~ ') = a = 495. 


Let T be the set of selections of cans in which the 
soft drink may be any one of the five types as- 
suming that there are at least 15 cans of each type 
assuming that there are at least 15 cans of each type 
and let L- be the set of selections that contain at 
most six cans of lemonade. Then 
N(L=6) = MT) — N(L=7) because T = Leg U L=7 
= 3,876 — 495 and Lag Lay = Ø 
_ by the above and part (a) 
= 3,381. of Example 9.6.2 


Thus there are 3,381 selections of fifteen cans of soft 
drinks that contain at most six cans of lemonade. 


b. Let R=; be the set of selections containing at most 
five cans of root beer, and let Le¢ be the set of 
selections containing at most six cans of lemonade. 
The answer to the question can be represented as 
N(Res N Leg). As in part (a), let T be the set of 
all the selections of fifteen cans in which the soft 
drink may be any one of the five types assuming 
that there are at least 15 cans of each type. If you 
remove all the selections from T that contain at 
least six cans of root beer or at least seven cans of 
lemonade, then you are left with all the selections 
that contain at most five cans of root beer and at 
most six cans of lemonade. Thus, in the notation of 
part (a) and Example 9.6.2, 


N(Res N Leo) = N(T) — N(R=6 U L=7). (*) 
Use the inclusion/exclusion rule as follows to com- 
pute N(R=6 U L=7): 
N(R=6 U L=7) 
= N(R=6) + N(L=7) — N(R=6 N L=7). 


To find N(R=6 N L=7), observe that if at least 6 cans 
of root beer and at least 7 cans of lemonade are 
selected, then at most 2 additional cans of soft drink 
can be chosen from the other three types to make 
up the total of 15 cans. A selection of two such cans 


can be represented by a string of 2 X’s and 3|’s, and 
a selection of one such can can be represented by a 
string of 1 X and 3|’s. Hence 


2+3-1 1l+3=1 
N(R=6 N Lez) = 2 + 1 


_ (4 
2 
It follows that 


N(R=6 U Laz) = N(Rag + N(L=7) 


by the inclusion/ 


exclusion rule 
=N (R=6 N L=7) 
= 715 +495- 9 by part (a) and 
the computa- 
= 1,201. tion above, and 
by part (b) of 
Example 9.6.2 


Putting this result together with equation (*) and the 
value of N(T) from Example 9.6.2(a) gives that 
MRes N Leg) = NT) —NR=6 U L=7) 
= 3,876 — 1,201 = 2,675. 


Thus there are 2,675 selections of fifteen soft drinks 
that contain at most five cans of root beer and at 
most six cans of lemonade. 


17. Hints: a. The answer is 10,295,472. b. See the so- 
lution to part (c) of Example 9.6.2. The answer is 
9,949,368. c. The answer is 9,111,432. d. Let T denote 
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the set of all the selections of thirty balloons, assum- 
ing that there are at least 30 of each color. Let Re 
denote the set of selections from T that contain at most 
twelve red balloons, let Beg denote the set of selec- 
tions from T that contain at most eight blue balloons, 
let R=|3 denote the set of selections that contain at 
least thirteen red balloons, and let Bs» denote the set 
of selections that contain at least nine blue balloons. 
Then the answer to the question can be represented 

as N(Rej. N Beg). Out of the total of all the balloon 
selections, if you remove the selections containing at 
least thirteen red or at least nine blue balloons, then 
you are left with the selections containing at most 
twelve red and at most eight blue balloons. Thus 
N(R<12 N Bag) = MT) — N(R=13 U Bao). Compute 
N(R=13 N Bey), and use the inclusion/exclusion rule to 
find MR» ;3 U Bso). 


19. Hints: The answers are a. 51,128 b. 46,761 
SECTION 9.7 
1 n n! nÍ 1 

` (0) On-0! Lent 


3 n\ _ n! _ ne(n—1)-=2y! 
2] (n—2)!-2! (n—-2)!-2! 


E n(n— 1) 
© 2 
5. Proof: Suppose n and r are nonnegative integers and 
r =n. Then 
ny _ n! by Theorem 9.5.1 
r ri(n—r)! 
n! 
= since n — (n — r) = 
m-n- S 
n! 


by interchanging the fac- 
tors in the denominator 


T n-un- n- r)! 


by Theorem 9.5.1. 


6. Solution 1: Apply formula (9.7.2) with m + k in place of 
n. This is legal because m +k = 1. 


Solution 2: 
m+k (m+ k)! 
ee ~ (m+k—1)![(m+k)—-(m+k-—1)]! 
(m+k)-(m+k—1)! 
(m+k—1)\m+k-—m—k+1)! 
(m+k):-(m+k—1)! 
~ (m+k-DŅDL1 


+k 
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e 
9-0-0- 
J- 
REKONE 
» Q-Q- 
D-O- 
Q-a 


c. Rowforn=7: 1 7 21 35 35 21 7 1 


13. Proof by mathematical induction: Let the property P (n) 
be the formula 


nth fy _ n+2 Fe 
> 2 3 . < P(n). 


Show that P (1) is true: 
To prove P (1) we must show that 


Se) 6C) 
so P(1) is true. 


Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 


Let k be any integer with k = 1, and suppose that 
kifi k+2 MIRA , 
` 2 = 3 . << P(k) inductive hypothesis 


We must show that 


(k+1)+1 i B (k+1)+2 
S) 


or, equivalently, 


5 = . —P(k+1) 
i=2 2 3 
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Now the left-hand side of P(k + 1) is 
k+2 [> Lael ay A k+2 
zlo)= 213)" 
= \2 = \2 2 
_{k+2\  (k+2 
3 2 
— ((k+2)+1 
3 
_ [k+3 
3 F 
which is the right-hand side of P(k+ 1) [as was to be 


shown]. [Since we have proved the basis step and the in- 
ductive step, we conclude that P (n) is true for alln = 1.] 


by writing the last 
term separately 


by inductive 
hypothesis 


by Pascal’s formula 


14. Hint: Use the results of exercises 3 and 13. 


17. Hint: This follows by letting m = n = r in exercise 16 
and using the result of Example 9.7.2. 


omfet 


= 1 +7x +2132 +35 + 35x44 210° + 7x° + x7 


6 , (6 6 
21. 1+6(—x)4 (5) x) (5) x4 (o) x) 
P (Jv + 


= 1 — 6x + 15x? — 20 + 15x4 — 62° + xÉ 


23. (p—29)'= > (ez 
= 
= (Je 2q) (r 24)! 
(i) 2q} + ($)o 2q)° 
+ (roza 


= p 8p°q H 24p q 32pq° H 16q' 


1 1 
= 74 5x° ieee 424 
x 


29. The term is (3)x°y = 84x°y3, so the coefficient is 84. 


31. The term is ()a°(—2b)’ = 792a°(—128)b = 
= 101,376a°b’, so the coefficient is — 101,376. 


33. The term is (', )(3p?)(—2g)’ = (‘7)3°(—2)’p!°q’, so the 
coefficient is (',)3°(—2)’ = —5,404, 164,480. 


36. Proof: Let a = 1, let b = —1, and let n be a positive 
integer. Substitute into the binomial theorem to obtain 


dsp = Shi) ren 


k=0 


n n k 
= Gy 
3 (7) 


On the other hand, (1 + (—1))" = 0” = 0, so 


nso 


k=0 


since 1” * = 1. 


37. Hint:3 =2+1 


38. Proof: Let m be any integer with m = 0, and apply the 
binomial theorem with a = 2 and b = —1. The result is 


1= 1” — (2 } ( 1)" Z 5 ("ney 


i=0 


Z if ™ moi 
= 2A) ( ")p : 


41. Hint: Apply the binomial theorem with a = 1 and 
b = —3, and analyze the resulting equation when n is 
even and when n is odd. 


Z N \ zk d NY) n-kek _ n gn 
43. 3 (7s -3(")1 5 = (1+5)"=6 


k=0 k=0 
n n 4 n n ae 
45. F= Wy = 1+ n 
SU- ¥ (rear 
2n fI . 2n p : F 
47. So "| ( "eaaa 
j=0 J j=0 \ J 
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m -(m\1 m m A 1\i 
5. S(-D( Jo = yi —— 
Seoti 37) a) 


53. Scn(")sr2 = > (ea 
f i 


i=0 \! 


=(5 —2)" = 3" 
55. b. n(ltx" = 5 Je 
( ) si! 


[The term corresponding to k = 0 is zero because 
d 
a) = 0] 

- (i) Substitute x = 1 in part (b) above to obtain 


A= n n x n n 
nd +1)" = 3(7Jer t= S(;) 


k=1 


oebe 


Dividing both sides by n and simplifying gives 
hs" E eal) eka a 
n| \1 2 3 n 


SECTION 9.8 


1. By probability axiom 2, P(@) = 0. 


A 


2. a. By probability axiom 3, P(A U B) = 
P(A) + P(B) = 0.3 + 0.5 = 0.8. 
b. Because A U B U C = Sand because A, B, and C 
are mutually exclusive events, C = S—(A U B). 
Thus, by the formula for the probability of the 
complement of an event, P(C) = P((A U B)®) = 
1—P(A U B) = 1-0.8 = 0.2. 


4. By the formula for the probability of a general 
union of two events, P(A U B) = P(A) + P(B) — 
P(A N B)=0.8+0.7-0.6 = 0.9. 


7. a P(A U B) = 0.4+0.3 = 0.7 

b. P(C) = P(A U B)) = 1 — P(A U B) = 
1-0.7=03 

c. P(A U C) = 0.4+0.3 = 0.7 

d. P(A‘) = 1— P(A) = 1-0.4 = 0.6 

e. P(A‘ N B®) = P(A U BY) = 1 — P(A UB) = 
1-0.7 =0.3 

f. P(4® U B®) = P(A N B)) = P(@") = P(S)=1 


9. a. P(A U B) = P(A) + P(B) — P(A N B) = 
0.4+0.5-—0.2 = 0.7 
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d. P(A‘ N B°) = P(A U BY) = 1 — P(A UB) = 


1 2 3 4 5 6 

pase 1 2 3 4 5 6 7 

Tl. Hint: Since U C V, V= U U (V- U) 2 3 4 5 6 T 8 
12. Hint: For arbitrarily chosen sets U and V, 3 4 5 6 l 8 9 
UU(V-U)=UUV. 4 5 6 7 8 9 10 
13. Hint: (AUAU ... UAD N Aa =Ø 5 6 7 8 9 10 11 
and Ay UA) U ... U Ag U Aga = 6 | 7 8 9 10 11 12 


(Ay UA, U ... U Ap) U Akı. 
Each cell in the table represents an outcome whose prob- 


14. Solution 1: The net gain of the grand prize winner is ability is * . Thus the expected value of the sum is 


$2,000,000 — $2 = $1,999,998. Each of the 10,000 sec- 


ond prize winners has a net gain of $20 — $2 = $18, and A ! l 2 ) q 3 ) { 4 ) ' d 5 ) | ( 6 ) 
each of the 50,000 third prize winners has a net gain of 36 36 36 36 36 36 
$4 — $2 = $2. The number of people who do not win any- 5 4 a), AR 1 
thing is 1,500,000 — 1 — 10,000 — 50,000 = 1,439,999, DETI REACTS DEN EYI DELI ETI ae 
and each of these people has a net loss of $2. Because all 252 
of the 1,500,000 tickets have an equal chance of winning =- 
a prize, the expected gain or loss of a ticket is 36 
————— ($1,999,998 -1 + $18- 10000 20. Hint: The answer is about 7.7 cents. 
1500000 ©? i " 

+ $2-50000 + (—$2)- 1,439,999) = — $0.40. 22. Hint: The answer is 1.875. 


23. Hint: To derive Pao, use the distinct roots theorem from 
5300 520 


=1. 


Solution 2: The total income to the lottery organizer is 
$2 (per ticket): 1,500,000 (tickets) = $3,000,000. The Section 5.8. The answer is Po = 
payout the lottery organizer must make is $2,000,000 + 

($20)(L0,000) + ($4)(50,000) = $2,400,000, so the net 

gain to the lottery organizer is $600,000, which amounts S ECTI O N 9.9 


to aati = $0.40 per ticket. Thus the expected net loss 1. P(B) = 


530] 


P(ANB) 1/6 1 
P(A|B) 1/23 


to a purchaser of a ticket is $0.40. 


16. Let 2; and 2, denote the two balls with the number 2, 3. Hint: The answer is 60%. 


and let 5 and 6 denote the other two balls. There are 
4. a. Proof: Suppose S is any sample space and A and B 


are any events in S such that P(B) # 0. Note that 
(1) A U AS = S by the complement law for U. 
(2) BO S = B by the identity law for N. 

(3) BN (A UA‘) = (A N B) U (ASN B) by the 


() = 4 subsets of 2 balls that can be chosen from the 
urn. The following table shows the sums of the numbers 
on the balls in each set and the corresponding prob- 
abilities: 


Subset Sums | Probability that the sum = s distributive law and commutative laws for sets. 

BL 2} 4 1/6 (4) (AN B) N (ASN B) = Ø by the complement 
law for N and the commutative and associative 

{21, 5},{22, 5} 7 2/6 laws for sets. 

{21, 6}{2>, 6} 8 2/6 Thus B = (A N B) U (ACN B), and, by probability 

{5, 6} 11 1/6 axiom 3, P(B) = P(A N B) + P(A‘ N B). Therefore, 


P(A‘ N B) = P(B) — P(A N B). By definition of 


So the expected value is conditional probability, it follows that 


EA O 1 . P(A AN B P(B)—-P(ANB 
beg eoe 2 itt=73, pepe e T A 
P(B) P(B) 
19. The following table displays the sum of the numbers PAN B) 
showing face up on the dice: PB) = 1-—P(A|B). 
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5. Hints: (1) A = (A N B) U (A N B®) 
P(A) — P(A|B)P(B) 


1— P(B) 


(2) The answer is P(A | B°) = 


- Let R, be the probability that the first ball is red, 
and let R, be the probability that the second ball is 
red. Then R, is the probability that the first ball is 
not red, and Ry is the probability that the second 
ball is not red. The tree diagram shows the various 
relations among the probabilities. 


Then 
P(R, N Ro) = P(R3 |R): P(R;) 
= a = 2 = 38.5% 
13 8 13 


P(R, N RS) = P(RF |R): P(Ri) 
E 5 5 25 = 24%, 
=i 8 104 
P(R i N Ro) = P(Ro |R i) -P(RE) 
25 3 25 
= = — = yh, 
39 8 104 
P(RY ORS) = P(RS | RE): PCRS) 
14 3 14 


=== 13, 
308 14 


So the probability that both balls are red is 5/13, the 
probability that the first ball is red and the second 
is not is 25/104, the probability that the first ball is 
not red and the second ball is red is 25/104, and the 
probability that neither ball is red is 14/104. 

b. Note that 


Ry = (Ry N R) U(R2N RY) and 
Thus the probability that the second ball is red is 


P(Ro) = PR N Ri) + PR OR) 
5 25 65 
+ = 62.5%. 
13 104 104 Saa 
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c. If exactly one ball is red, then either the first ball is 
red and the second is not or the first ball is not red 
and the second is red, and these possibilities are 
mutually exclusive. Thus 


P(exactly one ball is red) = P(R; N Ry) + P(R A Rp) 


025 25 50 
104 104 104 
= = 48.1%. 
52 


The cag that both balls are red is 
P(R N R) = Š = 38.5%. Then 


P(at least one ball is red) = P(exactly one ball is red) 


+ P(both balls are red) 


25.5 
= — + — 
52 13 
45 
“5 = 86.5% 


- Let W; be the event that a woman is chosen on the 
first draw, 


W, be the event that a woman is chosen on the 
second draw, 


M, be the event that a man is chosen on the first 
draw, 


M, be the event that a man is chosen on the second 
draw. 


Then P(W,) = jp and P(W>|W,) = å, and thus 


P(W, N W) = P(W2|W,)P(W,) = $- = 7s = 65%. 


c. Hint: The answer is i = 463%. 


P(B, A) 


PCA) and that 


9. Hint: Use the facts that P(B,|A) = 
(AN B,))UANB)=A 


11. a. Let U, be the event that the first urn is chosen, U, 
the event that the second urn is chosen, and B the 
event that the chosen ball is blue. Then 


and P(B|U;) = es 


P(B|U,) = l 
(B|U}) . 
P(BNU,) = P@|U)PW) = = = 
wa {e  - 3e 

Also 
PAN U) = Pela 
? UELUT TA SA 
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13. 


14. 


16. 


17. 


19. 


Now B is the disjoint union of B N U, and B N U}. 
So 


P(B)P(U,|B) = P(B N U,) + P(BN Up) 


12 8 
=—+— = 46.4%. 
38 54 Á 
Thus the probability that the chosen ball is blue is 
approximately 46.4%. 


b. Given that the chosen ball is blue, the probability 
that it came from the first urn is P(U,|B). By Bayes’ 
theorem and the computations in part (a), 

P(B| UP) 
~ P(B|U,)PU) + P(B| UPU) 
7 (12/19)(0.5) 
(12/19)(0.5) + (8/27)(0.5) 


P(U,|B) 


= 68.1% 


Hint: The answers to parts (a) and (b) are approximately 
52.9% and 54.0%, respectively. 


Let A be the event that a randomly chosen person tests 
positive for drugs, let B, be the event that a randomly 
chosen person uses drugs, and let B be the event that 

a randomly chosen person does not use drugs. Then 

A“ is the event that a randomly chosen person does not 
test positive for drugs, and P(B,) = 0.04, P(B,) = 0.96, 
P(A|B2) = 0.03, and P(A‘|B,) = 0.02. Hence 

P(A|B,) = 0.98 and P(A‘|B3) = 0.97. 


P(A|B,)P(B)) 


a. P(B,|A) = 
P(A|B,)P(B;) + P(A|Bo)P(Bo) 
o (0.97)(0.04) = 576% 
(0.97)(0.04) + (0.03)(0.96) 
P(A‘|B>)P(B 
b. P(B;|A°) = (A°|B>)P(B) 


P(A‘|B,)P(B,) + P(A‘|By)P(Bo) 


E (0.98)(0.96) 
(0.02)(0.04) + (0.98)(0.96) 


= 99.9% 


Hint: The answers to parts (a) and (b) are 11.25% and 
214%, respectively. 


Proof: Suppose A and B are events in a sample space 
S, and P(A |B) = P(A) # 0. Then 


P(B NA) _ P(A|B)P(B) 


P(B|A) = 


P(A) P(A) 
_ POA)P(B) _ te 
P(A) 


As in Example 6.9.1, the sample space is the set of all 
36 outcomes obtained from rolling the two dice and 
noting the numbers showing face up on each. Let A be 
the event that the number on the blue die is 2 and B the 
event that the number on the gray die is 4 or 5. Then 
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A = {21, 22, 23, 24, 25, 26}, 
B = {14, 24, 34, 44, 54, 64, 15, 25, 35, 45, 55, 65}, and 
A N B = {24, 25}. 
Since the dice are fair (so all outcomes are equally likely), 
P(A) = £, P(B) = %, and P(A N B) = &. By definition 


of conditional probability, 


2 
pajni PANB) 36 1 P 
Pe 12 6 ™ 

36 

2 

P(ANB) 36 

BBIA PA 6 3 
36 


Now P(A)=%=2 and P(B)=łŻ=Ż, and hence 
P(A|B) = P(A) and P(B|A) = P(B). 


Let A be the event that the student answers the first 
question correctly, and let B be the event that the 
student answers the second question correctly. Because 
two choices can be eliminated on the first question, 
P(A)= i, and because no choices can be eliminated 
on the second question, P(B) = L Thus P(A‘) = A and 
PB°) =$. 
a. Hint: The probability that the student answers both 

questions correctly is 

P(A N B) = P(A)P(B A = 62% 
( ) = P(A)P(B) 3515 03% 

b. The probability that the student answers exactly one 

question correctly is 


P(A N B®) U (ASN B)) 
= P(A N B°) + P(4° N B) 
= P(A)P(B°) + P(A°)P(B) 
14 21 6 2 
35 35 15 5 


40%. 


c. One solution is to say that the probability that 
the student answers both questions incorrectly is 
P(A® N B®), and P(AS N B°) = P(A‘)P(B‘) by the 
result of exercise 22. Thus the answer is 


24 8 
35 15 


Another solution uses the fact that the event that 
the student answers both questions incorrectly 
is the complement of the event that the student 
answers at least one question correctly. Thus, 
by the results of parts (a) and (b), the answer is 


1-(h +3) = = ahs. 


P(AYP(B*) = 


= 531% 
ge 


25. Let H; be the event that the result of toss i is heads, and 
let T; be the event that the result of toss 7 is tails. Then 
P(H;) = 0.7 and P(T) = 0.3 for i = 1 and 2. 


b. 


The probability of obtaining exactly one head is 
P(A, N T) U (Ti N Ap) 

= P(A, N Tə) + P(T; M H3) 

= P(H P(T) + P(T,)P (Ad) 

= (0.7)(0.3) + (0.3)(0.7) = 42%. 


27. Hint: The answer is F, 


28. a. 


29. a. 


30. b. 


P(seven heads) 


the number of different 
ways seven heads can 
be obtained in ten tosses 


120(0.7)'(0.3)° = 0.267 = 26.7% 


(0.7)'(0.3)° 


P(none is defective) 


the number of different 
ways of having 0 defective (0.03)°(0.97)'° 
items in the sample of 10 


= 1-(0.3)°(0.97)'° = 0.737 = 73.7% 


ll 


The probability that a woman will have at least one 
false positive result over a period of ten years is 
1 — (0.96) = 33.5%. 


. P(none is male) = 1.3% 
. P(at least one is male) = 


1 — P(none is male) = 1 — 0.013 = 98.7% 


34. Hint: PY) = PY N X) +P N X’) 


SECTION 10.1 


1. a. 


b. 
c. 


trail (no repeated edge), not a path (has a repeated 
vertex, vı), not a circuit 

walk, not a trail (has a repeated edge, eo), not a circuit 
closed walk (starts and ends at the same vertex), 
trail (no repeated edge since no edge), not a path or 
a circuit (since no edge) 

circuit, not a simple circuit (repeated vertex, v4) 
closed walk (starts and ends at the same vertex but 
has repeated edges, {v2, v3} and {v3, v4}) 

path 


- No. The notation vvv; could equally well refer to 


V1€1V2€2V1 OF tO v1€2V2€1V1, Which are different walks. 


© Three. (There are three ways to choose the middle 


edge.) 
3!+3 = 9 (The three paths from part (a) are also 
trails, and there are an additional 3! trails with 


10.1 


23. 
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vertices V1, V2, V3, V2, V3, V4. The reason is that from 
v2 there are 3 choices of an edge to go to v3, then 

2 choices of a different edge to go back to v}, and 
then 1 choice of a different edge to return to v3. 

c. Infinitely many. (Since a walk may have repeated 
edges, a walk from v; to v4 may contain an arbi- 
trarily large number of repetitions of edges joining 
a pair of vertices along the way.) 


. a. {vi V3}, {Vo, v3}, {V4 v3}, and {vs, v3} are all the 


bridges. 


- a. Three connected components, as shown in the next 


column. 
b E 


ë ee f h 


d 
1 2 3 


a. No. This graph has two vertices of odd degree, 
whereas all vertices of a graph with an Euler circuit 
have even degree. 


© One Euler circuit is e4e5e5e3€2€7€ ge}. 
- One Euler circuit is iabihbchgcdgfdefi. 


. There is an Euler trail since deg(u) and deg(w) are odd, 


all other vertices have positive even degree, and the 
graph is connected. One Euler trail is uv;v9v7Uv2V3V4V2 
V6VaWV5VeW. 


a. The nonempty subgraphs are as follows: 


U2 V vi 


Uy Vi vg 
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24. a. 


26. 


27. 


29 


31. 
32. 


34. 
38 


39. 


v V3 


b. A 


D 


Hint: Consider the graph obtained by taking the verti- 
ces and edges of G plus all the edges of G”. 


VOV7V1V2V3V5V6V0 
Hint: See the solution to Example 10.1.9. 


Here is one sequence of reasoning you could use: Call 
the given graph G, and suppose G has a Hamiltonian 
circuit. Then G has a subgraph H that satisfies conditions 
(1)-(4) of Proposition 10.1.6. Since the degree of b in G is 
4 and every vertex in H has degree 2, two edges incident 
on b must be removed from G to create H. Edge {a, b} 
cannot be removed because doing so would result in 
vertex d having degree less than 2 in H. Similar reason- 
ing shows that edge {b, c} cannot be removed either. So 
edges {b, i} and {b, e} must be removed from G to create 
H. Because vertex e must have degree 2 in H and be- 
cause edge {b, e} is not in H, both edges {e, d} and {e, f} 
must be in H. Similarly, since both vertices c and g must 
have degree 2 in H, edges {c, d} and {g, d} must also be 
in H. But then three edges incident on d, namely, {e, d}, 
{c, d}, and {g, d}, must all be in H, which contradicts the 
fact that vertex d must have degree 2 in H. 


Hint: This graph does not have a Hamiltonian circuit. 


Partial answer: 


vo v 


v4 uş 


This graph has an Euler circuit vovıvv3vıv4vo but no 
Hamiltonian circuit. 


Partial answer: 
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This graph has a Hamiltonian circuit vovyv2v9 but no 
Euler circuit. 


40. Partial answer: 


4 


-_ 


VY 


Vo v3 


The walk vọvıv2vo is both an Euler circuit and a Hamil- 
tonian circuit for this graph. 


Partial answer: 


vo 


This graph has the Euler circuit e,;e,e3e4e5e6 and the 
Hamiltonian circuit vgv,v2v3V9. These are not the same. 


43. a. Proof: Suppose G is a graph and W is a walk in G 


that contains a repeated edge e. Let v and w be the 
endpoints of e. In case v = w, then v is a repeated 
vertex of W. In case v # w, then one of the follow- 
ing must occur: (1) W contains two copies of vew 
or of wev (for instance, W might contain a section 
of the form vewe'vew, as illustrated below); (2) W 
contains separate sections of the form vew and wev 
(for instance, W might contain a section of the form 
vewe'wev, as illustrated below); or (3) W contains a 
section of the form vewev or of the form wevew (as 
illustrated below). In cases (1) and (2), both vertices 
v and w are repeated, and in case (3), one of v or w is 
repeated. In all cases, there is at least one vertex in 
W that is repeated. 


£ e' 
e w 
= 
d) (2) 
e 
vV e—Y—e W 


(3) 


44. Proof: Suppose G is a connected graph and v and w 


are any particular but arbitrarily chosen vertices of 

G. [We must show that u and v can be connected by a 
path.] Since G is connected, there is a walk from v to 
w. If the walk contains a repeated vertex, then delete 
the portion of the walk from the first occurrence of the 
vertex to its next occurrence. (For example, in the walk 
Ve 1V2e5V76V2e3W, the vertex v occurs twice. Deleting 
the portion of the walk from one occurrence to the next 


46. 


48 


50 


51. 


gives ve;v2e3w.) If the resulting walk still contains a 
repeated vertex, do the above deletion process another 
time. Then check again for a repeated vertex. Continue 
in this way until all repeated vertices have been deleted. 
(This must occur eventually, since the total number of 
vertices is finite.) The resulting walk connects v to w 
but has no repeated vertex. By exercise 43(b), it has no 
repeated edge either. Hence it is a path from v to w. 


The graph below contains a circuit, any edge of which 
can be removed without disconnecting the graph. For 
instance, if edge e is removed, then the following walk 
can be used to go from v; to v9: vjV5V3V2. 

vs 


l V3 


vo v4 


Us 


Hint: Look at the answer to exercise 46 and use the fact 
that all graphs have a finite number of edges. 


Proof: Let G be a connected graph and let C be a circuit 
in G. Let G’ be the subgraph obtained by removing 

all the edges of C from G and also any vertices that 
become isolated when the edges of C are removed. /We 
must show that there exists a vertex v such that v is in both 
C and G'.] Pick any vertex v of C and any vertex w of 
G”. Since G is connected, there is a path from v to w 
(by Lemma 10.1.1(a)): 


V = Voe1V1e2V2 . . - Vj—1eViCj+1Vi4-1 - -< Vn—1€nVn 7 W. 
inC inC notin C in G’ 


Let i be the largest subscript such that v; is in C. If i = n, 
then v, =w is in C and also in G’, and we are done. 
If i< n, then v; is in C and v;+, is not in C. This im- 
plies that e; +, is not in C (for if it were, both endpoints 
would be in C by definition of circuit). Hence when G' 
is formed by removing the edges and resulting isolated 
vertices from G, then e;,, is not removed. That means 
that v; does not become an isolated vertex, so v; is not 
removed either. Hence v; is in G’. Consequently, v; is in 
both C and G’ [as was to be shown]. 


Proof: Suppose G is a graph with an Euler circuit. If G 
has only one vertex, then G is automatically connected. 
If v and w are any two vertices of G, then v and w each 
appear at least once in the Euler circuit (since an Euler 
circuit contains every vertex of the graph). The section 
of the circuit between the first occurrence of one of v or 
w and the first occurrence of the other is a walk from 
one of the two vertices to the other. Since the choice of 
v and w was arbitrary, given any two vertices in G there 
is a walk from one to the other. So, by definition, G 

is connected. 


10.2 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


56. 
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b. Hint: Divide the proof into three parts. (1) Show 
that if G is any graph containing a closed walk with 
an odd number of edges, then G contains a circuit 
with an odd number of edges. (2) Show that if G is 
any connected graph that does not have a circuit 
with an odd number of edges, then G is bipartite. 
(3) Show that if G is any graph with at least two 
vertices and is such that G does not have a circuit 
with an odd number of edges, then G is bipartite. 


SECTION 10.2 


1. 


5. 


6. 


a. Equating corresponding entries shows that 


a+b=1 
a—c=0 
c=-l1 
b-a =3. 
Thus a—c =a-(—1)=0, and so a= —1. Conse- 


quently, a+b = (—1)+b = 1, and hence b = 2. The 
last equation should be checked to make sure the answer 
is consistent: b — a = 2 — (—1) = 3, which agrees. 


vı v2 V3 
y |O 1 1 
av» |1 0 0 
v» LO 0 0 
a. 
vi 
vi 
Any labels may be 
applied to the edges 
because the adjacency 
matrix does not 
determine edge labels. 
Y3 
v4 
vV vn V3) Vy v n V3 Vy 
vO O 1 1 vyfO 1 1 
vO 0 2 0 v}l O 1 1 
a. c. 
vy,f1 2 0 0 vz}1 1 0 1 
v L1 0 O 1 vyL1 1 1 0 
a. 
Any labels may 
be applied to the 
vi U2 edges because the 
adjacency matrix 
does not determine 
edge labels. 
U3 


a. The graph is connected. 


a. 2-1+(-1):3 = -1 
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10. 


12. 


14. 


15. 


17. 


3 -3 12 a7 ° 
1 -5 2 “10 7 


a. No product. (A has three columns, and B has 


two rows.) 
—2 =2 2 
b. BA = 
É =>. 2 
4 
f. w=] o| 
1 9 


p agal A m 
` =5 = 
One among many possible examples is 


ee H 
0 0 


Hint: If the entries of the m X m identity matrix are 
0O ifi#k 
denoted by 6,;, then 6;, = i 
Pn ran a f ifi=k. 
The ijth entry of IA is X ô;zA;;- 


k=1 


Proof: Suppose A is an m X m symmetric matrix. Then 
for all integers i and j with 1 =i, j S m, 
(A’)i; = > AixAgj and (A) ji = WA Au 
k=1 k=1 

But since A is symmetric, Aj, = Az; and Ay; = Aj, for 
all i, j, and k, and thus Aj,Ay; = AjpAg; [by the com- 
mutative law for multiplication of real numbers]. Hence 
(A’);; = (A) ji for all integers i andj with 1 =i, j S m. 


Proof (by mathematical induction): Let the property 
P(n) be the equation A”A = AA”. 

Show that P(1) is true: 

We must show that A'A = AA!. But this is true because 
A' = A and AA = AA. 

Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 

Let k be any integer such that k = 1, and suppose that 
A‘A = AA“. [This is the inductive hypothesis.] We must 
show that A“*'A = AA**! But 


ATA = (AAA 


by definition of matrix power 


= A(A‘A) by exercise 16 
= A(AA‘) by inductive hypothesis 
= AA‘! by definition of matrix power. 
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11 2]f1 1 2 6 3 3 
19. a A°=|1 0 1]]/1 0 1)/=]3 2 2 
2 1 ojf2 1 0 3 2 5 
1 1 2|[6 3 3 15 9 15 
A°=|1 0 1]/3 2 2ļ|=| 95 8 
2 1 ojl32 5 15 8 8 
20. a. 2 since (A); = 2 


23. 


b. 3 since (A’)34 =3 
c. 6since (A) 14 =6 
d. 17 since (Aĉ); = 17 


. b. Hint: If G is bipartite, then its vertices can be par- 


titioned into two sets V} and V, so that no vertices 
in V; are connected to each other by an edge and 
no vertices in V are connected to each other by an 
edge. Label the vertices in V; as vj, v2,..., vg and 
label the vertices in Vy as Vk+1, Vk+2 -© +s Vne Now 
look at the matrix of G formed according to the 
given vertex labeling. 


b. Hint: Consider the ijth entry of 
A+A? +A? + +A” 


If G is connected, then given the vertices v; and v;, 
there is a walk connecting v; and v,. If this walk has 
length k, then by Theorem 10.2.2, the ijth entry of 
A‘ is not equal to 0. Use the facts that all entries of 
each power of A are nonnegative and that a sum of 
nonnegative numbers is positive provided that at 
least one of the numbers is positive. 


SECTION 10.3 


1. 


2. 


The graphs are isomorphic. One way to define the 
isomorphism is as follows: 


The graphs are not isomorphic. G has five vertices and 
G' has six. 


6. The graphs are isomorphic. One isomorphism is the 


8. The graphs are not isomorphic. G has a simple circuit 


10. The graphs are isomorphic. One way to define the 


12. The graphs are isomorphic. One isomorphism is 


14. 


following: 


ee, 


> 


of length 3; G’ does not. 


isomorphism is as follows: 


the following: 


e 

e e e 
1 2 
3 4 
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lol P 
O R 


18. Hint: There are 20. 
19. 


NL 


Q 
EHI 

Q Q 
Yo 


V0 
wy 


wD 
enone 


10 


11 


21. Proof: Suppose G and G’ are isomorphic graphs and G 
has n vertices, where n is a nonnegative integer. 
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23. 


25 


27. 


[We must show that G' has n vertices.] By definition of 
graph isomorphism, there is a one-to-one correspon- 
dence g: V(G) — V(G’) sending vertices of G to verti- 
ces of G’. Since V(G) is a finite set and g is a one-to-one 
correspondence, the number of vertices in V(G’) equals 
the number of vertices in V(G). Hence G’ has n vertices 
[as was to be shown]. 


Proof: Suppose G and G” are isomorphic graphs and 
suppose G has a circuit C of length k, where k is a 
nonnegative integer. Let C be vey eo . . . eg¥y(=Vo). BY 
definition of graph isomorphism, there are one-to-one 
correspondences g: V(G) —> V(G’) and h: E(G) — E(G') 
that preserve the edge-endpoint functions in the sense that 
for each v in VG) and each e in E(G), v is an endpoint 

of e = g(v) is an endpoint of h(e). Let C’ be g(vp)h(e)) 
gv he)... . hed gv,(=g(Vo)). Then C’ is a circuit of 
length k in G’. The reasons are that (1) because g and 

h preserve the edge-endpoint functions, both g(v;) and 
g(v;+4) are incident on h(e;+1) foreachi=0,1,... , 
k—1, and so C' is a walk from g(vo) to g(vọ), and (2) since 
Cis a circuit, then e}, e2, . . . , e, are distinct, and since h 

is a one-to-one correspondence, h(e,), h(é2), . . . , h(e,) are 
also distinct, which implies that C’ has k distinct edges. 
Therefore, G’ has a circuit C of length k. 


Hint: Suppose G and G” are isomorphic and G has m 
vertices of degree k; call them v}, v2,..., Vm. Since G 
and G” are isomorphic, there are one-to-one correspon- 
dences g: V(G) — V(G’) and h: E(G) — E(G’). Show 
that g(v,), g(v2),... , g(V,,) are m distinct vertices of G”, 
each of which has degree k. 


Hint: Suppose G and G” are isomorphic and G is con- 
nected. To show that G’ is connected, suppose w and 
x are any two vertices of G’. Show that there is a walk 
connecting w with x by finding a walk connecting the 
corresponding vertices in G. 


SECTION 10.4 


1. 
2. 


ee | 


<article> <adjective> <noun> 


the young 


a. Math 110 


<sentence > 


A 


< noun phrase > < verb phrase > 


we N 


<verb> < noun phrase > 


ball caught < article> < noun > 


the man 
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APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Hint: The answer is 2n — 2. To obtain this result, use 
the relationship between the total degree of a graph and 
the number of edges of the graph. 


= a 
H C C C H 
ked 


d. Hint: Each carbon atom in G is bonded to four other 
atoms in G, because otherwise an additional hy- 
drogen atom could be bonded to it, and this would 
contradict the assumption that G has the maximum 
number of hydrogen atoms for its number of carbon 
atoms. Also each hydrogen atom is bonded to 
exactly one carbon atom in G, because otherwise G 
would not be connected. 


Hint: Revise the algorithm given in the proof of Lemma 
10.4.1 to keep track of which vertex and edge were cho- 
sen in step 1 (by, say, labeling them vp and eọ). Then after 
one vertex of degree 1 is found, return to vg and search 
for another vertex of degree 1 by moving along a path 
outward from vo starting with another edge incident on 
Vo. Such an edge exists because vo has degree at least 2. 


a. Internal (or branch) vertices: v2, v3, V4, V6 


Leaves (or terminal vertices: v1, Vs, V7 


Any tree with nine vertices has eight edges, not nine. 
Thus there is no tree with nine vertices and nine edges. 


One such graph is 


h g f 


One such graph is 
b (a d e 
a < 
i h 8 f 
There is no tree with six vertices and a total degree of 
14. Any tree with six vertices has five edges and hence, 


by the handshake theorem (Theorem 4.9.1) it has a total 
degree of 10, not 14. 


One such tree is shown. 
a 


a) 
o 


10.5 SOLUTIONS AND HINTS TO SELECTED EXERCISES A-118 


13. No such graph exists. By Theorem 10.4.4, a connected 3. a. © 
graph with six vertices and five edges is a tree. Hence 


such a graph cannot have a nontrivial circuit. 
14. Q) 
; oOo pe 
vı L2) 


22. Yes. Since it is connected and has 12 vertices and (4) (e) 
11 edges, by Theorem 10.4.4 it is a tree. It follows from 
Lemma 10.5.1 that it has vertex of degree 1. Exercises 4 and 8-10 have other answers in addition to the 
A A ones shown. 
25. Suppose there were a connected graph with eight 


vertices and six edges. Either the graph itself would be 4. 
a tree or edges could be eliminated from its circuits to 

obtain a tree. In either case, there would be a tree with 

eight vertices and six or fewer edges. But by Theorem 

10.4.2, a tree with eight vertices has seven edges, not 

six or fewer. This contradiction shows that the supposi- 

tion is false, so there is no connected graph with eight 

vertices and six edges. 


26. Hint: See the answer to exercise 25. ; : . . . 
5. There is no full binary tree with the given properties 
27. Yes. Suppose G is a circuit-free graph with ten vertices because any full binary tree with five internal vertices 
and nine edges. Let G;, G3, . . . , Gy be the connected has six leaves, not seven. 


components of G. [To show that G is connected, we i p . N 
6. Any full binary tree with four internal vertices has five 


leaves for a total of nine—not seven—vertices in all. 
Thus there is no full binary tree with the given properties. 


will show that k = 1.] Each G; is a tree since each G; is 
connected and circuit-free. For each i = 1, 2,...,k, let 
G; have n; vertices. Note that since G has ten vertices 


in all, 7. There is no full binary tree with 12 vertices because 
ny +n +n, = 10. any full binary tree has 2k + 1 vertices, where k is the 
By Theorem 10.4.2 number of internal vertices. But 2k + 1 is always odd, 


and 12 is even. 
G, has nı — 1 edges, 


Gə has m — 1 edges, 


G; has ng — 1 edges. 


So the number of edges of G equals 


Gy = Dy a1 iai) 
= (mtt ohn) mse eb 1) 
ye 
= 10—k. kts 


But we are given that G has nine edges. Hence 10 — k = 9, 
and so k = 1. Thus G has just one connected component, 
G4, and so G is connected. 


28. Hint: See the answer to exercise 27 and the proof of 
Corollary 10.4.5. 


31. b. Hint: There are six. 


SECTION 10.5 


I a. 3 b 0 c5 d u,v 
e d f. k,l g. m,s,t,x,y h. 12 
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3. One of many spanning trees is as follows: 


b 


h ij k 


5. Minimum spanning tree: 
11. There is no binary tree that has height 3 and nine leaves be- 


cause any binary tree of height 3 has at most 2? = 8 leaves. 


20. a. The height of the tree = log, 25 = 4.6. So since the 
height of any tree is an integer, the height of this 
tree must be at least 5. 


Order of adding the edges: 
{a, b}, {ef}, {e, d}, {d, c}, {gf}, {b, c} 


7. Minimum spanning tree: same as in exercise 5 Order of 
adding the edges: 


{a, b}, {b, c}, {c, d}, {d, e}, {e, f}, {f 8} 


9. There are four minimum spanning trees: 


extraordinary 


SECTION 10 6 When Prim’s algorithm is used, edges are added in any 
7 of the orders obtained by following one of the eight paths 

1. from left to right across the diagram below. 

b a b a b 
{a,b} {ae}  {e,f} {b, c} 
{f 8} 
{c, d} 
d c d c d c 


{a,e} {a,b} {a, g} {e, c} 
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When Kruskal’s algorithm is used, edges are added in any of the orders obtained by following one of the eight paths from 
left to right across the diagram below. 


{a,b} {ae} {a,g} {b,c} 


{ae} {a,b} {ef}  {e,c} 


12. Let N = Nashville, S = St. Louis, Lv = Louisville, Ch = Chicago, Cn = Cincinnati, D = Detroit, Mw = Milwaukee, 
and Mn = Minneapolis. 


Step | V(T) E(T) F 

O {N} Ø {N} 

1 {N} Ø {Lv, Mn} 

2 |{N, Lv} {{N, Lv}} {Mn, S, Cn, Ch, D, Mw} 
3 {N, Lv, Cn} {{N, Lv}, {Lv, Cn}} {Mn, S, Ch, D, Mw} 
4 {N, Lv, Cn, S} {{N, Lv}, {Lv, Cn}, {Lv, S}} {Mn, Ch, D, Mw} 

5 {N, Lv, Cn, S, Ch} {{N, Lv}, {Lv, Cn}, {Lv, S}, {Lv, Ch}} {Mn, D, Mw} 

6 {N, Lv, Cn, S, Ch, D} {{N, Lv}, {Lv, Cn}, {Lv, S}, {Lv, Ch}{Lv, D}} {Mn, Mw} 

7 — | {N, Lv, Cn, S, Ch, D, Mw} {{N, Lv}, {Lv, Cn}, {Lv, S}, {Lv, Ch} {Lv, D}, (Ch, Mw}} {Mn} 

8 | {N, Lv, Cn, S, Ch, D, Mw, Mn} | { {N, Lv}, {Lv, Cn}, {Lv, S}, {Lv, Ch}, {Lv, D}, (Ch, Mw}, {N, Mn}} 


Step L(N) L(S) L(Ly) L(Cn) L(Ch) L(D) L(Mw) | L(Mn) 
0 0 o0 o0 foo) 00 o0 oe) fos) 

1 0 o) 243 o oo oo oo 1,118 
2 0 633 243 377 676 736 803 1,118 
3 0 633 243 377 676 736 803 1,118 
4 0 633 243 3T 676 736 803 1,118 
5 0 633 243 371 676 736 795 1,118 
6 0 633 243 377 676 736 795 1,118 
7 0 633 243 377 676 736 795 1,118 


Thus the shortest path from Nashville to Minneapolis has length L(Mn) = 1,118 kilometers. 


In step 2 D(Lv) = N, in step 3 D(Cn) = Ly, in step 4 D(S) = Cn, in step 5 D(Ch) = Ly, in step 6 D(D) = Ly, in step 7 
D(Mw) = Ch, and in step 8 D(Mn) = N. Tracing backwards from Mn gives D(Mn) = N, which is the starting point. So the 
shortest path is the direct route from Nashville to Minneapolis, without any intermediary stops. 


13. | Step | VT) E(T) F |L@|L®|LO|L@|LO|L@ 
0 {a} if) {a} 0 œ 00 00 œ œ 
1 {a} Ø {b, d} 0] 2] a0] 1 J] © | 
2 {a, d} {{a, d}} {b,c,e} | O | 2 | 6 1 | 11] © 
3 {a, b, d} {{a, d}, {a, b}} {ce} | O } 2] 5 | 1] 64] 
4 {a, b, c, d} {{a, d}, {a, b}, {b, c}} {e, z} 0 2 5 1 6 | 13 
5 {a, b, c, d, e} {{a, d}, {a, b}, {b, c}, {c, e}} {z} 0 2 5 1 6 8 
6 {a, b, C, d, e,z} {{a, d}, {a, b}, {b, ch, {e, e}, {e, z}} 
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Thus the shortest path from a to z has length L(z) = 8. 
In step 2 D(d) = a, instep 3 D(b) = b, instep 4 D(c) = b, 
in step 5 D(e) = c, and in step 6 D(z) = e. Tracing back- 
wards from z gives D(z) = e, D(e) = c, D(c) = b, and 
D(b) = a. So the shortest path from a to z is abcez. 


18. b. Proof: Suppose not. Suppose that for some tree T, 


u and v are distinct vertices of T, and P} and P, are 
two distinct paths joining u and v. [We must deduce 
a contradiction. In fact, we will show that T contains 
acircuit.] Let P, be denoted u = vo, vj, V2,.-.- 5 

Vm = v, and let P, be denoted u = wo, W1, W25... , 
w, = v. Because P} and P, are distinct, and T has 
no parallel edges, the sequence of vertices in P4 
must diverge from the sequence of vertices in P3 
at some point. Let i be the least integer such that 
v; # w;. Then v;_; = w;— 1. Let j and k be the least 
integers greater than i so that v; = wy. (There must 
be such integers because v,, = w, = v.) Then 


Vi—1ViVi+1 + V(5WE)Wk -1 - - WW; 1(=Vi-1) 


is a circuit in T. The existence of such a circuit con- 
tradicts the fact that T is a tree. Hence the supposition 
must be false. That is, given any tree with vertices u 
and v, there is a unique path joining u and w. 


20. Proof: Suppose G is a connected graph, T is a circuit- 


free subgraph of G, and if any edge e of G not in T is 
added to T, the resulting graph contains a circuit. Sup- 
pose that T is not a spanning tree for G. [We must derive 
a contradiction.] 


Case 1 (T is not connected): In this case, there are ver- 
tices u and v in T such that there is no walk in T from 
u to v. Now, since G is connected, there is a walk in G 
from u to v, and hence, by Lemma 10.2.1, there is a path 
in G from u to v. Let e}, e2, . . . , e, be the edges of this 
path that are not in T. When these edges are added to T, 
the result is a graph T’ in which u and v are connected 
by a path. In addition, by hypothesis, each of the edges 
e; creates a circuit when added to T. Now remove these 
edges one by one from 7’. By the same argument used in 
the proof of Lemma 10.5.3, each such removal leaves u 
and v connected since each e;is an edge of a circuit when 
added to T. Hence, after all the e; have been removed, u 
and v remain connected. But this contradicts the fact that 
there is no walk in T from u to v. 


Case 2 (T is connected): In this case, since T is not a 
spanning tree and T is circuit-free, there is a vertex v in 
G such that v is not in T. [For if T were connected, circuit- 
free, and contained every vertex in G, then T would be a 
spanning tree for G.] Since G is connected, v is not iso- 
lated. Thus there is an edge e in G with v as an endpoint. 
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Let T' be the graph obtained from T by adding e and v. 
[Note that e is not already in T because if it were, its end- 
point v would also be in T and it is not.] Then T’ contains 
a circuit because, by hypothesis, addition of any edge to 
T creates a circuit. Also T’ is connected because Tis and 
because when e is added to T, e becomes part of a circuit 
in T’. Now deletion of an edge from a circuit does not 
disconnect a graph, so if e is deleted from T’ the result 
is a connected graph. But the resulting graph contains v, 
which means that there is an edge in T connecting v to 
another vertex of T. This implies that v is in T [because 
both endpoints of any edge in a graph must be part of the 
vertex set of the graph], which contradicts the fact that v 
is not in T. 


Thus, in either case, the supposition that T is not a span- 
ning tree leads to a contradiction. Hence the supposition 
is false, and T is a spanning tree for G. 


a. No. Counterexample: Let G be the following graph. 
Ej 
vi <_- O Uy 
e2 
Then G has the spanning trees shown below. 


ei 


n Pa Tian la 


e? 


Hint: Suppose e is contained in every spanning tree 

of G and the graph obtained by removing e from G is 
connected. Let G’ be the subgraph of G obtained by 
removing e, and let T’ be a spanning tree for G’. How is 
T' related to G? 


Proof: Suppose that w(e’) > w(e). Form a new graph 

T' by adding e to T and deleting e’. By exercise 20, 
addng an edge to a spanning tree creates a circuit, 

and by Lemma 10.5.3, deleting an edge from a circuit 
does not disconnect a graph. Consequently, T’ is also 

a spanning tree for G. Furthermore, w(T') < w(T) 
because w(T') = w(T) — w(e’) + w(e) = w(T) — (w(e') 
w(e)) < w(T) [since w(e') > w(e), which implies that 
w(e’) — w(e) > 0]. But this contradicts the fact that Tis a 
minimum spanning tree for G. Hence the supposition is 


false, and so w(e’) = w(e). 


25. Hint: Suppose e is an edge that has smaller weight than 


any other edge of G, and suppose T is a minimum span- 
ning tree for G that does not contain e. Create a new 
spanning tree T’ by adding e to T and removing another 
edge of T (which one?). Then w(7") < w(T). 
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26. Yes. Proof by contradiction: Suppose G is a weighted 5. YA 
graph in which all the weights of all the edges are dis- 6+ = 
tinct, and suppose G has two distinct minimum span- 
ning trees T, and 7. Let e be the edge of least weight 5ST 
that is in one of the trees but not the other. Without 4+ —o 
loss of generality, we may say that e is in T}. Add e to 
T, to obtain a graph G”. By exercise 19, G’ contains a al y=2lx] 
nontrivial circuit. At least one other edge f of this cir- 2y lo 
cuit is not in 7; because otherwise T, would contain the it 
complete circuit, which would contradict the fact that 
T, is a tree. Now f has weight greater than e because all .—t e a — tf e 
ee i ee z -4 -3 -2 -l 12 3 4 4 
edges have distinct weights, fis in T) and not in T}, and ajid 
e is the edge of least weight that is in one of the trees 
ao 


and not the other. Remove f from G’ to obtain a tree 
T;. Then w(T3) < w(T>) because T; is the same as T) -3 + 
except that it contains e rather than f and w(e) < w(f). 


. “ —o-4 + 
Consequently, T} is a spanning tree for G of smaller 
weight than 7,. This contradicts the supposition that 7, =F 
is a minimum spanning tree for G. Thus G cannot have A EAE 
more than one minimum spanning tree. 
28. The output will be a “minimum spanning forest” for y 
the graph. It will contain a minimum spanning tree for 6 
each connected component of the input graph. F ae 
SECTION 11.1 j = 
a 3 = y= [2x] 
1. a. f(0) is positive. <i 
b. f(x) = 0 when x = —2 and x = 3 (approximately) 2 
c. x; = —1 and x) = 2 (approximately) ip =< 
d. x = 1 or x = —3 (approximately) Eo on a Hr r 
e. increase Z 3 = 1 1 2 3 4 x 
f. decrease 
3. OVA =9 
—-3 
— 4 
-5 
-6 
The graphs show that 2|x] # |2x] for many values of x. 
6. YA 
3+ — 
0.5 1 1.5 2. = Graphofg, 2T ° 
When 0 < x < 1, x! < x4, When x > 1, x! > x4, a(x) = [x] 1+— 
a SE Ge EN a a 
zA 13) 22, 1 12 3 4% 
o— -1+ 
o— -2 + 
— -3+ 
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8. 
t | Fe) = 4"? $ 
0 0 4+ ° ° 
1 0 
2 3 si e e 
1 1 
2 + e e 
2 i <— Graph of h 
3 1 1+ e ° 
4 2 
+ + t t t t t t t > 
y 1 2 3 4 5 6 7 8 9 
4 Graph of F 14. fis increasing on the intervals 
3 Ss © {x E R|-3 <x < —2} and 
2 OO {x E R|O <x < 2.5}, and fis decreasing on 
1 —_4 {x E R|-2 <x < 0} and {x E R|2.5 <x <4} 
pot tt ttt tt tt tt approximately). 
1 4 9 12 16 * : ; 
15. Proof: Suppose that x, and x are particular but arbi- 
10 trarily chosen real numbers such that x; < x». [We must 
"n f(a) = |n| show that f(x,) < f@œ).] Since 
0 0 XS XQ 
1 1 then 2x1 < 2x7 
2 2 and I =3 < 23 
3 3 
by basic properties of inequalities. Thus, by definition 
= 1 off, 
-2 2 fœ) <fa) 
=A 3 [as was to be shown]. Hence f is increasing on the set of 
all real numbers. 
A 
17. a. Proof: Suppose x, and x, are real numbers with 
7 37 g . xı < X2 < 0. [We must show that h(x,) > h(x»).] 
° 2+ e taph ory Multiply both sides of xı < x» by xı to obtain 
(a) > xx [by T23 of Appendix A since x; < 0], and 
ey oR multiply both sides of x, < x by x, to obtain 
— ' — XxX > CAN [by T23 of Appendix A since x < Oj]. 
3 22. =] 1 2 3 By transitivity of order [Appendix A, T18] (x2)” < (x1), 
12. and so, by definition of h, h(x.) < h(x). 
n _|n 
h(n) = B 18. a. Preliminaries: If both x; and x, are positive, then 
by the rules for working with inequalities (see 
0 0 Appendix A), 
1 0 X% 7 1 Xx. 1 
2 1 < > (x 1) < x(x 1) 
x} x2 
3 1 by multiplying both sides by 
4 2 XX (which is positive) 
5 2 > XX — AX < XX X 
6 3 by multiplying out 
7 3 => -X, << -x, 
8 4 by subtracting xıx2 from both sides 
> XxX. > X 
9 4 by multiplying by— 1. 


Are these steps reversible? Yes! 
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Proof: Suppose that x, and x, are positive real numbers 


and xı < x2. [We must show that k(x,) < k(x).] Then 
Xp SX 

Scy =i by multiplying by— 1 
snn a < Amn Xi by adding xx, to both sides 
=> x(x, — 1) < xı(x2— 1) by factoring both sides 


y= a1 by dividing both sides by 


the positive number x)x, 


=> 
27. 


xX) X 


> k(x) < k) by definition of k. 


[This is what was to be shown.] 


19. Proof: Suppose f: R — R is increasing. [We must show 
that f is one-to-one. In other words, we must show that for 
all real numbers x, and xy, if x; # x, then f(x) = f@).] 
Suppose x, and x, are real numbers and x, # x2. By the 
trichotomy law [Appendix A, T17] x; < X2, Or x4 > Xp. 
In case x; < x, then since fis increasing, f(x,) < f(x) 
and so f(x,) # f(x2). Similarly, in case x; > x2, then 
F(x) > f (x2) and so f(x;) # f(x2). Thus in either case, 
F(x) # f) [as was to be shown]. 


21. a. Proof: Suppose u and v are nonnegative real num- 
bers with u < v. [We must show that f(u) < f(v).] 
Note that v = u +h for some positive real number A. 


By substitution and the binomial theorem, 


y” = (u Ae hy" 


u" + m ue At m ue pe ee 
1 2 
+( a uezia] 
m—1 


The bracketed sum is positive because u = 0 and h > 0, 
and a sum of nonnegative terms that includes at least one 
positive term is positive. Hence 


ll 


m m 


v” = u" + a positive number, 
and so f(u) = u” < v” = f(v) [as was to be shown]. 1. 
22. 
A 
> 
6 4. 
5. 
24. Proof: Suppose that fis a real-valued function of a real 6: 
variable, fis decreasing on a set S, and M is any positive 10. 


real number. /We must show that Mf is decreasing on 
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S. In other words, we must show that for all x, and xz 
in S, if xı < x then (Mf \(x1) > (Mf )(x2).] Suppose xı 
and x, are in S and xı < x». Since fis decreasing on S, 
F(x) > f(x), and since M is positive, Mf(x,) > Mf) 
[because when both sides of an inequality are multiplied 
by a positive number, the direction of the inequality is 
unchanged]. It follows by definition of Mf that 

(Mf)(x1) > (Mf)(2) [as was to be shown]. 


To find the answer algebraically, solve the equa- 

tion 2x? = x? + 10x+ 11 for x. Subtracting x? from 
both sides gives x” — 10x — 11 = 0, and either using 
the quadratic formula or factoring x -10x-11 = 
(x— 11)(x+ 1) gives x = 11 (since x > 0). To find an 
approximate answer with a graphing calculator, plot 
both f(x) = x? +10x+11 and 2g(x) = 2x? for x > 0, 
as shown in the figure, and find that 2g(x) > f(x) when 
x > 11 (approximately). You can obtain only an ap- 
proximate answer from a graphing calculator because 
the calculator computes values only to an accuracy of a 
finite number of decimal places. 


YA 


700 F 


600 + 
500 + 2g(x)= 2x? 
400 + 
300 + fx) =x24 10x +11 
200 + 


100 + 


18 20 * 


2 4 6 8 10 12 14 #16 


SECTION 11.2 


a. Formal version of negation: f(n) is not O(g(n)) if, 
and only if, V positive real numbers a and A, J an 
integer n = a such that Ag(n) > f(n). 

b. Informal version of negation: f(n) is not O(g(n)) if, 
and only if, no matter what positive real numbers a 
and A might be chosen, it is possible to find an in- 
teger n greater than or equal to a with the property 
that Ag(n) > f(n). 


n— | +1 is O(n) 


-|"]4+1is 0 
n > is O(n) 
3n(n— 2) is O(n’) 


a. For each integer n = 1, 
0 <2n’+15n+4 
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because all terms in 2n? + 15n + 4 are positive. 
Moreover, 
Qn? + 15n+4 < 2n? + 15n? + 4n? 
because when n = 1, 15n = 15n? 
and4 < 4n? 


= 21n? by combining like terms. 
Therefore, by transitivity of equality and order, 
0< 2n +15n+4 5 21n? 

b. For each integer n = 1, 
2n* < 2n?+ 15n+4 


for each integer n = 1. 


because 15n + 4 > 0 since n is positive. 
c. Sketch of graph 


y=2n?+15n+4 


y= 21n? 


d. Let A = 2 anda = 1. Then, by substitution from 
the result of part (b), 


An? <2n?+15n+4 for each integer n = a, 
and hence, by definition of Q-notation, 2n? + 15n+4 


is O(n’). Let B = 21 and 6 = 1. Then, by substitu- 
tion from the result of part (a), 

0 <2n?+15n+4= Bn? for each integer n = 6, 
and hence by definition of O-notation, Qn? + 15n+4 
is O(n”). 

e. Solution l: Let A = 2, B = 21, and k = 1. By the 
results of parts (a) and (b), 
An? = 2n? + 15n+4 = Br? 
and hence, by definition of ©-notation, on? + 15n+4 
is O(n’). 

Solution 2: By part (d), 2n? + 15n + 4 is both Qn’) 
and O(n’). Hence, by Theorem 11.2.1, 
2n? + 15n+4is Q(n). 


for each integer n = k, 


13. For each integer n = 1, 
5n? < 5n? + 65n+30 
because 65n + 30 > 0 since n is positive. Moreover, 


50? + 65n + 30 < 5n? + 650° + 300° 
because when n = 1, then 
65n < 65n? and 30 < 30n? 


= 1007? 


by combining like terms. 
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Therefore, by transitivity of order and equality, 
5n? < 5n? + 65n +30 < 100n’. 
Thus, let A = 5, B = 100, and k = 1. Then 
An? = 5n? +65n +30 = Bn? for each integer n = k, 
and hence, by definition of @-notation, 5n? + 65n+ 30 
is Oò). 


For each integer n = 1, 
1 
ns ae < n+l, 


and so |n +5| =n, by definition of floor, and [n+5| 
is nonnegative. In addition, when n=1, then 
n+1=n+n = 2n, and thus, by transitivity of equality 
and order, 


1 
nS [a+] = 2n. 
2 


Let A = 1, B = 2, and k = 1. Then 


1 
An = + a < Bn for every integer n = k, 


and hence, by definition of ®-notation, |n + 1) is O(n). 


Proof of Theorem 11.2.7(b): 

Suppose f and g are real-valued functions defined on the 
same set of nonnegative integers, suppose f(n) = 0 and 
g(n) = 0 for every integer n = r, where r is a positive 
real number, and suppose f(n) is O(g(n)). [We must show 
that g(n) is O( f(n).] By definition of @-notation, there 
exist positive real numbers A, B, and k with k = r such 
that for each integer n = k, 


Ag(n) = f(n) = Bg(n). 
Dividing the left-hand inequality by A and the right- 
hand inequality by B gives that 


1 1 
g(n) = rea and BI = g(n), 


and combining the resulting inequalities produces 
1 1 
zI” < g(n) = FEL for each integer n = k. 


Now both f(n) = 0 and g(n) = 0 for each integer n = k. 
Also, since both A and B are positive real numbers, so 
are 1/A and 1/B. Thus, by definition of @-notation, g(n) 
is O(f(n)). 


Proof (by contradiction): Suppose not. That is, suppose 
n° is O(n’). [We must show that this supposition leads to 
a contradiction.] By definition of O-notation, there exist 
positive real numbers B and b such that 

0<n < Br? 


Dividing the inequalities by n? and taking the cube root 
of both sides gives 


O0<n=WB foreach integer n = b. 


for each integer n = b. 


These two conditions are contradictory because on the 
one hand n can be any integer greater than or equal to 
b, but when n is greater than b, then n is less than WB, 
which is a fixed integer. Thus the supposition leads to a 
contradiction, and hence the supposition is false. 


22. a. Solution 1 (using ad hoc calculations): Let 4 stand 
for the words “if, and only if,’ and observe that 


1 
(*) z“ < 2n — 90n? +3 


< nê < 4n — 180° +6 
because dividing or multiplying both sides 
of an inequality by 2, which is positive, 
preserves the direction of the inequality. 
<> 1807-6 = 3n 
because adding or subtracting 180n° — 6 
to both sides of an inequality preserves 
the direction of the inequality. 


2 
**) <> 60-—— =n because dividing or multiplying both sides 
3 S plying 
n of an inequality by 31°, which is positive, 
preserves the direction of the inequality. 


Because all the inequalities are equivalent (that is, 
each inequality is true if, and only if, all the others 
are true), any value of n that makes inequality (**) 
true makes inequality (*) true also. Now 


2 
if n = 60, then n= 60—-—, 
n 
which is inequality (**). Therefore, inequality (*) is 
also true for n = 60, for every integer n = 60, 


1 
ral < 2n — 900 +3. 
Let A = 5 and a = 60. Then for every integer n = a, 
Ant < 2n*—90n* +3. 
and so, by definition of Q-notation, 2n* — 90n? +3 
is Qn’). 
Solution 2 (using the general procedure): 


To use the general procedure from Example 11.2.4 
to show that 2n* — 90n? + 3 is O(n), let 


a=}2=1 d ae 90} + |3]) = 93 
jo and a = (| | + |3]) 
and note that a = 1. We will show that 

n? < 2n* — 90n? +3 for every integer n = a. Now 


n = a means that 


n= 90 F3, 

Multiplying both sides by n? gives 

n? = 90n? + 3n? 
and subtracting first 3n° and then 3 from the right- 
hand side gives that 
n’ = 90n? = 90n?—3 for every integer n = a. 
Subtracting the right-hand side from the left-hand 
side and adding nî to both sides gives 


2n'—90n7+3=n' for every integer n = a. 
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Thus since A = 1, 
2n — 90n? +3 = An? for every integer n = a, 
and so, by definition of Q-notation, 2n* — 90n? +3 
is O(n’). 

b. To show that 2n*— 90n? + 3 is O(n’), observe that 
for every integer n = 1, 


2n* — 90n? +3 < 2n* + 907° +3 
because when n = 1, 
then 907’ is positive 
< 2n*+90n* + 3n* 

by Theorem 11.2.2 


: 3 4 
(sincen=l,nsn 
4 


and L=7", 
and so 90n? < 90n* and 3 < 3n’) 
= 95n" because 2 +90 +3 = 95. 


Thus, by transitivity of order and equality, for every 
integern = 1, 


2n* — 90n? +3 < 9574. 


In addition, by part (a), for every integer n = 60, 
1 
7“ < 2n — 90n +3 


so since 0 = ini, transitivity of order gives that for 
every integer n = 60, 
0 = 2n*—90n? +3 < 95n', 
Let B = 14 and b = 60. Then, for every integer 
n=b, 
0 = 2n*—90n? +3 =< Bn* 
and hence, by definition of O-notation, 
2n* — 90n? + 3 is O(n’). 
c. Solution 1: Let A = 5, B = 95, and k = 60. By the 
results of parts (a) and (b), for every integer n = k, 
An* = 2n*—90n? +3 = Bn‘ 
and hence, by definition of @-notation, 
2n* — 90n? + 3 is O(n’). 
Solution 2: By parts (a) and (b), 2n* — 90n? + 3 is 
both Q(n*) and O(n’). Hence, by Theorem 11.2.1, 
2n* — 90n? + 3 is O(n’). 
25. Proof: Suppose 
P(n) = ayn" + dyn" | + dyn” > + ++ Hanta 


where all the coefficients do, a), . . . , Am are real numbers 

and an > 0. 

a. Proof that P(n) is O(n”): According to the general 
procedure described in Example 11.2.4, we let 


A=-a 


[am-1| + [am| + ++: + |a] + aul + [aol 
a=2( m—1 m—2 2 1 0 d 


am 


and a = max(d, 1). 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


A-127 APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


Then n = a means that Let 


n= oe |an—2 kate |a| H |a| t t B [anl H lam=1| |an=2| beset |a| | |a| H laol. 
7 Am l Then, by transitivity of order and equality, for each 
integer n = 1, 


Multiplying both sides by han"! gives 
-2 


-1 
amn” Hapin" + Amon” + ++ + antag S Bn". 


In addition, by part (a), there exists a positive real 
number a such that for each integer n = a, 


= (lan—1| H |an—2| Passa A H |a| H aoln"! 


= |amn—1 i? + laa alae te am 2 
beiak a GS Gat ag tay an 2 ha a, 
= Jay —y|n | + lapal n"? +++ + laln? + |ay|nt |ao| Now wyr > 0 because a,, > 0, and thus, transitiv- 
ee ee eae ity of order gives that for each integer n = a, 


r=m—|sinten= 1. 
Thus, by transitivity of order and equality, for each 
integer n = a, 


a 
( jr [ami [nT + Jamol" + ++ + Jalan lao]. 
Subtracting the right side from the left gives 
a 
(2) |n—1 |a"! ~~ |am—2|n” | = = |a |n T |ao| =0 


for each integer n = a. It follows that, for each 
integer n = a, 


an = = 
(2 n” + ayn | + dy ot” + anta =0 


because, by definition of absolute value, each 
a, 


—|a;| =S a; Adding (F) n” to both sides gives that, 


for each integer n = a, 


a 
z. -3 m 
amn” + am-in F Amon” “+ +++ + an + ag Z= ( 2 J” 


. Am 
Therefore, with A = > and a = max(d, 1), we have 
that for each integer n = a, 


-1 -2 
Ay lt” tapan Hapon" + + antag = An", 


and so, by definition of -notation, 
apn” + dy” | + yon 2 + +++ ant a 
is O(n”). 
b. Proof that P(n) is O(n"): Observe that for each 
integern = 1, 
Ay tt" + Ayn” | + apon" +++ Fant ay 


= [anla + Jami"! + Jamala"? 


tess + laln? Jai|n+ |ao| 
because by definition of absolute value each a; < |a;| 


<= |a,|n" + |an—1|n" + |an—2|n” 
tess + laln” + lain” + |ag|n” 
by Theorem 11.2.2 since n = 1, n”Ti < n" for each i 
from 0 through m 
= (l4m| + [am —1| + lam- + + la| + lar] + aon” 
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-1 =) 
0S amn” + ayn” + dyn” t: ant ao 
Let b = max(1, a). Then, for each integer n = b, 
-1 ~2 
OS ayn" tapan tapan" “+ +++ + an t ag S Bn", 


and hence, by definition of O-notation, 


=j ~2 
amn” + Amn" + Amon” “+ +++ + an+ ao 


is O(n”). 
c. Proof that P(n) is ®(n™): By parts (a) and (b), 
ann” T anin T pn ue an T day 


is both Q(n”) and O(n”). Hence, by Theorem 11.2.1, 


an” +ap n"! + apn"? + Hanta 
is O(n”). 
n+1)X(n-2) 1 
26. ee m +n—2n—2) = —(n —n—2) 
4 4 
iy 1 1 
=—n-——n--, 
4 4 2 


which is ©(n°) by the theorem on polynomial orders. 


nnt+1)Qnt+1) 1 


29. 6 ALG + 1)(2n+ 1)] 
= slo? t+n)2n+1)] 
5 On + 3n? 4 n) = 7” t Er } an 
which is O°) by the theorem on polynomial orders. 
32. By exercise 10 of Section 5.2, 4274374 °°: +n = 
mas which is Om’) by exercise 29 above. 
Hence 17 +2? +3? +- +n? is O°). 
34. Note that 
2+4+6+:+2n=21+2+3+: +n) 


by factoring out a 2 
n(n+ 1) 
=2 ie by Theorem 5.2.1 


=R +n by algebra, 
and so, by the theorem on polynomial orders, 


2+4+6+ = +2nis O(n’). 


36. 


38. 
40. 


43. 


44. 


46. 


Note that 


E 9) = 4D Disa? 


by Theorem 5.1.1 


n(n + 1) 
=4 7 (OOF e9) 
n terms 
by Theorem 5.2.1 
= 2n +2n—9n by definition of 
mutiplication 
=2n°— 7n by algebra, 


and so, by the theorem on polynomial orders, 
dS 4i-9) is Oô. 
i=1 


Hint: Use the result of exercise 13 from Section 5.2. 


a. Proof: Suppose c is a positive real number and fis 
a real-valued function defined on a set of nonnega- 
tive integers with f(n) = 0 for each integer n greater 
than or equal to a positive real number k. Now if we 
let A = B = c, we have that for each integer n = k, 
ARN) = cfin) = Bfin) 
and so, by definition of ©-notation, cf(n) is O(f()). 
b. Let c = 3 and fn) = n. Then fis a real-valued 
function and f(n) = 0 for each integer n = 0. So 
by part (a), cf(n) is O(f()), or, by substitution, 37 is 
O(n). 


By exercise 15, kk = | 


| is O(n), and by exercise 40(b), 


3n is also O(n). Thus |24+|+ 3n is O(n) by 


Theorem 11.2.9(a). 


By exercise 28, Ma is @(n?), by exercise 17, |z] is 
O(n), and by exercise 41 (with f(n) = 1), 1 is ©(1). 
Now n <n’ and 1 < n? for each integer n = 1. Thus 


mD |3 |+ 1 is @(n) by Theorem 11.2.9(). 


a. Proof (by mathematical induction): Let the property 
P(m) be the sentence 


If nis any integer with n > 1, then n” > 1. = <—P(m) 


Show that P(A) is true: We must show that if n is 
any integer with n > 1, then n' > 1. But this is true 
because n! = n. So P(\) is true. 

Show that for every integer k = 1, if P(k) is true 
then P(k + 1) is true: Let k be a particular but arbi- 
trarily chosen integer with k = 1, and suppose that 


If n is any integer with n > 1, then n> 1. <— P(k) 
inductive 
hypothesis 

We must show that 
If nis any integer with n > 1, then nt! >1. opt) 


So suppose n is any integer with n > 1. By induc- 
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tive hypothesis, n* > 1, and multiplying both sides 
by the positive number n gives nen’ > n-1, or, 
equivalently, n*! > n. Thus n't! > nandn > 1, 
and so, by transitivity of order, nk! 
be shown]. 

b. Proof: Suppose n is any integer with n > 1 and 
rand s are integers with r < s. Then s — r is an 
integer with s — r = 1, and so, by part (a), n° "> 1. 
Multiplying both sides by n” gives n:n" > n"-1, 
and so, by the laws of exponents, n° > n” [as was to 
be shown]. 


> 1 [as was to 


a. Proof (by mathematical induction): Let the property 
P(m) be the sentence 


IfO<x<1,thenx” <1. 


<—P(m) 


Show that P(A) is true: We must show that if 
0<x<1,thenx' =1.Butx=1 by assumption 
and x! = x. So P(1) is true. 

Show that for every integer k = 1, if P(k) is true 
then P(k + 1) is true: Let k be any integer with 

k = 1, and suppose that 


If0<x <1, then <1, <— P(k) inductive 


hypothesis 
We must show that 


If0<x<1,thenx*! <1. — PK +1) 


So let x be any number with 0 < x <= 1. By induc- 
tive hypothesis, x* < 1, and multiplying both sides 
of this inequality by the nonnegative number x 
gives xx < x1. Thus, by the laws of exponents, 


4 
xt! < x. Then 


y 
xX l <xandx= l, 


and hence, by the transitive property of order (T18 
in Appendix A), xt = 1, 

b. Hint: What is the contrapositive of the statement in 
part (a)? 


48. Proof of Theorem 11.2.6(b): 


Let fand g be real-valued functions defined on the same 
set of nonnegative integers, and suppose there is a posi- 
tive real number r such that f(n) = 0 and g(n) = 0 for 
each integer n = r. Suppose also that g(n) is O(f(n)). We 
will show that f(n) is O(g(n)). By definition of O-nota- 
tion, there are positive real numbers B and b such that 
b = r, and, for each integer n = b, 
0S g(n) S Bf(n) 


Divide the right-hand inequality by B to obtain 


a) = f(n), 


for each integer n = b. Let A = 1/B and a = b. Then for 
each integer n = a, 


Ag(n) = f(n) 
and so f(n) is O(g(n)) by definition of Q-notation. 
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50. a. Proof of Theorem 11.2.8(a): 


51. 


a. 


Let f and g be real-valued functions defined on the 
same set of nonnegative integers, and suppose there is a 
positive real number r such that f(n) = 0 and g(n) = 0 
for each n = r. Suppose also that f(n) is O(g(@)) and c 
is any positive real number. [We will show that cf(n) is 
O(g(n)).] By definition of Q-notation, there are positive 
real numbers A and a such that a = r, and, for each 
integer n = a, 
Ag(n) = f(n). 

Multiply both sides of the inequality by c to obtain 

cAgin) = cfr), 
and let A’ = Ac. Then A’ is a positive real number 
because both A and c are positive real numbers. 


Hence there are positive real numbers A’ and a such 
that a = r, and, for each integer n = a, 


A’ e(n) = cf(n). 
Thus cf(n) is O(g(n)) by definition of Q-notation. 


Partial proof of Theorem 11.2.9(a): Let fi, f2, and g 
be real-valued functions defined on the same set of 
nonnegative integers, and suppose there is a posi- 
tive real number r such that fi(n) = 0, fa(n) = 0, 
and g(n) = 0 for each integer n = r. Suppose also 
that fi(n) is O(g(n)) and fa(n) is O(g(n)). [We will 
show that (fi(n) + fa(n)) is O(g()).] By definition of 
Q-notation, there are positive real numbers A, B, 
A’, B', k, and k’ such that k = r, k’ = r and, for each 
integer n such that n = k and n =k’, 


Ag(n) S fi(n) S Bg(n) 


and 
A’g(n) S f(n) = B'g(n). 
Let k” = max(k, k'). 


SECTION 11.3 


1. 


d. 


log2(200) = me 2 = 7.6 nanoseconds = 


0.0000000076 second 
2007 = 40,000 nanoseconds = 0.00004 second 
2008 = 2.56 x 10!8 nanoseconds = 


2.56 X 10'8 = 
10°-60-60-24- (365.25) Years = 81.1215 years 


[because there are 10° nanoseconds ina second, 

60 seconds in a minute, 60 minutes in an hour, 

24 hours in a day, and approximately 365.25 days ina 
year on average]. 


When the input size is increased from m to 2m, 

the number of operations increases from cm to 
c(2m)" = dem’. 

By part (a), the number of operations increases by a 
factor of (4cm”)/cm? = 4. 

When the input size is increased by a factor of 10 
(from m to 10m), the number of operations increases by 
a factor of (c(10m)”)/(cm?) = (100cm’)/em? = 100. 
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- Algorithm A has order n? and algorithm B has order 


3 
n [2 


. Algorithm A is more efficient than algorithm B 


when 27? < 80n*/?. This occurs exactly when 


2 
R< 40n << 40 & n!” < 40 & n < 402, 
n 


Thus, algorithm A is more efficient than algorithm 
B when n < 40° = 1,600. 


- Algorithm B is at least 100 times more efficient than 


algorithm A for values of n with 100(80n°/ 2) = 2n’. 
This occurs exactly when 8,000n*/ <2 es 


4,000 = -53 = 4,000 = Vn = 16,000,000 = n. 


n 
Thus, algorithm B is at least 100 times more ef- 


ficient than algorithm A when n = 16,000,000. 


. There are two multiplications, one addition, and 


one subtraction for each iteration of the loop, so 
there are four times as many operations as there 
are iterations of the loop. The loop is iterated 
(n—1)—3+1 =n—3 times (since the number of 
iterations equals the top minus the bottom index 
plus 1). Thus the total number of operations is 
4(n— 3) = 4n— 12. 


© By the theorem on polynomial orders, 4n — 12 is 


O(n), so the algorithm segment has order n. 


. There is one addition for each iteration of the loop, 


and there are |n/2| iterations of the loop. 


© Because 


n/2 if n is even 
[n/2| = a 
(n—1)/2 if n is odd, then |n/2] 
is O(n) by theorem on polynomial orders. So the 
algorithm segment has order n. 


. For each iteration of the inner loop, there is one 


multiplication and one addition. There are 2n 
iterations of the inner loop for each iteration of 
the outer loop, and there are n iterations of the 
outer loop. Therefore, the number of iterations of 
the inner loop is 2n-n = 2n’. It follows that the 
total number of elementary operations that must 
be performed when the algorithm is executed is 
2-2n? = An’. 


. Since 4n? is @(n’) (by the theorem on polynomial 


orders), the algorithm segment has order n. 


. There is one addition for each iteration of the inner 


loop. The number of iterations in the inner loop 
equals the number of columns in the table below, 
which shows the values of k and j for which the 
inner loop is executed. 


2 3 


n-1 


2I TIZAS | L213 )4) as] 2 Z3 N A 


14. a. 


Hence the total number of iterations of the inner 
loop is 


24+34---+n=(14+2434-"+n)-1 
n(n+ 1) n+n yy I 

= nor Tn 

2 2 2 2 


(by Theorem 5.2.1). Because one operation is per- 
formed for each iteration of the inner loop, the total 
number of operations is in? + in =, 

By the theorem on polynomial orders, n? + sn = 1 
is @(n?), and so the algorithm segment has order n. 


There is one addition for each iteration of the inner 
loop, and there is one additional addition and one 
multiplication for each iteration of the outer loop. 
The number of iterations in the inner loop equals 
the number of columns in the following table, which 
shows the values of i and j for which the inner loop 
is executed. 


t2 3 se ln 


1}1]2)1)2)3)] «| E 2] 3] +: [an 


17. a. 


1 2 3 n 
Hence the total number of iterations of the inner loop is 


e 


n=(1+2+3+: +n) 
nn+1) nê°+n 1, 1 
2 2 3 2 


(by Theorem 5.2.1). Because one addition is 
performed for each iteration of the inner loop, the 
number of operations performed when the inner 
loop is executed is in? + L, Now an additional two 
operations are performed each time the outer loop 
is executed, and because the outer loop is execut- 
ed n times, this gives an additional 2n operations. 
Therefore, the total number of operations is 


ioe, 1,,5 


n+—n+2n = =n +n. 
2 2 


By the theorem on polynomial orders, sn oP xn is 
@(n’), and so the algorithm segment has order n°. 


There are two subtractions and one multiplication 
for each iteration of the inner loop. If n is odd, the 
number of iterations of the inner loop equals the 
number of columns in the following table, which 
shows the values of i and j for which the inner loop 
is executed. 
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Thus the number of iterations of the inner loop is 


n-1 n-1 ntl 
14+14+2+2+°---4 t t 
2 2 2 
n-li n+1 
=2 124344 + 
2, 2 
n—-1lf{n-1 
5 5 +1 
=2. f ntl by Theorem 5.2.1 
2 2 
n?—2n+1 n-—1 n-l 
4 "a g 
1, 1 1 
=n Fn 
4 2 4 


By similar reasoning, if n is even, then the number 
of iterations of the inner loop is 
non 


Lae PDR Sab Sat est 
2 2 


=2ù1+2+3+ +” 
T T T 5 


=2- 5 by Theorem 5.2.1 
_ won 
4 2 


Because three operations are performed for each 

iteration of the inner loop, the answer is 3 (£ + z) 

when n is even and 3 (in? + in + 1) when n is odd. 
2 

Since 3 ($ + 2) is @(n’) and 3 (in? + tn + 1) is also 

O(n’) (by the theorem on polynomial orders), this 

algorithm segment has order n. 


19. Hint: See Section 9.6 for a discussion of how to count 
the number of iterations of the innermost loop. 


20. 


all] 


a[2] a{3] al4] a[5] 


Initial order 
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5 

6 

2 
a{3] | 1 6 4 
ald] | 8 4|\6 
a{5] | 4 
k 2 3 4 
x 2 1 8 
j TiO 2 O a 0 A 320 


24. There are seven comparisons between values of x and 
values of aļ[ j]: one k = 2, two when k = 3, one when 
k = 4, and three when k = 5. 


27. Hint: E, = 5[3+4 +-+--+ (n+ 1)], which equals 
SIC +2434-+°+(n+1))-(1+2)). 


28. The top row of the table shows the initial values of the 
array, and the bottom row shows the final values. The 
results for executing each step in the for-next loop are 


shown in separate rows. 


k a[l] a[2] a[3] aļ[4] a{5] 


Initial 7 3 8 4 2 
1 2 3 8 4 7 
2 2 3 8 4 7 
3 2 3 4 8 7 
4 2 3 4 7 8 
5 2 3 4 7 8 
30. 
n 5 
a[l] 713 2 
a[2] 3T 
a[3] 8 4 
a[4] 4 8 7 
a{5] 2 7 8 
k 1 2 3 
IndexOfMin | 1 | 2 5 2 4 4/5 
i 2| |3l4als| lalalslals] f5 
temp 7 8 7 
32. There is one comparison for each combination of values 
of k and i: namely, 4+3 +2+1 = 10. 


35. b. n—3+1=n—2. d. Hint: The answer is n°. 
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a[2] =] 


a[3] 


x 


24 


polyval 


m us G O 


38. Number of multiplications 
= number of iterations of the inner loop 


= 14243 en 
7 n(n + 1) 


2 


by Theorem 5.2.1 


Number of additions 
= number of iterations of the outer loop 
=n 


Hence the total number of multiplications and additions is 


nnt) 1, 3 
2 rn 2” T a 
3 
a[0] 2 
a[l] 1 
a[2] =1 
a{3] 3 
x 2 
polyval 3 11 | 24 
i 1 23 
42. Hint: t, = 2n. 
SECTION 11.4 
1. x fœ =F 
0 39= 1 
1 31 = 3 
2 ?=9 
-1 3 += 1/3 
—2 37= 1/9 
1/2 3⁄2 = 17 
— (1/2) 3703 = 0.6 


3. x h(x) = log x 
10 
100 
1/10 =l 
1/100 —2 


w+ 


=Y 
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x x x log, x 
1.0=0 
2 2-1=2 
4 4-2=8 
8 8-3 = 24 
1/8 (1/8): (—3) =—3/8 
1/4 (1/4)-(—2) = -1/2 
3/8 (3/8)-(log,(3/8)) = — 0.53 


9. The distance above the axis is (2™ units) (6.3 5 „millimeters ) = 


Xi $35 kilometers = 117,136,824,900,000 


kilometers. The ratio of the height of the point to the 


average distance of the earth to the sun is approximately 


x 117,136,824,900,000/150,000,000 = 780,912. 


10. b. By definition of logarithm, log, x is the exponent to 
which b must be raised to obtain x. Thus when b is 


5. x Llog, x] actually raised to this exponent, x is obtained. That 
1sx<2 0 is, DOS = x, 
25x<4 1 Tl. b. y 
4<=x<8 2 ae 
(3, 5) 2 
8<x<16 3 5 = g y=x 
/2<x<1 = La 
N g : 
W4<x<1/2 = ‘af =e 
(-3, 1) a? 
DÀ e 1 Pá (4, 1) 
l ee er ees eee ee S 
4+ F(x)= [logs x] 25 -3 =1,/ 1 3 5 x 
37 — Fa “1 
21 Z (4, -2) o, ra _ 
it y -3 e (1, —3) 
A “e 
pp 2 a nh pent pi 
A j (-2, —4) ach pair of points 
ilo? 4 6 8 10 12 14 416 pi -5 t oand (vajar 
-240 a "mirror image reflections" 
across the line y = x. 
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13. Hints: (1) (logy) x] = m, (2) See Example 11.4.1. = 3 + 2\|log,(k+ 1) — 1)J 
because log, 2 = 1 
= 3 + 2(logs(k + 1)]— 1) 


because for every real number x, |x — 1] =|x |—1 


15. No. Counterexample: Let n = 2. Then 
[logy (n — 1)] = [log, 1] = [0] = 0, whereas 
[logs #] = Hogs 2] = [11] = 1. by exercise 15, Section 4.6 

16. Hint: The statement is true. = 1+ 2\logs(k + 1)J 


by algebra. 
Case 2 (k is even): In this case k + 1 is odd, and 


18. [logy 148206)+ 1 = 18 


21. a. a, =1 Ae = A+ + 2 
a) = Ayjj+2 =a, t+2=14+2 by the recursive definition of a), a2, a3,... 
a; = G3/9,+2=a,+2=14+2 =a + 2 
4 = đa +2 = ag+2=(14+2)4+2 by Theorem 4.6.2 because k + | is odd. 
=142-2 = 1 + 2[log,(k/2)|+ 2 
a; = asp 2=a+2=(1+2)+2 7 by inductive hypothesis 
=14+2-2 Sa 2| 
as = 46/23 + 2 = a3 +2 = (14+2)+2 _ Ropa Sj 
eo ee because log, 2 = 1 
a = 47/232 = a3 +2 = (1+2)+2 = 3 + 2(log, k|- 1) 
= ee because for every real number x, |x — 1] = |x |—1 
Ag = qg} +2 = ay +2 by exercise 15, Section 4.6 
= (14+2-:2)+2=1+4+3:-2 = 1 + 2|log k] 
Ag = Agia +2 = ag +2 by algebra. 
= (14+2-:2)+2=1+4+3:-2 = 1+ 2\log,(k+ 1)] 
: by property 11.4.3. 
ais = qıs} +2 =a, +2 Thus in either case, az+ı = 1 + 2|logz(k + 1)| fas 
= (1+2-2)+2=1+4+3:-2 was to be shown]. 
16 = 16/2) 2 = ag t+ 2 


= (14+3-2)+2=144-2 23. Hint: When k = 2, then k* = 2k, and sok = 5, Hence 


Guess: E+ ks E+E =k. Also, when k = 2 then L> 1, and 
a, = 1 +2|log; n] 


2: 2 2 2 
so 5 < E, Consequently, Ei < E+E =F. 
b. Proof: Suppose the sequence aj, az, 43, . . . 


is defined recursively as follows: a, = 1 and 24. Hint: Here is the argument for the inductive step in the 
ay = Gx/2\+ 2 for each integer k = 2. Let the prop- case where k is odd and k + 1 is even. 
erty P(n) be the equation a, = 1 + 2 |log, nj. We Ck+1 = 2+ t (k+ 1) ; = 
will show by strong mathematical induction that 7 by the recursive definition of c1, €2, €3, .«- 
P(n), is true for each integer n = 1. = 2Wcernratkt) 

. i; . by Theorem 4.6.2 because k + 1 is even 
Show that P(A) is true: P(1) is the equation 
1+ 2|log, 1) = 1+2-0 = 1, which is the value of aj. <2 E Hogt 4) + (k +1) 
Show that for any integer k = 1, if PĠ) is true for 2 


by inductive hypothesis 
= (k+ 1)[loga(k + 1) — logy 2] + (k+ 1) 
by algebra and Theorem 7.2.1(b) 
= (k+ Ifloga(k +1) — 1] +(k+1) 


every integer i from 1 through k, then P(k + 1) is 
true: Let k be any integer with k = 1, and suppose 
a; = 1+ 2\log, i| for each integer i from 1 through 
k. [This is the inductive hypothesis.] We must show 


that a+; = 1 + 2llog (k+ 1h. berms log, 2 = 1 
Case 1 (k is odd): In this case k + 1 is even, and = (k+ 1)dog,(k + D) 
by algebra. 
ak+1 = Ag+ y/2)+ 2 
by the recursive definition of aj, a2, a3.... 25. Solution I: One way to solve this problem is to 
= ak2 +2 compare values for log, x and xl for conve- 
because k + 1 is even (Theorem 4.6.2) niently chosen, large values of x. For instance, if 
= 1+ 2{logo((k + 1)/2)|+2 powers of 10 are used, the following results are 
by inductive hypothesis obtained: log,(10'°) = 10 log, 10 = 33.2 and 
= 3+ 2|logo(k + 1) — log, 2) 


(10!)!/10 = 1010/9 = 10! = 10. Thus the value 


by Theorem 7.2.1(b) 10 
x = 10 does not work. 
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27. 


29. 


30. 


31. 


32. 


33. 


However, since log,(10) = = 20 log210 = 66.4 and 
(1070)1/10 = 1970/10) = 10? = 100, and since 66.4 < 100, 
the value x = 10%” works. 


Solution 2: Another approach is to use a graphing 
calculator or computer to sketch graphs of y = log, x 
and y = x10. taking seriously the hint to “think big” 
in choosing the interval size for the x’s. A few tries 

and use of the zoom and trace features make it appear 
that the graph of y = x!/! crosses above the graph of 

y = log, x at about 4.9155 X 10". Thus, for values of x 
larger than this, gis log, x. 


By Theorem 11.2.7, n is O(n) and log, n is O(log, n), 
and, by Theorem 11.2.8(¢), 2n is @(n). In addition, by 
property 11.4.9, there is a positive real number s such 
that for each integer n = s, log) n = n. Finally, if n is 
any integer with n = 1, then n = 0. Thus it follows from 
Theorem 11.2.9(c) that 2n + log, n is O(n). 


By Theorem 11.2.7, n? is O(n) and 2” is @(2”). In addi- 
tion, by property 11.4.10, there is a positive real number 
s such that for each integer n = s, vw <2", Finally, if n 
is any integer, then 2” = 0. Thus it follows from Theo- 
rem 11.2.9(¢) that n? +2” is @(2”). 


Hint: 2"*! = 2-2" 


Hint: Use a proof by contradiction. Start by suppos- 
ing that 4” is O(2”). That is, that there are positive real 
numbers B and b such that O = 4" < B- A for every 


az 


real number n > b, and use the fact that 4; z 
to obtain a contradiction. 


By Theorem 5.2.2, for each integer n = 0, 


grl] 
1+2+274 +2" = = 71, 
2-1 
Also, 
Wea gnti -l< pnt =72-9" 


Thus, by transitivity of order, 
2 =14+24+2?+ 


Moreover, 


+2" s22". 


2” = 0 for each integer n. 


Let A = 1, B = 2, and k = 1. Then, for each integer 
n>k, 


AP S12E22 = B2, 
Thus, by definition of @-notation, 1 +2 + p 
is O(2"). 


-47 


Hint: This is similar to the solution for exercise 32. 
Use the fact that 4+4? +4 + ---+4"=4(14+4 
PAPH HTN. 
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36. Factor out n to obtain 


40. 


41. 


non n 
Sh ap eee he a 
2 4 2 
Toi 1 
=ni\1l+—+—4 a 
2 4 2 
1 nt+1 
2 
=n =r by Theorem 5.2.2 
a! 
2 
1-2" ell z > 
=e by multiplying numerator and 
2"(1 — 2) denominator by 2”* ! 
on +I 1 
=H 
( 2” ) 


by algebra. 


1 
Now ao = wienn => 1. Thus 


1 
lns n(2-5] = 2-n, 
2 


and so, by substitution, 


basat toto: E Ot 


Let A= 1, B= 
n>k, 


2, and k= 1. Then, for each integer 


non 
AnSnt =+—-4+-- 

2 4 
Hence, by definition of @-notation, 


n on 
Akop +Z is OQ). 
2 4 2 


If n is any integer with n = 3, then 


( t- | ') 
n l+—+—4+--:4+-]. 
2 3 n 


.4.7 and by Theorem 11.2.7(a), 


n n n 


2 3 n 


By Example 11 
1 1 


1. 
oe ana 7 8 O(logs n) 


Thus, by Theorem 11.2.9(c), 


and nis O(n). 


non n. 
n+>+>+ +—is O(n logy n). 
2. 3 n 


Proof: If n is any positive integer, then log, n is defined 


and by definition of floor, 


logy n| S log, n <[log, n]+ 1. 
If, in addition, n is greater than 2, then since the logarith- 
mic function with base 2 is increasing 
logs n > logy2 = 1. 
Thus, by definition of floor, 
1 <[log, n]. 
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43. 


44. 


45. 


46. 


47. 


Adding |log, n| to both sides of this inequality gives 
logs nj+ 1 S 2 |log, n]. 
Hence, by the transitive property of order (T18 in 
Appendix A), 
log, n S 2 |log, n), 
and dividing both sides by 2 gives 


1 
7 1082 n Slog, n]. 


Let A = 1/2, B = 1, and k = 2. Then 


A logy n S |log n| = Blog, n for every integer n = k. 


Therefore, by definition of ©-notation, |log, njis O(log, n). 


Proof (by mathematical induction): Let the property 
P(n) be the inequality n = 10”. 

Show that P(1) is true: 

When n = 1, the inequality is 1 = 10, which is true. 
Show that for every integer k = 1, if P(k) is true, then 
P(k +1) is true: 

Let k be any integer with k = 1, and suppose k = 10°. 
[This is the inductive hypothesis.] We must show that 
k+1<=10**!. By inductive hypothesis, k = 10". Add- 
ing to both sides gives k+ 1 = 10+ 1. But when k = 1, 
10+ 1 = 10*+9-10* = 10-10* = 10**!. Thus, by tran- 
sitivity of order,k+1= 10"! [as was to be shown]. 


Hint: To prove the inductive step, use the fact that if 

k > 1, then k+ 1 < 2k. Apply the logarithmic function 
with base 2 to both sides of this inequality, and use 
properties of logarithms. 

Hint: 2:2-2...2 S 2-(2-3-4...n)= Z-n! 


——————— 
n factors 


a. Example 11.4.6 showed that if n is any integer with 
n= 1, then n! <n”. So, because the logarithmic 
function with base 2 is increasing, 


logy(n!) S log,(n”) (=n log,(n”)). 


Also, when n = 1, then logs(n!) = log, 1 = 0. Thus 
let B = 1 and b = 1. Then 


0 = log»(n!) S Bn log.(n") for every integer n = b. 


So, by definition of O-notation, log,(n!) is O(n log, n). 
b. Hint: 


(a)? =n!-n! = (1:2:3 3)(n (n —1) 3-2-1) 


= (trite r+ D) = [Io r+1). 
r=1 /\r=1 r=1 


Show that for each integer r = 1, 2,..., 7, 
nr= +r=n. 


Let n be a positive integer, and suppose that x > (2")”. 
By properties of logarithms, 


1 
log x = en( 3 Joe. 


1 1 
= (2n) logs(x2") < 21x% 6 
(where the last inequality holds by substituting 
x? in place of u in logy u < u). Now raising both 
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sides of x > (2n)” to the 1/2 power gives 
x!/? > ((2n)")!/? = (2n)". When both sides are 
multiplied by x!/? the result is x = x!/2x!/? > x!/(2n)" = 


xt! 2(2n)", or, more compactly, 


xP On)” A 
Then, since the power function defined by x > x 
is increasing for every x > 0 (see exercise 21 of 
Section 11.1), we can take the nth root of both sides of 
the inequality and use the laws of exponents to obtain 
(c!?2(2ny"yl/" < xt”, 


1/n 


or, equivalently, 
a 
2nx2 < x", 


e) 
Finally use transitivity of order (Appendix A, T18) to 
combine (*) and (**) and conclude that log, x < x!” [as 
was to be shown]. 


a. Proof (by mathematical induction): Let b be any 
real number with b > 1, and let the property P(n) be 
the equation 


Show that P(1) is true: , 
By L'Hopital’s rule, lim G) = lim (pip) =0. 
Thus P(1) is true. “~~ os 


Show that for every integer k = 1, if P(k) is true, 
then P(k + 1) is true: 
Let k be any integer with k = 1, and suppose 
k 
a 


lim (2 ) = 0. [This is the inductive hypothesis.] 


k+1 
We must show that lim E) = 0. Now by 


TAnttal?’ pe . (kt Dat 
LHo6pital’s rule, lim( 5 ) = lim mo 


pfi x (k+1 


im y dn > -0 [by inductive hypothesis] 


x30 


[This is what was to be shown.] 

b. By the result of part (a) and the definition of limit, 
given any real number £ > 0, there exists an integer 
N such that jen 0| < e for every x > N. In this 
case take e = 1. It follows that for every x > N, 
< 1 since x and b are positive. Multiply both 
sides by b* to obtain x" < b*. Let B = 1. Then 
0 < x" < B-b* for every x > N. Hence, by defini- 
tion of O-notation, x” is O(b"). 


SECTION 11.5 


1. 


logy 1,000 = log,(10°) = 3 logs 10 = 3(3.32) = 9.96 

log>(1,000,000) = log,(10°) = 6 log, 10 = 6(3.32) 
= 19.92 

log,(1,000,000,000,000) = log,(10'7) = 12 log, 10 

12(3.32) = 39.84 


IR 


If m = 2", where kis a positive integer, then the al- 
gorithm requires c [log,(2*)| = c |k] = ck operations. 
If the input size is increased to m = Chn =2%* 
then the number of operations required is 
cllog,(2”4)| = c|2k| = 2(ck). Hence the number of 
operations doubles. 
As in part (a), for an input of size m = 2* where 
k is a positive integer, the algorithm requires 
ck operations. If the input size is increased to 

= (2*)!0 = Ok then the number of operations 
required is c [log,(2!*)| = c|10k| = 10(ck). Thus the 
number of operations increases by a factor of 10. 
When the input size is increased from 2’ to 278 the 
factor by which the number of operations increases 
. cllog,(2*)) _ 28c _ 

clog] 7e 


- A little numerical exploration can help find 


an initial window to use to draw the graphs 

of y = x and y = [50 log, x}. Note that Whien 

x = 28 = 256, [50 logs x] = [50 log(28)] = [50:8] = 
|400] = 400 > 256 = x. But when x = 2° = 512, 

[50 logs x] = [50 logs(2°)| = [50-9] = |450] = 450 < 
512 = x. So a good choice of initial window would 

be the interval from 256 to 512. Drawing the graphs, 
zooming if necessary, and using the trace feature reveal 
that when n < 438, n < [50 log nj]. 


index 0 1 
bot 1 
top 10 4 1 
mid 5 2 1 
x Chia 
b. | index| 0 
bot 1 6 7 
top 10 7 
mid 5 8 6 7 
x Max 


7. a. The array has top — bot + 1 elements. 


b. Proof: Suppose top and bot are particular but 
arbitrarily chosen positive integers such that 
top — bot + 1 is an odd number. Then, by definition 
of odd, there is an integer k such that 


top — bot+1 = 2k+1. 
Adding 2-bot— 1 to both sides gives 
bot + top = 2-bot-—1+2k+1 
= 2(bot + k). 


Now bot + k is an integer. Hence, by definition of 
even, bot + top is even. 


11.5 SOLUTIONS AND HINTS TO SELECTED EXERCISES 


8. 
9. 


10. 


11. 


14. 
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n 27 13 6 3 1 0 


For each positive integer n, n div 2 = |n/2|. Thus when 
the algorithm segment is run for a particular n and the 
while loop has iterated one time, the input to the next 
iteration is |n/2|. It follows that the number of iterations 
of the loop for n is one more than the number of itera- 
tions for |n/2). That is, a, = 1 + a,/2} Also, a; = 1. 


The recurrence relation and initial condition of aj, a, 
az, . . . derived in exercise 9 are the same as those for 
the sequence w1, W2, W3,... discussed in the worst-case 
analysis of the binary search algorithm. Thus the gen- 
eral formulas for the two sequences are the same. That 
is, a, = 1+ log, nj, for each integer n = 1. 


In the analysis of the binary search algorithm, it was 
shown that 1 + [log njis @(og, n). Thus the given 
algorithm segment has order log) n. 


Hint: The formula is b, = 1 + |log; nj]. 


20. 


22. 


Ini 


WH ni DI wo 


WN |] & |} YN |} vo 


0000000000 


R GB U C FHG 


tial array: 


R 


5, 
ae m 


Q 


7 by j 
: merge 7 


B G R U C F G H 
"a on A 
Final array: B C F G@GGHRU 
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24. a. Refer to Figure 11.5.3 and observe that when k is 
even, the subarray a[mid], a[mid + 1], . . . , a[top] 
has length k- ($+ 1) +1 =5 = [5]. 
25. Hint: The following are the steps for part (a) in the case 
where k is odd and k + 1 is even: 


Mery = Mgt t Merpy/yt (k+1)- 1 


= Mg+1)/2 © Mg+1)/2 + (k+1)— 1 


by Theorem 4.6.2 and exercise 19 in 
Section 4.6 because k + 1 is even 


= 2Mg+1)/2 + k 


te 1 (k+1) k+1 , 
=2|>( 2 Jioe( 5 fee 


by inductive hypothesis 


k+1 
= J [loga (k + 1) — log22] + k 
1 
= z&t Dfilog:(k+1)—1]+k 
1 k+1 2k 
= —-(k+ I)log,(k+1)—1 H 
stk + Dloga(k+ 1) ( ) 
1 k-1 
= oer I)log,(k+ 1) ae 
1 
= aK + 1log, (k+ 1) by algebra. 


SECTION 12.1 


1. a. Ly = {A, x, y, XX, VY, XXX, XYX, YXY, YYY, XXXX, XYYX, 


YXXY, YYYY} 
b. Ly = {x, xx, xy, XXX, XXY, XYX, XYY} 


3. a. (a+b) (c+d) 
b. Partial answer: 11* = 1-1 = 1, 12*=1-2=2, 
21/= 2/1 =2 


4. LL, is the set of all strings of a’s and b’s that start with 
an a and contain an odd number of a’s. 
Lı U Ly 1s the set of all strings of a’s and b’s that contain 
an even number of a’s or that start with an a and contain 
only that one a. (Note that because 0 is an even number, 
both A and b are in L; U Ly.) 
(Li U L)“ is the set of all strings of a’s and b’s. The rea- 
son is that a and b are both in L, U Ly, and thus every 
string in a and b is in (L; U L)“. 


7. (a | ((b*)b))((a*) | (ab)) 
10. (ab* | ch*)(ac | bc) 
13. LA|ab) = LA) U L(ab) = {A} U L(@)L(b) 
= {A} U {xy |x E L(a) and y E L(b)} 


= {A} U {xy |x E {a} and y E {b}} 
= {A} U {ab} = {A, ab} 
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16. Here is a sample of five strings out of infinitely many: 
0101, 1, 01, 10000, and 011100. 


19. The language consists of all strings of a’s and b’s that 
contain exactly three a’s and end in an a. 


22. aaaba is in the language but baabb is not because if a 
string in the language contains a b to the right of the 
left-most a, then it must contain another a to the right of 
all the b’s. 


25. One solution is 0°10*(0*10"10*)*. 


28. L| DA = Le | HLO = (LM) U Ls) LO 

= {xy | @ E L(y) U L(s)) and y € LO} 

= {xy | Œ E L(y) or x E L(s)) and y E L(A} 

= {xy | (x E L(r) and y € L(t)) or (x E L(s) and y E L(n)} 
= {xy | xy E L(t) or xy E L(st)} 

= Lert) U L(st) = L(rt | st) 


31. preļa—z]* 

34. [a—z]‘(a|e|ilo|wla-z]* 

37. [0— 9]{3}-[0 — 9]{2}-3[0 —9]{2}6 
39. ([+ =] X0- 9*6. [A0 -9 


40. Hint: Leap years from 1980 to 2079 are 1980, 1984, 
1988, 1992, 1996, 2000, 2004, and so forth. Note that 
the fourth digit is 0, 4, or 8 for the years whose third 
digit is even and that the fourth digit is 2 or 6 for the 
years whose third digit is odd. 


SECTION 12.2 


1. a. $1 or more deposited 


2. a. So, Sy, S2 b. 0, 1 Cc. So d. S2 
e. Annotated next-state table: 


Input 
> So Sy SO 
State S1 Sy S2 
© S2 S2 S2 


5. a. A,B,C,D,E,F b. x,y c&. A d. D,E 
e. Annotated next-state table: 


Input 

x y 

> C B 

F D 

State E F 
© F D 

© E F 

F F 


e. 


10. a. 
c. 


11. a. 
c. 


So, S1 $2, $3 b. 0, loe«g SO d. So, $2 
Annotated next-state table: 


Input 
0 1 
> © So S0 S1 
State Sy Sy S2 
© S2 S2 53 
S3 53 SO 


So, S1, $2 b. 0,1 c. So d. S2 


N(sı, 1) = s2, N(So, 1) = 83 
N” (so, 10011) = s2, N*(s;, 01001) = s2 


N(s3, 0) = s4, NCs2, 1) = s4 
N*(5o, 010011) = s3, N*(s3, 01101) = s4 


Note that multiple correct answers exist for part (d) of exer- 
cises 12 and 13, part (b) of exercises 14—19, and for exercises 


20—48. 
12. a. 
b. 
c. 
d. 
14. a. 
b. 
15. a. 
b. 
17. a. 
b. 
18. a. 
b. 
20. a. 


G) s2 GD s2 Gii) sı 

those in (i) and (ii) but not (iii) 

The language accepted by this automaton is the set 
of all strings of 0’s and 1’s that contain at least one 

0 followed (not necessarily immediately) by at least 
one 1. 

1*00*1(0 | 1)* 


The language accepted by this automaton is the set 
of all strings of 0’s and 1’s that end in 00. 
(0 | 1)*00 


The language accepted by this automaton is the set 
of all strings of x’s and y’s of length at least two that 
consist either entirely of x’s or entirely of y’s. 

xxx" | yyy" 


The language accepted by this automaton is the 

set of all strings of 0’s and 1’s with the following 
property: If n is the number of 1’s in the string, then 
nmod 4 = 0 orn mod 4 = 2. This is equivalent to 
saying that n is even. 

0* | (0*10"10*)* 


The language accepted by this automaton is the set 
of all strings of 0’s and 1’s that end in 1. 
(0| 1)*1 


Call the automaton being constructed A. Accep- 
tance of a string by A depends on the values of 


12.2. SOLUTIONS AND HINTS TO SELECTED EXERCISES A-138 


three consecutive inputs. Thus A requires at least 
four states: 


So: initial state 
sı: State indicating that the last input character was a 1 


s2: State indicating that the last two input characters 
were l’s 


53: State indicating that the last three input charac- 
ters were 1’s, the acceptance state 


If a 0 is input to A when it is in state sg, no progress 
is made toward achieving a string of three consecu- 
tive ls. Hence A should remain in state sọ. If a 1 is 
input to A when it is in state so, it goes to state s1, 
which indicates that the last input character of the 
string is a 1. From state s4, A goes to state sz if a 1 is 
input. This indicates that the last two characters of 
the string are 1’s. But if a 0 is input, A should return 
to sọ because the wait for a string of three consecu- 
tive 1’s must start over again. When A is in state s2 
and a 1 is input, then a string of three consecutive 
1’s is achieved, so A should go to state s3. If a 0 is 
input when A is in state sj, then progress toward 
accumulating a sequence of three consecutive 1’s is 
lost, so A should return to sọ. When A is in a state s3 
and a 1 is input, then the final three symbols of the 
input string are 1’s, and so A should stay in state s3. 
If a 0 is input when A is in state s3, then A should 
return to state sq to await the input of more 1’s. Thus 
the transition diagram is as follows: 


b. (0|1)*111 


21. a. 0 
O M 
= Ss 
0 $ 
0 
Qj 
b. 010| 1)" 


22. Hint: Use five states: sọ (the initial state), sı (the state 
indicating that the previous input symbol was an a), s2 
(the state indicating that the previous input symbol was 
a b), s3 (the state indicating that the previous two input 
symbols were a’s), and s4 (the state indicating that the 
previous two input symbols were b’s). 
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25. a. 39. Let P denote a list of all letters of a lowercase alphabet 
except p, R denote a list of all the letters of a lowercase 
alphabet except r, and E denote a list of all the letters of 
a lowercase alphabet except e. 

[a-z] 
[a-z] 
e 
b. (0| 1)*10 © (+) jo 
26. a. 


a a {) a,b 
5 b b b 
9 42. Let Y denote a list of all the consonants in a lowercase 


alphabet. 


b. a*ba*ba® ; [a-z] 
S — 
9 


b. (0|1)*0100| 1)" 


PO 


51. Hint: This proof is virtually identical to that of Exam- 
ple 12.2.8. Just take p and q in that proof so that p > q. 
From the fact that A accepts a”b”, you can deduce that 
A accepts afb”. Since p > q, this string is not in L. 


53. Hint: Suppose the automaton A has N states. Choose an 
integer m such that (m + 1) — m? > N. Consider strings 
of a’s of lengths between m? and (m+ 1). 
Since there are more strings than states, at least two 
strings must send A to the same state s;: 


(m + i 


2 T Z 
after both of these 
inputs, A is in state s; 


It follows (by removing the a’s shown in color) that the 
automaton must accept a string of the form a‘, where 
m <k<(m+1). 


SECTION 12.3 


1. a. O-equivalence classes: {s9, 51, 53, S4}, {52, S5} 
1-equivalence classes: {59, 53}, {8}, Sa}, {S, 55} 


2-equivalence classes: {50, 53}, {51, S4} {52, 55} 
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Except for the labeling of the states, the transition dia- 
grams for A and A’ are identical. Hence A and A’ ac- 
cept the same language, and so, by Theorem 12.3.3, A 
and A’ also accept the same language. Thus A and A’ are 
equivalent automata. 


9. For A: 


0-equivalence classes: {51, S2, 54, 55}, {89, 83} 


4. a. 0-equivalence classes: {s9, 51, 52}, {53, S4, 55} 
1-equivalence classes: {s9, 51, 52}, {53, S5}, {S4} 


2-equivalence classes: {89, 55}, {s1}, {83, 55}, {84} -equivalence classes: {s1, s2}, {s4 $5), {s0, $3) 


3-equivalence classes: {S0, S2}, {s1} {S3; 55}, {S4} 2-equivalence classes: {s1}, {s2}, {54 $s}, {50 s3} 
3-equivalence classes: {s1}, {s2}, {54, S5}, {S0 53} 


Therefore, the states of A are the 3-equivalence classes 
of A. 


For A’: 
0-equivalence classes: {s4, 53, 54, S5}, {s9, S1} 
1-equivalence classes: {s4, 53, 54, s5}, {s9, S1} 


Therefore, the states of A’ are the 1-equivalence classes 


of A’. 

6. a. Hint: The 3-equivalence classes are {so}, {s1}, {sa}, According to the text, two automata are equivalent if, 
{53}, {sa}, {s5}, and {s¢}. and only if, their quotient automata are isomorphic, pro- 
7, Yes. ForA vided inaccessible states have first been removed. Now A 
Si SEO and A’ have no inaccessible states, and A has four states, 
0-equivalence classes: {s9, 52}, {51, 53} whereas A’ has only two states. Therefore, A and A’ are 

1-equivalence classes: {so}, {sa}, {51, $3} not equivalent. 
2-equivalence classes: {so}, {52}, {51,53} This result can also be obtained by noting, for example, 


a . = that the string 11 is accepted by A’ but not by A. 
Transition diagram for A: 


11. Partial answer: Suppose A is a finite-state automaton 

@» with set of states S and relation R+ of *-equivalence of 
states. [To show that R. is an equivalence relation, we 

must show that R is reflexive, symmetric, and transitive.] 


Proof that R+ is symmetric: 
[We must show that for all states s and t, if s Rs t then t Rx s.] 


Suppose that s and f are any states of A such that s R. t. 
[We must show that t R+ s.] Since s Rs t, then for every 


(1) D input string w, 


bes w) is an | ae w) is an | 
> 3 


For A’: accepting state accepting state 
: IE <a bt E] 
O-equivalence classes: {sp, s1, 33}, {95} where N* is the eventual-state function on A. It follows 
: Pas i t 1 , 4 n 
l-equivalence classes: {s0, 89}, {si}, {93} from the symmetry of the + relation that for every input 
2-equivalence classes: {s0, s2}, {si}, {93} string w, 
Transition diagram for A’: N*(t, w) is an N*(s, w) is an 
accepting state accepting state 


Hence t R. s [as was to be shown], and so Rs 
is symmetric. 


12. The proof is identical to the proof of property (12.3.1) 
given in the solution to exercise 11 provided every oc- 
currence of “for each input string w” is replaced by “for 
each input string w of length less than or equal to k.” 
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A-141 


13. 


15. 


Proof: By property (12.3.2), for each integer k = 0, 
k-equivalence is an equivalence relation. Now by 
Theorem 10.3.4, the distinct equivalence classes of 

an equivalence relation form a partition of the set on 
which the relation is defined. In this case, the relation is 
defined on the set of all states of the automaton. So the 
k-equivalence classes form a partition of the set of all 
states of the automaton. 


Hint 1: Suppose C; is a particular but arbitrarily chosen 
k-equivalence class. You must show that there is a 
(k — 1)-equivalence class C,_, such that C C Cy_1. 


Hint 2: If s is any element in C}, then s is a state of 
the automaton. Now the (k—1)-equivalence classes 
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17. 


19. 


APPENDIXB SOLUTIONS AND HINTS TO SELECTED EXERCISES 


partition the set of all states of the automaton into a union 
of mutually disjoint subsets, so s E C,_, for some 
(k — 1)-equivalence class C,_. 

Hint 3: To show that Cy, C C-1, you must show that for 
any state t, if t E Cp, then t © Cy_}. 


Hint: If m < k, then every input string of length less 
than or equal to m has length less than or equal to k. 


Hint: Suppose two states s and ¢ are equivalent. 
You must show that for any input symbol m, the 
next-states N(s, m) and N(t, m) are equivalent. To do 
this, use the definition of equivalence and the fact 
that for any string w’, input symbol m, and state s, 
N*(N(s, m), w’) = N*(s, mw’). 


INDEX 


Abduction, 156 
Absolute value 
function, 765 
triangle inequality and, 207-208 
Absorption laws, 395, 417 
Accepting state of machine (automaton), 842, 843, 845-847, 
852-853, 855 
Ackermann, Wilhelm, 372 
Ackermann function, 372-373 
Acquaintance graph, 237 
Aczel, Amir D., 180n 
Addition, A-1 
in binary notation, 96 
computer, circuits for, 97-99 
integers, in two’s complement form, 102-103 
Addition rule, 589-599 
Additive identity, of string length, 235 
Additive inverse, 1, 4, A—1 
Additive property, of string length, 370 
Adjacency matrix, 699-700, 701 
Adjacent edges, 25, 679, 682, 688 
Adjacent to itself, 25 
Adjacent vertices, 25, 679, 688 
Adleman, Leonard, 525-526 
Airline route scheduling, 742-747, 749 
Aldous, David, 566 
Algebra, Boolean, 414—418 
Algebraic expressions 
divisibility and, 191-192 
parenthesis structures in, 365-366 
representation of, 734-735 
Algebraic proof 
of binomial theorem, 648—650 
of Pascal’s formula, 645 
of set identities, 410—412 
Algol language, 722 
Algorithmic language 
assignment statements in, 244 
conditional statements in, 244-245 
if-then-else statements and, 245 
if-then statements and, 245 
iterative statements in, 245-246 
for-next loops in, 245-246 
as pseudocode, 244 
variables and expressions in, 244 
while loops in, 245-247 


Algorithm(s) 
binary search, 813-825 
to check whether one set is subset of another, 386—387 
to convert from base 10 to base 2 using repeated division by 
2, 271-273 
correctness of, 314-323 
definition of, 244 
Dijkstra’s shortest path, 751-757 
division, 248-250, 319-321 
efficiency analysis of, 760—827 
elementary operations, 788-789 
Euclidean, 250—254, 321-323, 531-533 
everyday examples of, 244 
insertion sort, 793-796 
intractable, 824 
Kruskal’s, 745-748 
loop invariants and, 316-319 
merge sort, 820—824 
with nested loop, 790-791 
notation for, 248 
orders of, 789-791 
origin of word, 248 
polynomial-time, 824 
pre-conditions and post-conditions, 315-316 
Prim’s, 748-751 
selection sort, 782 
sequential search, 787-788 
shortest path, 742-757 
space efficiency of, 788, 825 
time efficiency of, 788-789, 825 
tractable, 824 
Algorithm segments, computing order of, 790 
Al-Kashi, Ghiyath al-Din Jamshid, 478 
Al-Khowéarizmi, Abu Ja’far Mohammed ibn Misa, 248 
Alphabet 
Caesar cipher and, 524-525 
input, 843 
regular expressions over, 832 
translating into strings, 483 
Alternating sequences, 260 
Ambiguous language, 136-137 
Ambiguous premises, 72 
American Standard Code for Information Interchange 
(ASCII), 483 
Analytical Engine, 244, 769 
Ancestor, 733 
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AND-gate, 82 
multiple-input, 86—87 
And statements, 40, 41 
negation of, 45—48 
Annotated next-state table, 843, 844-845 
Annual percentage yield (APY), 334 
Antecedent, 54 
Antisymmetry, 546-548 
Any, misuse of, 177 
Appel, Kenneth, 34 
APY. See Annual percentage yield 
Arc, 681 
Archimedean principle, 143 
Archimedean property for rational numbers, 313 
Archimedes of Syracuse, 143 
Area code string, 857 
Arguing from examples, 176 
Argument form 
creating additional, 154—155 
definition of, 66 
invalid, 67, 72 
rules of inference, 69—72 
syllogism, 68—69 
testing for validity, 67 
valid, 66-67, 150 
Argument(s) 
converse/inverse errors and, 155-156 
indirect, when to use, 231—232 
indirect, with contradiction and contraposition, 218—225 
logical form of, 37-38 
with “no,” 153-154 
with quantified statements, 146-156 
sound, 74, 150 
Tarski’s World, evaluating, 154-155 
unsound, 74 
Arguments, valid and invalid 
contradictions and valid, 74-76 
definition of, 66 
determining, 67 
diagrams for testing validity, 151-155 
fallacies and, 72-74 
invalid with true premise/conclusion, 74 
modus ponens/modus tollens and, 68—69 
proving validity, with quantified statements, 150 
rules of inference and, 69—72 
valid with false premise/conclusion, 74 
Aristotle, 37, 229 
Arithmetic 
fundamental theorem of, 195—197 
modular, 528-531 
sequences, 342-343 
Array(s) 
input, 817 
one-dimensional, 270, 569-570 
search algorithms for, 813—825 
Arrow diagrams 
of functions, 426—427, 455 
of relations, 18 


Artificial intelligence, 29, 30, 37, 141, 156, 398 
The Art of Computer Programming (Knuth), 787 
ASCII (American Standard Code for Information 
Interchange), 483 
Assertions, 37, 315-316 
Assignment statements, 244 
Associative laws, 394, 416, A-1 
deriving generalized, 412 
matrix multiplication and, 705-706 
Assumptions, 66 
At least, 623 
At most, 623 
Automaton/automata 
See also Finite-state automata 
equivalent, 859, 866-868 
quotient, 859, 863-866 
Average-case orders 
for insertion sort, 795-796 
for sequential search, 787-788 
Axiom(s), 278 
of extension, 7 
field, A-1, A-2 
least upper bound, A-3 
order, A-2 
power set, 386 
probability, 655—660 


Babbage, Charles, 244, 769 
Bachmann, Paul, 769 
Backus, John, 722 
Backus-Naur notation, 722, 829 
Backward chaining, 398 
Barber puzzle, 419-420 
Barwise, Jon, 117—118 
Base 2 notation. See Binary notation 
Base 10 notation. See Decimal notation 
Base 16 notation. See Hexadecimal notation 
Basis property, 317 
Basis step, 278, 410 
Bayes, Thomas, 666 
Bayes’ theorem, 666—668 
Beal, Andrew, 232 
Beal conjecture, 232 
Best-case orders 
See also Average-case orders; Worst-case orders 
for sequential search, 787-788 
Biconditional 
definition of, 60 
in informal language, 62—63 
only if and, 59—61 
truth table of, 60 
Big-omega notation, 769-783 
Big-O notation, 769-783 
Big-theta notation, 769-783 
Bijection, 451—454 
Binary integer representations, 307-308 
Binary notation, 93—94 
addition/subtraction in, 96 
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conversions to and from, 95, 105—106, 271-273 
for integers, 94—95 
number of bits needed to represent integer in, 803-804 
Binary relations, 487, 491 
Binary search algorithm, 214, 813-825 
efficiency of, 816-820 
as logarithmic, 820 
merge sort and, 820-824 
tracing, 816 
Binary search trees, 739-741 
Binary trees, 734-739 
existence of, determining, 736-738 
full, 734 
to represent algebraic expression, 734—735 
theorems about, 735-738 
Binomial coefficients, 270 
Binomial probabilities, 672 
Binomials, 646 
Binomial theorem, 646—652 
deriving combinatorial identity from, 651—652 
proof of, 648—651 
simplifying sum using, 652 
substituting into, 651 
Bioinformatics, 836 
Bipartite graphs, 243 
complete, 241 
Birthday problem, 602 
Birthdays (example), 605-606 
Bits, 81 
in binary representation, number of, 803—804 
Bit strings, 13, 627—628 
Black boxes, 81-83 
Blockchain technology, 445, 542 
Boole, George, 37, 415 
Boolean algebra 
definition and axioms for, 416 
double complement law, 417—418 
idempotent law, 418 
properties of, 416—417 
sets and, 414—418 
Boolean expressions 
circuits and, 84-87 
recursive definition of, 365 
rewriting using Sheffer stroke, 90 
Boolean functions, 432—433, 828 
Boolean variables, 84 
Bound variables, scope and, 115-116 
Brain research, 828 
Branch vertex, 725 
Bruner, Jerome S., 604 
But, 39-40 


C# language, 201 

C++ language, 244 

Caesar, Julius, 524 

Caesar cipher, 524-525 

Calculators, computing logarithms with base 2 on, 451 
Calculus 


INDEX 


predicate, 108 
propositional, 108 
statement, 108 
Campanus of Novara, 277 
Cancellation theorem for modular congruence, 539-540 
Cantor, Georg, 6-7, 10, 377, 478, 480 
Cantor diagonalization process, 477—482 
Cardinality 
computability and, 473—484 
countable sets and, 475—477 
infinite sets and, 474—475 
properties of, 473—474 
of set of all real numbers, 481—482 
of set of functions, 483—484 
uncountable sets and, 477—481 
Cardinal numbers, 473 
Cards 
poker hand problems, 626—627 
probabilities for deck of, 566-567 
Carroll, Lewis, 66, 244, 505, 617 
Carry, 97 
Cartesian plane, 13, 760 
Cartesian products, 10-13 
functions defined on, 430 
number of elements in, 576-577 
Catalan, Eugene, 232, 327 
Catalan numbers, 327 
Cayley, Arthur, 723 
Ceiling, 211-216 
Central processing unit (CPU), 80, 842-843 
Chaining, backward and forward, 398 
Character classes, 836 
Characteristic equation, of recurrence relation, 354-356 
Characters 
of strings, 13 
as strings, 366 
Child, 733, 734 
Chomsky, Noam, 722, 828, 829, 854 
Church, Alonzo, 828 
Church-Turing thesis, 828 
Ciphertext, 524 
Circle relation, 17 
Circuit-free graphs, 720 
Circuits, 677—692, 730 
background on, 79-81 
black boxes and gates, 81-83 
Boolean expressions and, 84—87 
combinational, 82, 88—89, 841 
for computer addition, 97—99 
definition of, 679-680 
digital logic, 79-90 
equivalent, 89 
Euler, 684-689 
full-adder, 97—99 
graphs and, 677-692 
half-adder, 97, 99 
Hamiltonian, 689-692 
input/output table for, 83-84, 87-88 
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Circuits (Continued ) 
sequential, 83, 841 
simple, 679—680 
Circular reasoning, 72 
C language, 201, 244, 829n 
Class(es) 
character, 836 
equivalence, 3, 510-520 
NP, 824 
P, 824 
Clay Mathematics Institute, 180 
Closed form, 281 
Closed walk, 679 
Code generators, 829 
Coding theory, 394, 431 
Co-domains, 16, 425 
Coefficients 
binomial, 270 
polynomial functions with negative, 774-777 
Coin toss, 668—672 
Collatz, Luther, 374 
Collision resolution, 444 
Colmerauer, A., 141 
Columns, multiplying, 703-704 
Combinational circuits, 82, 841 
simplifying, 88—89 
Combinations, 270, 617—630 
3-combinations, 617—618 
r-, with repetition allowed, 634—639 
relation between permutations and, 618—620 
unordered selections, 618 
Combinatorial proof 
of binomial theorem, 650—651 
of Pascal’s formula, 645—646 
Common logarithms, 451 
Commutative laws, 394, 416, A-1 
Comparable elements, 553 
Compilers, 829, 836 
Complement laws, 394, 416 
See also Double complement laws 
Complements 
of 0 and 1, 417 
in Boolean algebra, 417—418 
of event, probability of, 656 
of sets, 381, 382 
of universal/null sets, 395 
Complete bipartite graphs, 241, 626 
Complete enumeration, 618 
Complete graphs, 240-241, 626 
Composite numbers, 163—164 
Composition of functions, 461—470 
finite sets defining, 463 
formulas defining, 462 
with identity function, 463—464 
with inverse functions, 464—466 
one-to-one functions and, 466—468 


Compound interest, 333-334 


with compounding several times a year, 334-335 


Compound statements, 39-40 
evaluating truth of, 42—43 
logic of, 37-107 
truth tables for, 42—43 
truth values, 40—42 
Computer addition, circuits for, 97-99 
Computer languages 
Algol, 722 
Backus-Naur notation for, 722, 829 
C, 201, 244, 829n 
C#, 201 
C+4, 244 
formal languages, 829-832 
high-level, 828-829 
identifiers, 509—510 
Java, 201, 244, 836 
-NET, 836 
Prolog, 141-142 
Python, 201, 244, 836 
subroutines in, 248 
variables in, 244 
Computer programming 
correctness and, 314-315 
countability of set of, 482—483 
sequences in, 270-271 
Concatenation, 20 
of strings, 370-372 
of u and c, 366 
Conclusions, 37, 53, 54, 62, 66 
false, 74 
invalid arguments with true, 74 
jumping to, 72, 176 
universal modus ponens for drawing, 148 
universal modus tollens for drawing, 150 
Conditional probability, 662—665 
Bayes’ theorem and, 666-668 
computing, 663 
representing with tree diagram, 663—665 
Conditional statements, 2, 53—65 
in algorithmic language, 244-245 
contrapositive of, 57-58 
converse and inverse of, 58-59 
definition of, 53-54 
with false hypotheses, 54-55 
if-then as or, 56 
interpreting as biconditionals, 62—63 
logical equivalences and, 55-56 
necessary and sufficient conditions, 61—62 
negation of, 56-57 
only if and biconditional, 59—61 
proof for, 402 
universal, 3, 113-114, 124-125, 126-128 
vacuously true, 54 
Conditions, 245 


onto functions and, 468—470 Congruence, cancellation theorem for modular, 539-540 
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Congruence modulo 2 relation, 488 
Congruence modulo 3 relation, 500-501, 516-517 
Congruence modulo n, 516-518, 526-528 
Congruence relations, 516-518 
Congruences, evaluating, 518 
Conjecture, 179-180 
Conjunctions, 41, 76 
Connected components, 683 
matrices and, 702—703 
Connected graphs, 682—683 
Connectedness, 682—683 
Consecutive integers, 203 
Consequent, 54 
Constant function, 21 
Constructive proofs of existence, 164 
Context-free languages, 828-829, 854 
Contradiction 
definition of, 48 
indirect argument by, 218-225 
logical equivalence and, 49 
method of proof by, 218-221 
proof by, 223-224 
rule, 74-75, 76 
valid arguments and, 74-76 
Contradictory statements, 48 
Contraposition 
indirect argument by, 218-225 
method of proof by, 222-224 
Contrapositive 
of conditional statement, 57—58 
of universal conditional statements, 127—128 
writing, 58 
Converse 
of conditional statement, 58—59 
of universal conditional statements, 127—128 
Converse error, 72, 73, 153, 155-156 
Corollary, 187 
Correctness of post-condition, 317 
Countable sets, 475—477 
Counterexamples, 110 
direct proof and I, 161-171 
direct proof and II (writing advice), 173-180 
direct proof and III (rational numbers), 183—187 
direct proof and IV (divisibility), 190-197 
direct proof and V (division into cases and the quotient- 
remainder theorem), 200-208 
direct proof and VI (floor and ceiling), 211-216 
disproving universal statements by, 164—165 
divisibility and, 194-195 
for set identity, 407—409 
Counting, 564—676 
advice about, 629-630 
double, 629-630 
elements in Cartesian product, 576-577 
elements in intersection, 597-599 
elements of disjoint sets, 589-599 
elements of general union, 596-597 


INDEX 


elements of list, 568—570 


I-5 


input/output tables for circuit with two input signals, 577-578 


integral solutions of equation, 638—639 
Internet addresses, 594—595 
iterations of loop, 637—638 
iterations of nested loop, 578 
number of integers divisible by 5, 589-590 
Pascal’s formula and, 642—646 
passwords, 592-593 
permutations, 580-584 
PINs, 575-576 
PINs, with repeated symbols, 590-591 
possibility trees and multiplication rule, 573-584 
probability and, 564-570 
Python identifiers, 593-594 
subsets of a set, 617—630 
triples, 637 
walks of length N, 707-710 
Critical Path Method (CPM), 558-560 
Cross products, 519 
Cryptographic hash functions, 445 
Cryptography, 524-526 
Caesar cipher, 524-525 
definition of, 524 
message authentication, 542-543 
public-key, 525-526 
RSA cipher, 525-526, 536-539 
RSA cryptography, 540-543 
uses of, 524 


Databases, relational, 491—492 
Data type, 244 
Dates, regular expression for, 838 
Da Vinci, Leonardo, 1 
Davis, Philip J., 211 
Day of week, computing, 202 
Debate, 218 
Decimal expansions, of fractions, 608—610 
Decimal notation, 93 
conversion of binary notation to and from, 95, 271-273 
converting from hexadecimal notation to, 104—105 
Decimals, repeating, 184 
Decision trees, 721-722 
Decoding functions, 431 
Decreasing functions, 765-766 
Decryption, 524-525 
Decryption key, 537 
Dedekind, Richard, 520 
Deductive reasoning, 275 
rules of inference and, 69-72 
universal instantiation and, 146—147 
Degree of vertex, 31-32 
De Grey, Aubrey, 34 
De Morgan, Augustus, 37, 46, 277 
De Morgan’s laws, 45—46, 125, 395 
applying, 46—47 
Boolean algebra and, 416 
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De Morgan’s laws (Continued ) Disproof 
cautionary examples of, 47—48 of alleged property of floor, 213 
definition of, 46 of alleged set property, 407—409 
inequalities and, 47 by counterexample, 164—165 
for sets, proof of, 398—400 of existential statements, 179 
Descartes, René, 131, 760 Disquisitiones Arithmeticae (Gauss), 518 
Descendant, 733 Distinct equivalence classes, 511-517 
Diaconis, Persi, 566 Distinct-roots theorem, 353-360 
Diagrams Distributive laws, 394, 416, A-1 
See also Arrow diagrams general, 402—403 
Hasse, 550-553 proof of, 395-398 
testing for validity using, 151-155 div, 201-202 
transition, 843-844 computing, 216 
tree, 663-665 Divide-and-conquer strategy, 813—814 
Dice rolls, 567 “Divides” relations 
Difference rule, 590-595 on set of positive integers, 548-549 
Differences of sets, 381, 382 testing for antisymmetry of, 547-548 
Digital computers, 842-843 Divisibility, 190-197 
Digital logic circuits, 79-90 algebraic expressions and, 191-192 
background on, 79-81 checking for nondivisibility, 192 
black boxes and gates, 81-83 checking proposed properties, 195 
Boolean expressions and, 84—87 concept of, 190 
combinational, simplifying, 88—89 counterexamples and, 194-195 
equivalence classes of, 515-516 by a prime, 302-303 
equivalence of, 509 by prime numbers, 194 
equivalent, 89 prime numbers and, 192 
input/output table for, 83-84, 87-88 properties of, 191 
NAND and NOR gates, 89-90 proving, with mathematical induction, 291-293 
Digraphs, 29 proving properties of, 192-194 
Dijkstra, Edsger W., 314, 315, 377, 752 transitivity of, 192-194 
Dijkstra’s shortest path algorithm, 751-757 unique factorization of integers theorem and, 195-197 
Dirac, P. A. M., 495 Division algorithm, 248-250 
Directed edges, 29 correctness of, 319-321 
Directed graphs, 29, 495 Division into cases, proof by, 204-205 
definition of, 29 Divisors 
matrices and, 699-701 greatest common, 250-252, 253, 532-533 
of partial order relation, 552-553 of one, 191 
of relation, 490—491 positive, of positive integer, 191 
Direct proof of zero, 190-191 
counterexample I and, 161-171 Domains, 16, 425 
counterexample II and (writing advice), 173—180 Dot product, 703 
counterexample HI and (rational numbers), 183—187 Double complement laws, 394, 416 
counterexample IV and (divisibility), 190-197 proof of, 417—418 
counterexample V and (division into cases and the quotient- Double counting, 629-630 
remainder theorem), 200-208 Double negative property, 44—45 
counterexample VI and (floor and ceiling), 211-216 Dual identity, 417 
method of, 167 Duality principle, of Boolean algebra, 417 
of theorem, 168—170 Dummy variables, 266 
Dirichlet, J. P. G. L., 604 in loop, 270-271 
Dirichlet, Lejeune, 426 
Dirichlet box principle. See Pigeonhole principle EBCDIC (Extended Binary-Coded Decimal Interchange Code), 
Discovery, 161 483 
Discrete mathematics, 8 Edge-endpoint functions, 25 
Disjoint events, 668—669 Edges, 24-25, 29, 681 
Disjoint sets, 384-385 Edinburgh Prolog, 141n 
counting elements of, 589-599 Edison, Thomas Alva, 352 
Disjunction, 41—42 8-bit two’s complements, 99—103 
Disjunctive normal form, 88 Einstein, Albert, 589 
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Electrical engineering, 828 
Electronic technology, 80—81 
Element arguments, 391 
Elementary operations, 788-789 
Elements 
in Cartesian products, 576-577 
comparable, 553 
of disjoint sets, counting, 589-590 
of elements, 628-629 
of general union, counting, 596-597 
greatest, 554-555 
in intersection, counting, 597-599 
least, 308-309, 554-555 
of list, counting, 568—570 
maximal, 554-555 
minimal, 554-555 
noncomparable, 553 
ordered selection of, 618 
unordered selection of, 618 
Elements of Geometry (Euclid), 229, 230 
Elimination, 70, 76 
Elkies, Noam, 180 
Ellipsis, 7 
Emacs, 836 
Empty sets, 384, 401—403 
proving set is, 402 
as subset of every set, 401—402 
uniqueness of, 401 
Encoding functions, 431 
Encryption, 524-525 
See also Cryptography 
Endpoints, 25, 29 
Enumeration, complete, 618 
Equality 
of functions, 428 
properties of, 498—499 
proving, 284-285 
set, 379-380 
test for function, 428 
Equally likely probability formula, 566 
Equations, number of integral solutions of, 638—639 
Equivalence, modular, 526-527 
Equivalence classes 
of congruence modulo 3, 516-517 
definition of, 510 
of digital logic circuits, 515-516 
of equivalence relation, 510-516 
of identifiers, 512 
of identity relation, 512-515 
rational numbers as, 519-520 
of relation given set of ordered pairs, 511 
of relation on set of subsets, 511-512 
star *-equivalence classes, 863 
Equivalence of states, 860 
Equivalence relations, 505-520 
congruence modulo n, 527-528 
definition of, 508—510 
digital logic circuits and, 509 


INDEX 


equivalence classes of, 510-516 
graph isomorphisms as, 715 
partition inducing, 506-508, 514-515 
on set of subsets, 508-509 
Equivalent automata, 866-868 
Equivalent circuits, 89 
Equivalent forms, of universal and existential statements, 
114-115 
Equivalent states, 859-863 
Eratosthenes, 227 
Escape characters, 833 
Etchemendy, John, 117-118 
Euclid, 196, 229, 230 
Euclidean algorithm, 250-254 
correctness of, 321-323 
extending, 531-533 
Euclid’s lemma, 539-540 
Euler, Leonhard, 180, 677 
Euler circuits, 684-689 
definition of, 684 
finding, 687—688 
vs. Hamiltonian circuits, 690 
showing graph does not have, 685-687 
Euler trails, 688—689 
Even integers, 161-163 
Goldbach’s conjecture about, 180 
properties of, 186-187 
Even parity, 835 
Event(s), 566 
disjoint, 668—669 
of events, 668—672 
independent, 668—672 
probability of complement of, 591, 656 
probability of general union of two, 657—658 
Eventual falsity of guard, 317 
Eventual-state function, 846—847 
Examples, arguing from, 176 
Exclusive or, 42—43 
Exhaustion, method of, 111, 166 
Existence 
constructive proofs of, 164 
of graphs, 236-237 
of identity elements, A—1 
nonconstructive proof of, 164 
Existential instantiation, 148, 168 
Existential quantifiers, 111-112 
Existential statements, 2 
definition of, 111 
disproving, 179 
equivalent forms of, 114—115 
proving, 164 
truth and falsity of, 112 
universal, 3—5 
Existential universal statements 
definition of, 4 
rewriting, 5 


Expanded form, changing between summation notation and, 


261-262 
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Expected value, 658—660, 670 pigeonhole principle and, 612-614 
Expert systems, 156 properties of relations on, 496—498 
Explicit formula Finite-state automata, 836, 841-855 
for Fibonacci sequence, 358-360 definition of, 843—845 
to fit given initial term, 260-261 designing, 847-849 
for geometric sequence, 343 digital computers, 842-843 
incorrect, 348-349 equivalence states and, 859-863 
for method of iteration, 340-342 eventual-state function and, 846—847 
for sequences, 259-261 given by annotated next-state table, 844—845 
simplifying solutions obtained by iteration, 345-347 given by transition diagram, 843-844 
for Tower of Hanoi, 345-346 language accepted by, 845-846 
Exponential functions nondeterministic, 853 
with base b, 449—450 regular expressions and, 851-853 
graphs of, 800-801 simplifying, 858—868 
one-to-oneness of, 450—451 software simulating, 849-851 
relations between logarithmic and, 449—451 First-order logic, language of, 141 
Exponential notation, 93 Floor, 211-216 
Exponential orders, 806-811, 824 Floor function, 304-305, 762-763, 791-793 
Exponents, law of, 449 Floyd, Robert W., 315 
Expressions Forests, 720 
See also Regular expressions Formal language(s), 828, 829-832 
in algorithmic language, 244 examples of, 830 
numerical, 341 vs. informal, 112—113 
Extended Binary-Coded Decimal Interchange Code translating informal to, 134—136 
(EBCDIC), 483 Formal logical notation, 139-141 
Extended Euclidean algorithm, 531-533 Formulas 
Extension, axiom of, 7 See also Explicit formula 
checking correctness of, using mathematical induction, 347-348 
Factorial notation, 268-270 composition of functions defined by, 462 
Factorials, computing with, 269-271 computational, 620 
Factorization, unique factorization of integers theorem, deciding on, 639 
195-197 Pascal’s formula, 642—646 
Fallacies, 72-74 for probability of complement of event, 591 
Fallacy of affirming the consequent, 73 proving, 275-286 
Fallacy of denying the antecedent, 73-74 summation of first n integers, 278—282 
False hypotheses, conditional statements with, 54-55 For-next loops, 245-246, 793-794 
False negatives, 667 dummy variable in, 270-271 
False positives, 667 trace table for, 247—248 
Fantasy rule for mathematical proof, 393 Forster, E. M., 79 
Fermat, Pierre de, 179-180, 190, 277, 568 Forward chaining, 398 
Fermat primes, 232 Four-color theorem, 34 
Fermat’s last theorem, 232 Four-function calculator, computing div and mod with, 
Fermat’s little theorem, 540—541 201-202 
Fibonacci (Leonardo of Pisa), 332 Fractional part, 213 
Fibonacci numbers, 332-333 Fractions, decimal expansions of, 608—610 
Fibonacci sequence, 358-360 Frege, F. L. G., 520 
Field axioms, A-1, A-2 Frege, Gottlob, 110 
Final term, 259 Friedl, Jeffrey E. F., 851n 
Finite relations Frye, Roger, 180 
inverse of, 489 Full-adders, 97—99 
testing for antisymmetry of, 547 Full binary tree, 734 
Finite sequences, 270 Fuller, R. Buckminster, 713 
Finite sets, 473 Function notation, 19 
composition of functions defined by, 463 Functions, 425—486 
functions and relations on, 19-20 See also Composition of functions; Exponential functions; 
one-to-one functions defined on, 441 Finite-state automata; Logarithmic functions 
onto functions defined on, 446—447 absolute value, 765 
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Ackermann, 372-373 

acting on sets, 434—435 

arrow diagrams of, 426—427 
Boolean, 432—433, 828 

cardinality of, 483-484 

Cartesian product defining, 430 
composition of, 461—470 

constant, 21 

defined on sets of integers, graphing, 763 
definition of, 425—426 

domain, 425 

edge-endpoint, 25 
encoding/decoding, 431 

equality of, 21, 428 

eventual-state, 846—847 

examples of, 429—432 

exponential, 449—451, 800-801 

on finite sets, 19-20 

floor, 762-763, 791-793 

formulas defining, 21 

general sets defining, 425—435 
Hamming distance, 431—432 

hash, 443-445 

identity, 463—464 

identity, on X, 429 

increasing and decreasing, 765-766 
inverse, 440, 454—457, 464—466 
language of, 18—20 

logarithmic, 430—431, 449—451, 800, 801-803 
machines, 20—21 

McCarthy’s 91, 372 

multiple of, graph of, 764 
next-state, 843 

noncomputable, 483 

nonfunctions and, 427 

not well-defined, 433—434 
one-to-one, 439—445, 613—614 
one-to-one correspondences and, 451—454 
one-way, 445 

onto, 445-449, 468—470, 613—614 
polynomial, 773-783 

power, 761-762, 773 

power sets defining, 429—430 

from power set to set of strings, 452 
probability, 655 

propositional, 108 

range, 426 

real-valued, of real variable, 760-766 
recursive, 372-374 

sequences and, 429 

squaring, 21 

string-reversing, 452—453 
successor, 21 

test for equality, 428 

of two variables, 453—454 

with union, 434—435 

well-defined, 433—434 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


INDEX 


Fundamental theorem of arithmetic, 195-197 
Galilei, Galileo, 473 
Gambling, 659-660 
Gates, 81-83 
Gauss, Carl Friedrich, 173, 196, 201, 281, 518 
gcd. See Greatest common divisor 
General formula, for sequence, 259 
Generalizations, 69-70, 76 
from the generic particular, 166-167, 184-185 
universal, 166 
Generalized associative law, deriving, 412 
Generalized distributive law, 402—403 
General recursive definitions, 364—372 
Generic particular, generalizing from, 166-167, 184—185 
Geometric sequence, 344-345 
explicit formula for, 343 
sum of, 282-284, 285-286 
Germain, Marie-Sophie, 232 
Gibbs, Josiah Willard, 15 
Gilbert, William S., 642 
Gleick, James, 180 
Gödel, Kurt, 420 
Goldbach, Christian, 180 
Goldbach conjecture, 180 
Golden ratio, 360n, 364 
Golomb, Solomon, 296 
Grammar rules, 722-723 
Grammars, 828 
Graph(s), 677—720 
See also Directed graphs; Tree(s) 
of absolute value function, 765 
acquaintance, 237 
bipartite, 243 
circuit-free, 720 
circuits and, 677—692, 730 
coloring maps with, 32-34 
complete, 240-241, 626 
complete bipartite, 241, 626 
connected, 682—683 
definition of, 25, 679-680 
degree of vertices, 31-32 
directed, 29, 699-701 
directed, of relation, 490—491, 495 
disconnected, 682-683 
edges, 24-25 
examples of, 28-34 
existence of, determining, 236-237 
of exponential functions, 800-801 
of floor function, 762-763 
forest, 720 
of function f, 760-761 
of functions defined on sets of integers, 763 
isomorphisms of, 713-718 
language of, 24-34 
of logarithmic functions, 801-803 
matrix representations of, 698-710 
of multiple of a function, 764 
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Graph(s) (Continued ) 
paths in, 677-692 
pictorial representation of, 26-28 
of power functions, 761—762 
problem-solving using, 30-31 
of real-valued functions of a real variable, 760—766 
to represent knowledge, 29-30 
to represent World Wide Web, 28-29 
simple, 239-240, 717-718 
subgraphs, 681—682 
terminology, 25-26 
total degree of, 235-236 
trails and, 677—692 
tree, 720, 727 
uses of, 677 
vertices, 24-25 
vertices with odd degree, 238 
walks in, 707-710 
weighted, 745-748 
Graph theory, origins of, 677—679 
Greatest common divisor (ged), 250-252, 253 
writing as linear combination, 532-533 
Greatest elements, 554-555 
Green, Ben Joseph, 232 
grep, 836 
Gries, David, 315 
Griggs, Jerrold, 394 
Guard, 245, 316, 319 
Guard condition, 320 
eventual falsity of, 320, 322 


Haken, Wolfgang, 34 
Half-adders, 97, 99 
Hall, Monty, 567 
Halting problem, 420—421 
Hamilton, Sir William, 689 
Hamiltonian circuits, 689-692 
definition of, 690 
vs. Euler circuits, 690 
showing graph does not have, 691-692 
traveling salesman problem and, 692 
Hamming, Richard W., 431—432 
Hamming distance function, 431—432 
Handshake theorem, 235-241 
Hardy, G. H., 218, 258, 524, 543 
Harmonic sums, 809-810 
Hash functions, 443—445 
computing values of, 444—445 
cryptographic, 445 
Hasse, Helmut, 551 
Hasse diagrams, 550-553 
Hausdorff, Felix, 10 
Hersh, Reuben, 211 
Hexadecimal notation, 103—104 
bination notation converting from/to, 105—106 
decimal notation converted from, 104—105 
High-level computer languages, 828-829 
Hilbert, David, 414, 843 
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Hofstadter, Douglas, 364, 367, 391, 393 
Horizontal axes, 760 
Hydrocarbon molecules, structure of, 723-724 
Hypertext transfer protocols (HTTPs), 28 
Hypothesis, 53, 54, 62, 66 

false, 54—55 


Idempotent laws, 394 
proof of, 418 
Identifiers 
equivalence classes of, 512 
Python, counting, 593-594 
relation on set of, 510 
Identities, set, 394-400 
Identity elements, A—1 
Identity function, 463—464 
Identity function on X, 429 
Identity laws, 394, 416 
Identity matrices, 706-707 
Identity relations, equivalence classes of, 512-515 
Tf, 162 
misuse of, 177—178 
If-then-else statements, 204, 245, 793-794 
If-then statements 
converting necessary condition to, 62 
converting only if to, 59—60 
converting sufficient condition to, 61 
execution of, 245 
negation of, 57 
representation as or, 56 
Implicit quantification, 116-117 
Inclusion/exclusion rule, 595-599 
Increasing functions, 765-766 
Independent events, 668—672 
disjoint events and, 668—669 
probability and, 669—670 
three events, 671 
Index, 259 
Indirect argument 
contradiction and contraposition, 218-225 
famous theorems, 228—233 
Indirect proof, when to use, 231-232 
Induction, 275 
See also Mathematical induction 
structural, 364, 367-372 
Inductive hypothesis, 279 
Inductive property, 317 
Inductive step, 278, 410 
Inequalities 
De Morgan’s laws and, 47 
notation for, 40 
proving, with mathematical induction, 293-295 
triangle, 207-208 
Inference, rules of, 69-72, 76 
Infinite relations, inverse of, 490 
Infinite sequences, 259 
Infinite sets, 473, 612 
Cantor diagonalization process and, 477—482 


cardinality and, 474—475 
countability of, 475—477 
one-to-one functions on, 441—443 
onto functions on, 447—449 
properties of relations on, 498-501 
uncountable, 476 
Infinitude of prime numbers, 230-231 
Infix notation, 831 
Informal language 
vs. formal, 112-113 
translating to formal, 134-136 
Initial conditions, for recurrence relations, 325 
Initial state, 843 
Initial term(s), 259 
finding explicit formula to fit, 260-261 
Injective (one-to-one) functions, 440—443 
Input alphabet, 843 
Input arrays, length of, 817 
Input/output table(s), 81 
for circuit, 83-84, 87-88 
for circuit with two input signals, 577-578 
Input signals, 81 
Insertion sort algorithm, 793-796 
Instantiation 
existential, 148, 168 
universal, 146-147, 150 
Integers, 8 
addition/subtraction in two’s complement form, 102—103 
binary notation for, 94—95 
binary representation of, 307-308 
composite, 163-164 
computer representation of, 99-101 
consecutive, 203 
divisibility of, 195-197 
divisible by 5, counting number of, 589-590 
even, 161—163, 186-187 
graphing functions defined on sets of, 763 
linear combination of, 532 
mod 4, 204-205 
odd, 161-163, 186-187 
odd minus even, 178-179 
pairwise relatively prime, 534 
parity of, 203-204 
positive divisor of positive, 191 
prime, 163-164 
properties of, 186-187 
property of the set of, 368-369 
quotient-remainder theorem and, 205 
relatively prime, 534-535 
representation of, 203-208 
selecting pair of, with certain sum, 607—608 
smallest positive, 135 
square of, 222-223 
square of odd, 205-207 
in standard factored form, 196 
unique factorization of integers theorem, 195-197 
well-ordering principle for, 258, 308-310 
Integral solutions of equation, 638—639 


INDEX 


Intel 4004, 80 
Internal vertex, 725-727 
Internet Protocol (IP) addresses, 594-595 
Intersection 
counting elements of, 597-599 
of sets, 381, 382, 383-384 
with subset, 400 
Interval notation, 382-383 
Intervals, 382-383 
Intractable problems, 824 
Invalid arguments. See Arguments, valid and invalid 
Invalidity, using diagrams to show, 152-153 
Invariant for graph isomorphism, 716 
Inverse 
of conditional statement, 58—59 
of relation, 488—490 
of universal conditional statements, 127-128 
Inverse error, 72, 73-74, 153, 155-156 
Inverse functions, 440, 454—457, 464—466 
Inverse modulo n, 534-536 
Inverter, 82 
Irrational numbers 
determining rational numbers vs., 183-184 
irrationality of square root of two, 228-230 
sum or rational and, 220—221 
Isbell, John, 34 
Isolated vertex, 25 
Isomers, 723 
Isomorphic invariants, 716 
Isomorphic structures, 867—868 
Isomorphisms 
definition of, 713 
of graphs, 713-718 
for simple graphs, 717-718 
Iterations 
counting, of loop, 578, 637-638 
formulas to simplify solutions obtained by, 345-347 
method of, 340-345 
number of, depending on floor function, 791-793 
solving recurrence relations by, 340-349 
Iterative statements, 245—246 


Java language, 201, 244, 836 
Job scheduling problem, 558-560 
Jumping to conclusions, 72 


Kant, Immanuel, 37, 742 
k-equivalence of states, 860—863 
Keys, binary search trees, 739 
Killian, Charles, 394 

Kirchhoff, Gustav, 723 

Kleene, Stephen C., 828, 851 
Kleene closure of L, 832 

Kleene closure of r, 832 

Kleene closure of È, 830 
Kleene’s theorem, 851—853 
Knuth, Donald, 169, 769, 787 
Kolmogorov, Andrei Nikolaevich, 655 
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Königsberg bridges puzzle, 677—679 
Kronecker, Leopold, 706 

Kruskal, Joseph, 745 

Kruskal’s algorithm, 745-748 
Kuratowski, Kazimierz, 10 


The Language of First-Order Logic (Barwise and Etchemendy), 


117-118 
Language(s) 


See also Computer languages; Formal language; Informal 


language 
accepted by automaton, 845-846 
ambiguous, 136-137 
context-free, 828-829, 854 
of first-order logic, 141 
formal vs. informal, 112—113 
new, from old, 832 
nonregular, 854—855 
regular, 854—855 
regular expression defining, 832—836 
translating from informal to formal, 134—136 
Laplace, Pierre-Simon, 568, 655, 662 
Law of exponents, 449 
Icm. See Least common multiple 
Leaf, 725-727 
Least common multiple (Icm), 256 
Least elements, 554—555 
finding, 308-309 
Least nonnegative residues modulo n, 527 
Least upper bound axiom, A-3 
Left child, 734 
Leibniz, Gottfried Wilhelm, 37, 151 
Lemma, 207 
Euclid’s, 539-540 
Length 
of string, 369-370 
of walk, 707-710 
“Less than or equal to” relations, 549 
Less-than relations, 487, 499 
Lexicographic order, 549-553 
Linear combinations 
of relatively prime integers, 535 
writing greatest common divisor as, 532-533 
Linear probe, 444 
Linguistics, 723, 828 
Lists, counting elements of, 568—570 
Lobachevsky, Nicolai Ivanovitch, 546 
Local call string, 857 
Logarithmic functions, 800 
with base b, 430—431 
graphs of, 801-803 
orders involving, 807-808 
relations between exponential and, 449—451 
Logarithmic orders, 806—811 
Logarithms 
common, 451 
computing, with base 2 on calculator, 451 
natural, 451 
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of numbers between two consecutive powers of 2, 802-803 


properties of, 450 
recurrence relations solved with, 804—806 
Logic, 828 
of compound statements, 37—107 
De Morgan’s laws of, 45—48 
digital logic circuits, 79—90 
form of argument, 37-38 
language of first-order, 141 
of quantified statements, 108—152 
symbolic, 37 
Logical equivalence, 43—48 
conditional statements and, 55—56, 62 
contradictions, 49 
De Morgan’s laws and, 45—48 
double negative property and, 44—45 
nonequivalence and, 45 
for quantified statements, 122-123, 125-126 
set properties and, 414—415 
statements/statement forms and, 43—44, 50 
summary of, 49-50 
tautologies, 49 
Logical form, of arguments, 37-38 
Logical operators, order of operations for, 60 
Loop invariants, 316-319 
Loop invariant theorem, 317 
Loops, 25 
correctness of, to compute product, 318-319 
counting iterations of, 637—638 
dummy variable in, 270-271 
nested, 578, 790-791 
for-next, 245-248, 270-271, 793-794 
number of iterations of, 578 
pre-conditions and post-conditions, 316-317, 319 
while, 245-247, 316, 793-794 
Lottery, expected value of, 658—659 
Lovelace, Ada Augusta, Countess of, 244, 813 
Lucas, Edouard, 328 
Lukasiewicz, Jan, 831 
Lynch, John, 180n 


Mach, Ernst, 487 

Main diagonal, of matrix, 698-699 

Major premise, 68 

Manin, I., 275 

Maps, coloring, using graphs, 32-34 

Mastering Regular Expressions (Friedl), 851n 
Mathematical Analysis of Logic, The (Boole), 415 


Mathematical Experience, The (Davis and Hersh), 211 


Mathematical induction, 258, 275—297 
See also Well-ordering principle 
additional formals, deducing, 285-286 
applications, 289-297 
checking correctness of formula by, 347-348 
definition of, 275-277 
divisibility property proven with, 291-293 
equality proven with, 284—285 
finding mistakes using, 349 


geometric sequences, formula for, 282-284, 285-286 
inequality proven with, 293-295 
method of proof by, 278 
principle of, 277—278 
property of a sequence proven with, 295-296 
proving formulas, 275-286 
strong, 301-308 
summation of first n integers, formula for, 278—282 
trominoes and, 296-297 
Mathematical structures, 867—868 
Matrix(ces) 
2 x 2, 705-706 
adjacency, 699-700, 701 
connected components and, 702-703 
definition of, 698 
directed graphs and, 699-701 
graph representations of, 698-710 
identity, 706-707 
main diagonal of, 698—699 
multiplication, 703-707 
powers of, 707 
products of, 704-705 
square, 698 
symmetric, 701-702 
terminology, 699 
undirected graphs and, 701-702 
Maurolico, Francesco, 277 
Maximal elements, 554-555 
McCarthy, John, 372 
McCarthy’s 91 function, 372 
McCulloch, Warren S., 828 
Memory dump, reading, 106 
Merge sort algorithm, 214, 820—824 
Mersenne, Marin, 232 
Mersenne primes, 232 
Message authentication, 542-543 
Method 
collision resolution, 444 
complete enumeration, 618 
of direct proof, 167 
of exhaustion, 111, 166 
of generalizing from the generic particular, 166—167, 184—185 
of iteration, 340-345 
of proof by contradiction, 218-221 
of proof by contraposition, 222-224 
of proof by mathematical induction, 278 
Microprocessor, 80 
Milhailescu, Preda, 232 
Mill, John Stuart, 146 
Minimal elements, 554-555 
Minimum spanning trees, 744-751 
Minor premise, 68 
mod/modulo, 201-205 
computing, 216 
congruence modulo 2 relation, 488 
congruence modulo 3 relation, 500-501, 516-517 
congruence modulo n, 516-518 
inverse modulo n, 534—536 
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Modular arithmetic, 528—531 
computing product modulo n, 530 
getting started with, 529 

Modular equivalence, 526-527 
Modus ponens, 68—69, 76 

universal instantiation and, 147-149 
Modus tollens, 68—69, 76 

universal, 149-150 

Monty Hall problem, 567-568 
Multigraphs, 681 

Multiple 

of function, 764 

least common, 256 
Multiple-input AND-gate, 86-87 
Multiple-input OR-gate, 86 
Multiplication, A-1 

matrix, 703-707 

Multiplication rule, 574—584 
difficult or impossible to apply, 578—579 
subtle use of, 579 

Multiplicative identity, 235, 706 
Mutually disjoint sets, 385 


NAND-gates, 89-90 
Napier, John, 801 
n-ary relations, 487, 491—492 
National Security Agency, 524 
Natural logarithms, 451 
Natural numbers, 7—8 
Naur, Peter, 722 
“n choose r” notation, 269—270 
Necessary conditions, 61—62, 128—129 
converting to if-then form, 62 
interpreting, 61 
Negation, 40 
of and and or, 45—46 
of conditional statement, 56—57 
De Morgan’s laws, 45-48 
double negative property and, 44—45 
of if-then statements, 57 
of quantified statements, 122-124 
of statements with more than one quantifier, 
137-138 
of universal conditional statements, 124—125 
of universal statements, 122 
Negative numbers, A-3 
Neither-nor, 39—40 
Nelson, Edward, 34 
Nested loops 
algorithm with, 790-791 
number of iterations of, 578 
.NET computer language, 836 
Neurons, 828 
Newton, Isaac, 151 
Next-state function, 843 
Next-state tables, 842, 843, 844-845 
n! factorial 
Node, 681 
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Noncomparable elements, 553 
Noncomputable functions, 483 
Nonconstructive proof of existence, 164 
Nondeterministic finite-state automata, 853 
Nondivisibility, checking for, 192 
Nonequivalence, showing, 45 
Nonfunctions, 427 
Nonisomorphic trees, 727-729 
Nonoverlapping sets, 385 
Nonregular languages, 854—855 
Non-trees, 721 
NOR-gates, 89-90 
Notation 
for algorithms, 248 
Backus-Naur, 722, 829 
big-O, big-omega, and big-theta, 769-783 
binary, 93—96 
decimal, 93 
exponential, 93 
factorial, 268—270 
formal logical, 139-141 
function, 19 
hexadecimal, 103-106 
inequalities, 40 
infix, 831 
interval, 382-383 
“n choose r,’ 269-270 
octal, 107 
Polish, 831-832 
postfix, 831 
prefix, 831 
product, 264 
set-builder, 8—9 
set-roster, 7-8 
summation, 261-267 
for walks, 680 
NOT-gate, 82 
NP-complete problems, 824 
n-tuples, 432, 492 
ordered, 11—12 
Null set, 384 
Null strings, 13, 366 
Numbers. See Integers; Rational numbers; Real numbers, 
natural, 7—8 
Number systems, 93—96 
Number theory, 190 
cryptography and, 543 
divisibility, 190-197 
Euclid’s lemma and, 539-540 
floor and ceiling, 211-216 
open questions in, 232-233 
properties of integers, 186-187 
properties of rational numbers, 185—186 
quotient-remainder theorem, 201-205 
Numerical expressions, 341 


Octal notation, 107 
Odd integers, 161-163 
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properties of, 186—187 
square of, 205-207 
Of order at least g, 769, 771, 773 
Of order at most g, 770, 771, 773 
Of order g, 770, 771, 773 
Of order g(n), 789 
Omega-notation, 769-783 
description of, 769-771 
polynomial function orders and, 773-774 
for polynomial with negative coefficients, 774-777 
reciprocal relationship between O-notation and, 782 
translating to, 772 
One-dimensional arrays, 270 
counting elements of, 569-570 
One-to-one correspondences, 451—454, 473 
One-to-one functions, 439—445 
composition of, 466—468 
finite sets and, 613—614 
finite sets defining, 441 
on infinite sets, 441—443 
proving or disproving, 442—443 
One-way functions, 445 
Only-if, 162 
Only if statements, 128-129 
and the biconditional, 59-61 
converting to if-then, 59-60 
O-notation, 769-783 
caution about, 778-779 
description of, 769-771 
polynomial function orders and, 773-774 
for polynomial with negative coefficients, 774-777 


reciprocal relationship between omega-notation and, 782 
showing one function is not big-O of other function, 779-780 


translating to, 772 
Onto functions, 445—449 
composition of, 468—470 
finite sets and, 613—614 
finite sets defining, 446—447 
incorrect proof, 470 
on infinite sets, 447—449 
proving or disproving, 447—449 
Open sentences, 108 
Operations 
order of, 39, 55, 60 
on sets, 381-384 
Order, of quantifiers, 138-139 
Order axioms, A—2 
Ordered n-tuples, 11-12 
Ordered pairs, 10—11, 760 
equivalence clases of relation given as set of, 511 
Ordered selections, 618 
Ordered triples, 11 
Order of operations, 39, 55 
for logical operators, 60 
Order of precedence, 833 
Ordinal numbers, 473 
OR-gate, 82 
multiple-input, 86 


Origin, 8, 760 

Or statements, 40, 41 
ambiguity and, 41 
negation of, 45—48 

Output signals, 81 


Pairwise disjoint sets, 385 
Pairwise relatively prime integers, 534 
Palindrome, 830 
Parallel, switches in, 80 
Parallel adders, 99 
Parallel edges, 25 
Parent, 733, 734 
Parentheses, 733 
property of the set of, 369 
recursive definition of, 365-366 
Parity, of integers, 203-204 
Parse trees, 722-723 
Partially ordered sets, 553-555 
Partial order relations, 546—560 
applications, 557-560 
definition of, 548 
Hasse diagrams for, 550-553 
lexicographic order, 549-553 
partially and totally ordered sets, 553-555 
PERT and CPM for, 558-560 
subset of, 548 
topological sorting, 555-557 
Partitions 
relation induced by, 506-508, 514-515 
of sets, 384-386 
Pascal, Blaise, 183, 277, 568, 643 
Pascal’s formula, 642—646 
new formula derivation from, 646 
Pascal’s triangle, 644—645 
Passwords, with 3-5 letters, 592—593 
Paths, 677—692 
Peano, Giuseppe, 382, 520 
Peirce, Charles Sanders, 16, 89, 110 
Peirce arrow, 89—90 
Permutations, 580-584 
definition of, 580 
of letters in word, 580—581 
of objects around circle, 581 
relation between combinations and, 618—620 
r-permutations, 581-583 
of selected letters of word, 583-584 
of selected objects, 581-584 
with sets of indistinguishable objects, 629 
of set with repeated elements, 628—629 
Personal identification numbers (PINs) 
counting, 575-576 
with repeated symbols, counting, 590-591 
PERT (Program Evaluation and Review Technique), 
558-560 
Piaget, Jean, 53 
Pigeonhole principle, 604—614 
applications of, 605-612 


Copyright 2020 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). 
Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Cengage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. 


INDEX 


generalized principle of, 610-612 
proof of, 612-614 
PINs. See Personal identification numbers 
Pitts, Walter, 828 
Plaintext, 524 
Poker hand problems, 626—627 
Polish notation, 831—832 
Polya, George, 6 
Polynomial functions 
big-theta for, 777—778 
with negative coefficients, 774-777 
orders of, 773-783 
Polynomial-time algorithms, 824 
Polyomino, 296 
Positive closure of È, 830 
Positive integers, “divides” relation on set of, 548-549 
Positive real numbers, A—2 
Possibility trees, 573-584 
Post, Emil, 828 
Post-conditions 
algorithm, 315-316 
correctness of, 317, 319, 320-321, 322-323 
loop, 316-317 
Postfix notation, 831 
Power functions, 761—762 
orders of, 773 
Powers 
of adjacency matrix, 699-700 
of matrix, 707 
Power set axiom, 386 
Power sets, 386 
function defined on, 429—430 
function from, to set of strings, 452 
number of elements in, 369 
relations on, 488 
Pre-conditions 
algorithm, 315-316 
loop, 316-317 
Predicate calculus, 108 
Predicates 
definition of, 108—109 
finding truth values of, 109 
quantified statements and, 108-129 
Prefix notation, 831 
Premises, 37, 66 
ambiguous, 72 
false, 74 
major, 68 
minor, 68 
truth set of, 109 
Prime integers, 163-164 
Prime numbers, 163-164 
divisibility and, 192 
divisibility by, 194, 302-303 
Fermat, 232 
infinite number of, 230-231 
Mersenne, 232 
twin primes conjecture, 232 


1-15 


1-16 INDEX 


Prim’s algorithm, 748-751 
Principle of mathematical induction, 277-278 
Probability 
Bayes’ theorem and, 666—668 
binomial, 672 
of complement of event, 656 
of complement of event, formula for, 591 
conditional, 662—665 
counting and, 564-570 
for deck of cards, 566—567 
dice roll, 567 
equally likely probability formula, 566 
expected value and, 658—660 
of general union of two events, 657—658 
independent events and, 668—672 
introduction to, 564-570 
Monty Hall problem, 567-568 
pigeonhole principle and, 604-614 
possibility trees and multiplication rule, 573-584 
Probability axioms, 655—660 
Probability function, 655 
Problems for the Quickening of the Mind, 36 
Problem solving 
with recursion, 328 
strategy, 409 
using proof, 224-225 
Processors, 842—843 
Productions, 722 
Product modulo n, 530 
Product notation, 264 
Products, A-1 
Cartesian, 10-13, 430, 576-577 
computing, 264, 305 
correctness of loop to compute, 318-319 
dot, 703 
of matrix, 704-705 
properties of, 265 
recursive definition of, 335-336 
scalar, 703 
Program Evaluation and Review Technique (PERT), 558-560 
Programming languages. See Computer languages 
Prolog, 141-142 
Proof(s) 
See also Algebraic proof; Direct proof; Disproof 
algebraic, of set identities, 410—412 
of binomial theorem, 648—651 
common mistakes, 175-178 
for conditional statements, 402 
conjecture and, 179-180 
by contradiction, 218—221, 223-224 
by contraposition, 222-224 
defining, 160-161 
from definitions, 186 
of De Morrgan’s law for sets, 398—400 
of distributive law, 395-398 
of divisibility, 291-293, 302-303 
by division into cases, 71, 76, 204-205 
of double complement law, 417—418 
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of equality, 284-285 
of existential statements, 164 
identifying mistakes in, 179 
indirect, when to use, 231—232 
of inequality, 293-295 
by mathematical induction, 278 
method of exhaustion, 166 
of number of multiplications needed to multiply n numbers, 
305-308 
of pigeonhole principle, 612—614 
as problem-solving tool, 224-225 
for properties of divisibility, 192-194 
of properties of rational numbers, 185-186 
of property of floor, 213-216 
of property of sequence, 295-296, 303-304 
of reflexivity, 500 
of square of odd integer, 205-207 
starting, 170-171 
of transitivity, 501 
of universal statements, 165—170, 173-175 
use of universal modus ponens in, 148—149 
variations among, 175 
writing advice, 173-180 
Proper subsets, 9, 378 
Propositional calculus, 108 
Propositional forms, 42 
Propositional functions, 108 
Propositions, 222-223 
Pseudocode, 244 
Public key, 537 
Public-key cryptography, 525-526 
Pythagoras, 228 
Pythagoren theorem, 228 
Python language, 201, 244, 593-594, 836 


QED, 836 
Quantified statements, 108—152 
arguments with, 146-156 
formal vs. informal language and, 112-113 
implicit, 116-117 
logical equivalence for, 122-123, 125-126 
negations of, 122—124 
predicates and, 108-129 
proving validity of arguments with, 150 
Quantifiers 
bound variables and scope, 115—116 
definition of, 109-110 
existential, 111-112 
order of, 138—139 
statements with multiple, 131-142 
trailing, 113 
universal, 109-111 
Quaternary relations, 491 
Quine, Willard Van Orman, 55 
Quotient automaton, 859, 863—866 
Quotient-remainder theorem, 200—201, 203, 204-205, 215, 
271, 527 
well-ordering principle and, 309-310 


Ralston, Anthony, 289 
Random process, 565 
Range, 426 
Rational numbers 
definition for, 519—520 
definition of, 183 
determining irrational numbers vs., 183—184 
direct proof and counterexample with, 183-187 
double of, 187 
as equivalence classes, 519-520 
properties of, 185-186 
set of all positive, 477—478 
sum of irrational and, 220-221 
sum or rationals is rational, 185—186 
r-combinations, with repetition allowed, 634—639 
Real numbers 
cardinality of set of all, 481-482 
floor and ceiling of, 211-216 
less-than relations for, 487 
negative, A-3 
no smallest positive, 135 
positive, A-2 
properties of, A-1, A-2, A-3 
set of, as uncountable, 478—480 
Real-valued functions of real variable, 760-766 
Reasoning, circular, 72 
Reciprocals, A—1 
Recognizer, 85 
Recurrence relations, 325 
characteristic equation of, 354-356 
for sequence, 817—820 
initial conditions for, 325 
iterations solving, 340-349 
logarithms solving, 804—806 


second-order linear homogeneous, with constant coeffi- 


cients, 352-362 
sequences satisfying, 326-327 
solution to, 340 


using mathematical induction to verify correctness of solu- 


tion to, 347-348 
writing in more than one way, 326 
Recursion 
compound interest and, 333-335, 
computing number of edges of K , using, 346-347 
Fibonacci numbers and, 332-333 
recursively defined sequences, 325-337 
Tower of Hanoi and, 328-331 
Recursive definition 
of Boolean expressions, 365 
general, 364-372 
of parenthesis structures, 365-366 
of product, 335-336 
of sets, 365-372 
of sets, structural inductions, 367-368 
for sets of strings, 366-367 
of sum, 335-336 
Recursive functions, 372-374 
Recursive leap of faith, 328 
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Recursive paradigm, 328 
Reductio ad absurdum, 218 
Reductio ad impossible, 218 
Reflexive property of cardinality, 473 
Reflexivity, 495-502 
proof of, 500 
Regular expressions, 829 
for date, 838 
definition of, 832 
finite-state automata and, 851-853 
language defined by, 832—836 
order of precedence for operations in, 833 
practical uses of, 836-838 
symbols in, 833 
Regular languages, 828—829, 854-855 
Relational databases, 491—492 
Relations 
See also “Divides” relations; Equivalence relations; 
Recurrence relations 
antisymmetry property of, 546-548 
arrow diagram of, 18 
binary, 487, 491 
circle, 17 
congruence modulo 2, 488 
congruence modulo 3, 500-501, 516-517 
congruence modulo n, 516-518 
directed graph of, 490-491, 495 
of equality, 498—499 
equivalence, 505-520 
finite, 489, 547 
on finite sets, 19-20 
finite sets and, properties of, 496—498 
identity, equivalence classes of, 512-515 
infinite, 490 
infinite sets and, properties of, 498-501 
inverse of, 488—490 
language of, 15-18 
less-than, 499 
less than, for real numbers, 487 
“less than or equal to,” 549 
n-ary, 487, 491—492 
partial order, 546-560 
partition inducing, 506-508, 514-515 
on power set, 488 
quaternary, 491 
recurrence, 804—806 
reflexivity, symmetry, and transitivity and, 495-502 
on set of identifiers, 510 
on set of subsets, 511-512 
on sets, 487—492 
as subsets, 16—17 
ternary, 491 
total order, 553 
transitive closure of, 502 
Relative complement, 381 
Relatively prime integers, 534—535 
Repeating decimals, 184 
Repetition, r-combinations with, 634—639 
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Residues modulo n, 527 
Reverse Polish notation, 831 
Ribet, Kenneth, 180 

Right child, 734 

Right subtree, 734 

Ritchie, Dennis, 829n 
Rivest, Ronald, 525—526 
Rooted trees, 732-741 
Roussel, P., 141 

Rows, multiplying, 703-704 
r-permutations, 581-583 
RSA cipher, 525-526 

RSA cryptography, 536-543 
Rules of inference, 69-72, 76 
Russell, Bertrand, 301, 340, 419, 800 
Russell’s paradox, 419—420 


Sample space, 565-566 
Saturated hydrocarbon molecules, 723-724 
Savage, Carla, 394 
Sawyer, W. W., 677 
Scalar product, 703 
Schroeder-Bernstein theorem, 486 
Scope, bound variables and, 115-116 
Search algorithm 
binary, 214, 813-825 
sequential, 787-788 


Second-order linear homogeneous recurrence relations, with 


constant coefficients, 352-362 
Semantics, 723 
Sequences, 258-273 
See also Recursion 
alternating, 260 
arithmetic, 342-343 
of Catalan numbers, 327 
in computer programming, 270-271 
defining recursively, 325-337 
definition of, 259 
definition of limit of, 135-136 
explicit formula for, 259-261 
factorial notation and, 268—270 
Fibonacci, 358-360 
finite, 270 
floor function and, 304-305 
functions and relations on, 429 
general formula for, 259 
geometric, 282-284, 285-286, 343-345 
infinite, 259 
product notation and, 264 
proving properties of, 295-296, 303-304 
recurrence relation for, 817—820 
summation notation and, 261-267 
terms in, 259 
Tower of Hanoi, 328-331, 345-346 
Venn diagrams and, 380-381 
Sequential circuits, 83, 841 
Sequential search algorithm, 787—788 
Set difference law, 395 
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Set difference property, deriving, 411 

Set identities, 394-400 
algebraic proofs of, 410—412 
counterexample for, 407—409 
deriving, 411—412 

Set notation 
describing language defined by regular expression, 

833-834 

set-builder notation, 8—9 
set-roster notation, 7—8 

Set(s) 
algorithm for checking for subsets of, 386-387 
axiom of extension, 7 
Boolean algebra and, 414—418 
Cartesian products and, 10-13 
complements of, 381, 382 
countable, 475—477 
definitions, procedural versions, 392 
De Morgan’s law for, 398—400 
differences of, 381, 382 
disjoint, 384-385 
disjoint, counting elements of, 589-599 
disproving alleged property, 407—409 
distributive law for, 395-398 
empty, 384, 401-403 
equality, 379-380 
finite, 473, 612-614 
finite, composition of functions defined on, 463 
finite, functions and relations on, 19—20, 441, 446—447 
finite, properties of relations on, 496—498 
functions acting on, 434—435 
functions defined on general, 425—435 
of identifiers, 510 
inclusion/exclusion rule, 595-599 
indexed collection of, 383 
infinite, 473, 474—475, 612 
infinite, one-to-one functions on, 441—443 
infinite, onto functions on, 447—449 
infinite, properties of relations on, 498—501 
of integers, 368-369 
of integers, graphing functions defined on, 763 
intersection of, 381, 382, 383-384 
language of, 6-15 
mutually disjoint, 385 
nonoverlapping, 385 
null, 384 
of numbers, relations among, 381 
operations on, 381-384 
ordered pairs and, 10—11 
pairwise disjoint, 385 
of parentheses, 369 
partially and totally ordered, 553-555 
partitions of, 384-386, 506-508 
power, 386 
power sets of, 386 
properties of, 367-372, 391—403 
recursively defined, 365-372 
relations and, 15—20, 487—492 


Russell’s paradox and, 419—420 
of strings, 452 
of strings, functions and relations on, 20 
subsets, 9-10, 378-379, 617—630 
subsets of, number of, 409—410 
uncountable, 477—481 
union of, 381, 382, 383-384 
universal, 381 
Venn diagrams and, 394 
Set theory, 377—424 
Seven Bridges of Königsberg, 677—678 
Shakespeare, William, 122 
Shamir, Adi, 525-526 
Shannon, Claude, 79, 828 
Sheffer, H. M., 89 
Sheffer stroke, 89—90 
Shortest path algorithm, 742-757 
Sieve of Eratosthenes, 227 
Sigma, 829-830 
Signed integers, computer representation of, 99-101 
Simple circuit, 679-680 
Simple graphs, 239-240 
isomorphisms of, 717-718 
Simple vending machine, 841—842 
Singh, Simon, 1807 
Single-root theorem, 360-362 
Smullyan, Raymond, 75 
Software, simulating finite-state automaton using, 849-851 
Solution, to recurrence relation, 340 
Sorting 
insertion algorithm for, 793-796 
merge sort algorithm for, 214, 820-824 
topological, 555-557 
Sound arguments, 74, 150 
Space efficiency of algorithm, 788, 825 
Spanning trees, 742-757 
definition of, 743 
Dijkstra’s shortest path algorithm and, 751-757 
Kruskal’s algorithm and, 745-748 
minimum, 744-751 
Prim’s algorithm and, 748-751 
Specializations, 70, 76 
Square, of odd integer, 205-207 
Square matrix, 698 
Square roots, irrationality of square root of two, 228-230 
Squaring function, 21 
Standard factored form, 196 
Star *-equivalence classes, 863 
Statement calculus, 108 
Statement forms, 42 
Statements. See also specific statements 
combinations of, 3—5 
conditional, 2 
definition of, 38-39 
existential, 2 
formal logical notation and, 139-141 
logically equivalent, 44 
with multiple quantifiers, 131-142 
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negation of, with more than one quantifier, 137-138 
translating from informal to formal language, 134—136 
types of, 2 
universal, 2, 110 
vacuously true, 126 
States, 843 
(state-) transition diagram, 843—844 
Stevin, Simon, 478 
String-reversing function, 452—453 
Strings, 13 
additive property of length, 370 
area code, 857 
bit, 13 
characters as, 366 
characters of, 13 
concatenation, 20, 366, 370-372 
in language defined by regular expressions, 834—835 
length of, 369-370 
local call, 857 
null, 13, 366 
properties of, 367 
recursive definition of sets of, 366-367 
set of, function from power set to, 452 
sets of, functions and relations on, 20 
testing for lexicographic order, 550 
Strong mathematical induction, 301-308 
applying, 302-308 
computation of products with, 305 
divisibility by a prime proven by, 302-303 
number of multiplications needed to multiply n numbers 
proven with, 305-308 
principle of, 301-302 
property of a sequence proven with, 303-304 
Structural induction, 364, 367-372 
Subgraphs, 681-682 
Sublist, counting elements of, 569 
Subroutines, 248 
Subscript, 259 
“Subset” relation, 548 
Subsets, 9-10 
algorithm for checking for, 386-387 
chain of, 554 
counting, 617—630 
intersection of, 400 
number of, of set, 409—410 
proof and disproof, 378-379 
proper, 9, 378 
proving relations, 392-394 
relations, 378—379, 391-394 
relations as, 16—17 
testing, 378 
union of, 400 
Subtraction 
in binary notation, 96 
integers, in two’s complement form, 102-103 
odd minus even integer, 178—179 
Subtrees, 734 
Successor function, 21 
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Sufficient conditions, 61—62, 128-129 
converting to if-then form, 61 
interpreting, 61 

Sum 
harmonic, 809-810 
of rational and irrational number, 220—221 
of rational numbers is rational, 185—186 
recursive definition of, 335-336 
simplifying, using binomial theorem, 652 
telescoping, 264 

Summands, 182 

Summations 
change of variable and, 266-267 
in closed form, 281 
computing, 261 
evaluating, 263 
expanded form of, 261-262 
of first n integers, formula for, 278—282 
geometric sequences, formula for, 282-284, 285-286 
notation of, 261—267 
properties of, 265 
telescoping sums, 264 
upper limit of, 261, 267 

Sum-of-products form, 88 

Surjective (onto) functions, 445-449 

Swift, Jonathan, 325 

Switches 
in parallel, 80 
in series, 80 

Syllogisms, 68—69 

Symbolic logic, 37 

Symmetric matrix, 701-702 

Symmetric property of cardinality, 473 

Symmetry, 495-502 

Syntactic derivation trees, 722-723 

Syntax, 723 

Syntax errors, 314 


Tables 
input/output, 81, 87-88 
next-state, 842, 843, 844-845 
trace, 246-247 
truth, 41—43 
Taniyama-Shimura conjecture, 180 
Tao, Terence Chi-Shen, 232 
Tarski, Alfred, 118 
Tarski’s World, 117-118, 132-133 
evaluating argument for, 154—155 
formalizing statements in, 139-140 
negating statements in, 137-138 
quantifier order in, 139 
Tautological statements, 48—49 
Tautologies 
definition of, 48 
logical equivalence and, 49 
Taylor, Richard, 180 
Teams 
calculating number of, 620 
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with members of two types, 623—625 
that contain both or neither, 620—621 
that do not contain both, 621—623 
Telescoping sums, 264 
Terminal vertex, 725 
Terms, in sequences, 259 
Ternary relations, 491 
Theorem. See also specific theorems 
definition of, 169 
direct proof of, 168—170 
There exists statement, 125 
Theta-notation, 769-783 
deducing, 772 
description of, 769-771 
for polynomial function, 777-778 
polynomial orders and, 780 
reflexive, symmetric, and transitive properties of, 782 
translating to, 771-772 
Thinking Machines Corporation, 180 
Thompson, Kenneth, 829n 
Thoreau, Henry David, 858 
3n + 1 problem, 374 
3x + 1 problem, 374 
Time efficiency, of algorithm, 788-789, 825 
Topological sorting, 555-557 
Totally ordered sets, 553-555 
Total weight, 745 
Tournament play, possibilities for, 573-574 
Tower of Hanoi, 328-331, 345-346, 824 
Trace tables, 246-248 
for division algorithm, 249-250 
for insertion sort, 793-794 
Tractable problems, 824 
Trailing quantifiers, 113 
Trails, 679-681 
Euler, 688—689 
Transition diagrams, 843-844 
Transitive closure, of relation, 502 
Transitive law, A-3 
Transitive property of cardinality, 474 
Transitivity, 71, 76 
of divisibility, 192-194 
proof of, 501 
relations and, 495—502 
Traveling salesman problem, 692 
Tree(s), 720-759 
binary, 734-739 
binary search, 739-741 
characterizing, 724-730 
conditional probabilities represented with, 663—665 
decision, 721-722 
Dijkstra’s shortest path algorithm and, 751-757 
examples of, 720-724 
graph, 720, 727 
Kruskal’s algorithm and, 745-748 
nonisomorphic, 727-729 
non-trees and, 721 
parse, 722-723 


Prim’s algorithm and, 748-751 
rooted, 732-741 
satisfying given conditions, 727 
spanning, 742-757 
subtrees, 734 
syntactic derivation, 722—723 
theorems about, 725-727, 729-730 
vertices and, 724—727 
Trefethen, Lloyd, 566 
Trefethen, Nick, 566 
Triangle inequality, 207-208 
Trichotomy law, A-3 
Triples, counting, 637 
Trominoes, 296-297 
True by default, 126 
Truth set, of predicates, 109 
Truth tables 
for biconditional, 60 
for compound statements, 42—43 
for conditional statements, 55 
for conjunction, 41 
for disjunction, 42 
for exclusive or, 42—43 
for negation, 40—41 
Truth values, 40—42 
of predicates, 109 
Tucker, Alan, 634 
Tukey, John W., 80-81 
Turing, Alan M., 420, 787, 828, 843 
Turing machine, 828 
Twin primes conjecture, 232 
Two-dimensional Cartesian coordinate system, 760 
Two’s complements 
8-bit, 99-103 
addition/subtraction with integers and, 102-103 


computer representation of signed integers and, 99-101 


finding, 100-101 
finding number with given, 101 
for negative integer, 100 


Uncountable sets, 477—481 

Undirected graphs, matrices and, 701-702 

Union 
counting elements of general, 596-597 
interaction of function with, 434—435 
of mutually disjoint subsets, 385 
of sets, 381, 382, 383-384 
with subset, 400 
of two events, probability of, 657—658 

Unique factorization of integers theorem, 195-197, 251 
Euclid’s lemma and, 539-540 

Uniqueness of the complement law, 417 

Universal bound laws, 395, 416 

Universal conditional statements, 113—114 
contrapositive, converse, and inverse of, 127—128 
definition of, 3 
necessary and sufficient conditions and, 128 
negations of, 124-125 
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only if and, 128-129 
rewriting, 3 
variants of, 126-128 
writing informally/formally, 113—114 
Universal existential statements 
definition of, 3—4 
rewriting, 4 
Universal generalization, 166 
Universal instantiation, 146-147, 150 
Universal modus ponens, 147 
drawing conclusions using, 148 
recognizing, 147-148 
use of, in a proof, 148-149 
Universal modus tollens, 149 
drawing conclusions using, 150 
recognizing, 149 
Universal quantifiers, 109-111 
Universal set, 381 
Universal statements, 2 
definition of, 110 
disproving by counterexample, 164—165 
equivalent forms of, 114—115 
existential, 3—5 
negation of, 122 
proofs of, 173-175 
proving, 165-170 
truth and falsity of, 110-111 
vacuous truth of, 126 
Universe of discourse, 381 
UNIX, 829, 836 
Unordered selections, 618 
Unsound arguments, 74 


Vacuously true statements, 54, 126 
Valid arguments. See Arguments, valid and invalid 
Validity, using diagrams to test for, 151-155 
Vandermonde, Alexander, 654 
Vandermonde convolution, 654 
Variables 

in algorithmic language, 244 

Boolean, 84 

bound, 115-116 

change of, 265-267 

in computer languages, 244 

data type of, 244 

dummy, 266, 270-271 

function of two, 453—454 

global, 176 

uses of, 1—2 

writing sentences using, 2 
Vending machines, 841—842 
Venn, John, 380 
Venn diagrams, 380-381, 394 
Vertex (vertices), 24-25, 681 

adjacent, 25 

branch, 725 

degree of, 31-32 
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Vertex (vertices) (Continued ) 
isolated, 25 
terminal, 725 

Vertical axes, 760 

Volterra, Vito, 425 


Walks, 680-681 
definition of, 679-680 
of length N, counting, 707-710 
notation for, 680 
Weighted graphs, 745-748 
Well-defined functions, 433—434 
Well-ordering principle, 258, 277 
for integers, 308-310 
Weyl, Hermann, 720 
Wheeler, Anna Pell, 200, 439, 573 
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While loops, 245-246, 316, 793-794 
guard of, 320 
tracing execution of, 246-247 
Whitehead, Alfred North, 24, 461, 732 
Wiener, Norbert, 10, 841 
Wiles, Andrew, 180 
World Wide Web, 28-29 
Worst-case orders, 824 
for sequential search, 787-788 


XML, 829 


Zero, divisors of, 190—191 
Zero factorial (0!), 269 
Zero product property, 184 
Zhang, Yitang, 232 


